[support] Learning More on Apache Solr Attachments

Kamal Palei palei.kamal at gmail.com
Thu Oct 18 04:19:45 UTC 2012


Thanks Cindy-Sue, Nestor, Nancy for your input.

There are two aspects here.

*1. Modifying/Tuning apache solr
2. Getting Drupal to work precisely what we need
*
I know, the first one I have reference books, and confident going forward I
will be able to manage it or if not I will put my queries in apache user
mailing list as pointed by Cindy-Sue.

But for 2nd option "Getting Drupal to work precisely what we need" ,
probably we need to work out or put the queries to Drupal experts.

On a overall note, I am able to index the attached document (.txt, .doc,
.docx, .pdf all types that I need).
But I am not liking the way I am doing it, as I create a node and attach
the document programatically.
Other option is created a node, per node attach huge number of document
(say 50,000 documents), but not sure what all side impacts and performance
issues will follow.  I am doing my best to find a way to do the indexing of
files without creating a node. Looks media module apachesolr_file module
may help, but could not get it working.

And, if you go to next level say, in hook _apachesolr_process_results just
print the results , you will there are huge informations associated with
result and the most important information (file belongs to which user) is
not available.

I believe we can achieve these things using hooks. My findings till today
are
1. While sending the document to solr for indexing by using removeParm
function in an appropriate hook, remove the informations we do not need.
2. While querying to get the files containing few words, that time probably
in query alter hook, tweak the query in such a way that, the result will
contain the precise informations we need.

Still a long way to go for me to get the things in order.

Best Regards
Austin

On Thu, Oct 18, 2012 at 3:35 AM, Ms. Nancy Wichmann
<nan_wich at bellsouth.net>wrote:

> We just installed Solr on two different sites. On one it is indexing the
> attachments, on the other it misses most. We have not totally tracked it
> down, but it seems to have something to do with the method of attachment.
>
> *Nancy*
> Injustice anywhere is a threat to justice everywhere. -- Dr. Martin L.
> King, Jr.
>
>   ------------------------------
>
> Austin Einter wrote:
>
> Looks whole list is silent in this regard (apache solr file attachment).
> If you can share whatever idea you have on below queries, it will help me
> to great extent.
>
>
> --
> [ Drupal support list | http://lists.drupal.org/ ]
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.drupal.org/pipermail/support/attachments/20121018/140f9593/attachment.html 


More information about the support mailing list