[development] Learning More on Apache Solr Attachments

Austin Einter austin.einter at gmail.com
Tue Oct 16 00:52:10 UTC 2012


Dear All
I want to learn more on apache solr file attachments.
My aim is to modify / add code in a way, that will suit my requirements.

I have been struggling for quite sometime

1. How to index a file
2. How to store minimal data during indexing and get minimal data during
searching
3. How to get results in smaller chunks, such as if my query records count
is 1000, I would like to receive only 100 results at a time, in next query
100 more and so on

Elaborating more on above points

*1. How to index a file
*
Today when a user uploads his/her document, I create a node
programatically, attach the document to node and then Drupal indexes file
in next cron run.
When a user attaches a new document, I delete the old attached document,
attach the new document programatically, then Drupal indexes file in next
cron run.
That's how today I am indexing a file.

I feel, the node creation is not required here, just to get the file
indexed, I am creating a node.
I want to know whats the best way to index a file (without creating a node).

*2. How to store minimal data during indexing and get minimal data during
searching
*If you see during a file indexing , lots of data are sent to apache solr
server and those are getting stored. To be specific here, for my
requirement I do not need link, snippet - string, snippet-array, fields
array having 22 elements etc etc. I may need to just store UID of user
associated with this file . How can I achieve it , or at least where can I
get more information on this.

*3. How to get results in smaller chunks, such as if my query records count
is 1000, I would like to receive only 100 results at a time, in next query
100 more and so on
*Lets say a query returns 1000 records, probably I may need only first 100
records first time, when user clicks more link, I may need next 100 records
so on.
Not sure if such facility is there with apache solr attachment module.

So I really will be happy to get more information on above 3 points. If any
book, or any links that explains details will be really quite handful to me.

Any information in this regard highly appreciated.

Thanks
Austin
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.drupal.org/pipermail/development/attachments/20121016/9c619bae/attachment.html 


More information about the development mailing list