Hi All, I have been doing a bit of R&D on how to search within file contents. To search for a particular token or word in a text/doc/pdf file, we can probably use the Apache Solr / Tika combination. With the help of the Apache Solr attachments module, we can probably search for a specific word in an attached Word document.
Please correct me if my assumption so far is wrong.
On further study, I learned that if a node is created and a document is attached to it, the attached document is indexed only on the next cron run, so a specific word in that document becomes searchable after a certain delay (I believe the default delay is 2 minutes).
Now the question: if somebody just uploads a document and there is no need to attach it to any node, is there any way to get the document indexed on the next cron run? I can programmatically create a node and attach the document so that it gets indexed, but I do not want to create a node programmatically just to index a document, because the number of nodes would then keep growing day by day.
Any suggestions would be highly appreciated.
Thanks, Austin
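To make the question concrete, here is a rough sketch of what node-free indexing might look like if the file were posted straight to Solr's extracting request handler (`/update/extract`, the Tika-backed endpoint) instead of going through Drupal. The Solr URL, the `file-42` style id and both function names are made up for illustration. This bypasses the attachments module entirely, and the schema shipped with the Drupal module may require more fields than just `id`, so treat it as an assumption rather than a tested setup.

```python
import urllib.parse
import urllib.request

# Assumption: a local Solr instance; adjust to the real server/core.
SOLR_URL = "http://localhost:8983/solr"


def build_extract_url(solr_url, doc_id, commit=True):
    """Build the /update/extract URL with the document id passed as a literal field."""
    params = {"literal.id": doc_id}  # hypothetical id scheme, e.g. "file-42"
    if commit:
        params["commit"] = "true"
    return solr_url.rstrip("/") + "/update/extract?" + urllib.parse.urlencode(params)


def index_file(path, doc_id, solr_url=SOLR_URL):
    """POST the raw file bytes to Solr; Tika extracts the text on the Solr side."""
    with open(path, "rb") as fh:
        body = fh.read()
    request = urllib.request.Request(
        build_extract_url(solr_url, doc_id),
        data=body,  # supplying data makes this a POST request
        headers={"Content-Type": "application/octet-stream"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status
```

Calling something like `index_file("/tmp/report.pdf", "file-42")` from a cron job would then index the upload without a node, though the Drupal search pages would not necessarily know how to display such a document.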
Hi List, any input on this would be highly appreciated. Thanks, Austin
Hi Austin, are you using D7 or D6? I tried this with D6 a couple of months back. I had a simple form where any user could upload a document, and I wanted that document to be indexed. I used the Apache Solr file attachments module and also created a node programmatically.
As far as I know, it is not possible to index a file without creating a corresponding node.
Thanks, Kamal
On Thu, Sep 27, 2012 at 1:36 PM, Austin Einter austin.einter@gmail.com wrote:
Hi List Any input regarding this highly appreciated. Thanks, Austin
-- [ Drupal support list | http://lists.drupal.org/ ]