Kamal,

I guest then there is no way to index pdf, docs and ppt that are in your directory structure unless those
documents are attached to a node, eh?

Thanks for your help.


On Sat, Dec 15, 2012 at 3:29 AM, Austin Einter <austin.einter@gmail.com> wrote:
Dear Néstor
In point 2, I mean in settings (of apache solr), you must enable File , so that apache will do indexing for file. Also if you have created any new content type that by default is not enabled for indexing, you need to enable it in apachesolr settings.

Thanks
Kamal




On Sat, Dec 15, 2012 at 6:00 AM, Néstor <rotsen@gmail.com> wrote:
Hi Kamal,

Thanks for your prompt reply.

I did not understand your 2 point?

Yes, I have the suspicion that in order for the documents to be indexed, they need to be attached to a node.

Thanks again,

Néstor



On Fri, Dec 14, 2012 at 3:56 PM, Kamal Palei <palei.kamal@gmail.com> wrote:
Dear Néstor
Now I am using Drupal 7 , and I see all .doc and .pdf are getting indexed.
And finally, I was able to get .pdf and .doc files indexed in Druapl 6, but today I do not have that source code. Few things I can remember one need to do is

1. When somebody uploads a document, you can create a node and attach that document. In next cron, this will be indexed, if you want to see indexing quick, run cron manually.

2. Make sure in settings, you have ticked File and any new node types you have.

3. Any time if the document changes, you need to mark the node manually, so that in next cron the modified document will be indexed.

If I remember right, the hooks in Druapl 6 and 7 are different, you need to find some right hooks to get things working.

Thanks
Kamal
Net Cloud Systems, Bangalore





On Sat, Dec 15, 2012 at 3:22 AM, Néstor <rotsen@gmail.com> wrote:
HI Kamal,

I have similar set up as yours but content of my pdf, doc files are not getting indexed.

I re-index but I do not see a change or any errors in solr log file.

Tika does work from the command line..

Drupal 6.25, apache-soler-3.6.1, tika-app-1.2.jar, drupal apachesolr 6-x.1.7,
drupal_attachment 6.x-1.0-beta3, php 5.3

Thanks,

Néstor

Any hints you can give me.


On Sat, Sep 1, 2012 at 10:13 PM, Kamal Palei <palei.kamal@gmail.com> wrote:
Hi All
I am trying to integrate apache solr (3.6.x), tika with Drupal 6.26.
 
I found normal site contents (such as nodes, page types, panel types) are getting indexed.
I can go to apache solr home page and serach some phrases and can see the results.
 
Even file attachment also getting indexed but only from defined node types.
For example, if I attach a file while creating content from Stoy or Page or Panel type, that file is getting indexed.
 
This ENSURES the integration of drupal 6.26/SOLR-3.6/TIKA is working fine.
 
BUT, I have a custom module and it provides a custom form to users to register their details. and attach their resumes as well.
When they submit the form, I call Drupal file_save_xx api to save the file in a custom path.
In this case files are not getting indexed. Has anybody tried this thing previously. I searched quite a lot but could not get any
proper documentation in this regard.
 
Any suggestions/pointers highly appreciated.
 
Thanks
Kamal
NECS, Bangalore
 

--
[ Drupal support list | http://lists.drupal.org/ ]


--
[ Drupal support list | http://lists.drupal.org/ ]


--
[ Drupal support list | http://lists.drupal.org/ ]


--
[ Drupal support list | http://lists.drupal.org/ ]


--
[ Drupal support list | http://lists.drupal.org/ ]