[support] Drupal appache solr tika fileattachment indexing

Néstor rotsen at gmail.com
Mon Dec 17 18:26:08 UTC 2012


Kamal,

I guest then there is no way to index pdf, docs and ppt that are in your
directory structure unless those
documents are attached to a node, eh?

Thanks for your help.


On Sat, Dec 15, 2012 at 3:29 AM, Austin Einter <austin.einter at gmail.com>wrote:

> Dear Néstor
> In point 2, I mean in settings (of apache solr), you must enable File , so
> that apache will do indexing for file. Also if you have created any new
> content type that by default is not enabled for indexing, you need to
> enable it in apachesolr settings.
>
> Thanks
> Kamal
>
>
>
>
> On Sat, Dec 15, 2012 at 6:00 AM, Néstor <rotsen at gmail.com> wrote:
>
>> Hi Kamal,
>>
>> Thanks for your prompt reply.
>>
>> I did not understand your 2 point?
>>
>> Yes, I have the suspicion that in order for the documents to be indexed,
>> they need to be attached to a node.
>>
>> Thanks again,
>>
>> Néstor
>>
>>
>>
>> On Fri, Dec 14, 2012 at 3:56 PM, Kamal Palei <palei.kamal at gmail.com>wrote:
>>
>>> Dear Néstor
>>> Now I am using Drupal 7 , and I see all .doc and .pdf are getting
>>> indexed.
>>> And finally, I was able to get .pdf and .doc files indexed in Druapl 6,
>>> but today I do not have that source code. Few things I can remember one
>>> need to do is
>>>
>>> 1. When somebody uploads a document, you can create a node and attach
>>> that document. In next cron, this will be indexed, if you want to see
>>> indexing quick, run cron manually.
>>>
>>> 2. Make sure in settings, you have ticked File and any new node types
>>> you have.
>>>
>>> 3. Any time if the document changes, you need to mark the node manually,
>>> so that in next cron the modified document will be indexed.
>>>
>>> If I remember right, the hooks in Druapl 6 and 7 are different, you need
>>> to find some right hooks to get things working.
>>>
>>> Thanks
>>> Kamal
>>> Net Cloud Systems, Bangalore
>>>
>>>
>>>
>>>
>>>
>>> On Sat, Dec 15, 2012 at 3:22 AM, Néstor <rotsen at gmail.com> wrote:
>>>
>>>> HI Kamal,
>>>>
>>>> I have similar set up as yours but content of my pdf, doc files are not
>>>> getting indexed.
>>>>
>>>> I re-index but I do not see a change or any errors in solr log file.
>>>>
>>>> Tika does work from the command line..
>>>>
>>>> Drupal 6.25, apache-soler-3.6.1, tika-app-1.2.jar, drupal apachesolr
>>>> 6-x.1.7,
>>>> drupal_attachment 6.x-1.0-beta3, php 5.3
>>>>
>>>> Thanks,
>>>>
>>>> Néstor
>>>>
>>>> Any hints you can give me.
>>>>
>>>>
>>>> On Sat, Sep 1, 2012 at 10:13 PM, Kamal Palei <palei.kamal at gmail.com>wrote:
>>>>
>>>>> Hi All
>>>>> I am trying to integrate apache solr (3.6.x), tika with Drupal 6.26.
>>>>>
>>>>> I found normal site contents (such as nodes, page types, panel types)
>>>>> are getting indexed.
>>>>> I can go to apache solr home page and serach some phrases and can see
>>>>> the results.
>>>>>
>>>>> Even file attachment also getting indexed but only from defined node
>>>>> types.
>>>>> For example, if I attach a file while creating content from Stoy or
>>>>> Page or Panel type, that file is getting indexed.
>>>>>
>>>>> *This ENSURES the integration of drupal 6.26/SOLR-3.6/TIKA is working
>>>>> fine.*
>>>>>
>>>>> BUT, I have a custom module and it provides a custom form to users to
>>>>> register their details. and attach their resumes as well.
>>>>> When they submit the form, I call Drupal file_save_xx api to save the
>>>>> file in a custom path.
>>>>> In this case files are not getting indexed. Has anybody tried this
>>>>> thing previously. I searched quite a lot but could not get any
>>>>> proper documentation in this regard.
>>>>>
>>>>> Any suggestions/pointers highly appreciated.
>>>>>
>>>>> Thanks
>>>>> Kamal
>>>>> NECS, Bangalore
>>>>>
>>>>>
>>>>> --
>>>>> [ Drupal support list | http://lists.drupal.org/ ]
>>>>>
>>>>
>>>>
>>>> --
>>>> [ Drupal support list | http://lists.drupal.org/ ]
>>>>
>>>
>>>
>>> --
>>> [ Drupal support list | http://lists.drupal.org/ ]
>>>
>>
>>
>> --
>> [ Drupal support list | http://lists.drupal.org/ ]
>>
>
>
> --
> [ Drupal support list | http://lists.drupal.org/ ]
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.drupal.org/pipermail/support/attachments/20121217/d262e3cd/attachment-0001.html 


More information about the support mailing list