[support] Drupal appache solr tika fileattachment indexing

Néstor rotsen at gmail.com
Thu Jan 3 20:28:51 UTC 2013


Kamal et all,

Just wanted to let you know that I was able to attach and index the files
attached
but when you do a search it finds the document that the files are attached
to, but not
the document itself.  Once you click on the document then there is a list
of all of the
files that were attached.  That is not that good because then the user
needs to be
smart enough to know that they need to click on one of the links inside
this file.
Most users are not that smart.

I am using the File_import modules.

Thanks for your help,

Nestor


On Sat, Dec 29, 2012 at 8:38 AM, Kamal Palei <palei.kamal at gmail.com> wrote:

> Sorry for delayed response.
>
> Kind of. If you attach documents those will get indexed.
>
> Apache_solr_files module should be able to index files without attaching
> to node, but when I tried, it did not work that way.
>
>
> Thanks
> Kamal
> Net Cloud Systems, Bangalore
>
>
>
> On Mon, Dec 17, 2012 at 11:56 PM, Néstor <rotsen at gmail.com> wrote:
>
>> Kamal,
>>
>> I guest then there is no way to index pdf, docs and ppt that are in your
>> directory structure unless those
>> documents are attached to a node, eh?
>>
>> Thanks for your help.
>>
>>
>>
>> On Sat, Dec 15, 2012 at 3:29 AM, Austin Einter <austin.einter at gmail.com>wrote:
>>
>>> Dear Néstor
>>> In point 2, I mean in settings (of apache solr), you must enable File ,
>>> so that apache will do indexing for file. Also if you have created any new
>>> content type that by default is not enabled for indexing, you need to
>>> enable it in apachesolr settings.
>>>
>>> Thanks
>>> Kamal
>>>
>>>
>>>
>>>
>>> On Sat, Dec 15, 2012 at 6:00 AM, Néstor <rotsen at gmail.com> wrote:
>>>
>>>> Hi Kamal,
>>>>
>>>> Thanks for your prompt reply.
>>>>
>>>> I did not understand your 2 point?
>>>>
>>>> Yes, I have the suspicion that in order for the documents to be
>>>> indexed, they need to be attached to a node.
>>>>
>>>> Thanks again,
>>>>
>>>> Néstor
>>>>
>>>>
>>>>
>>>> On Fri, Dec 14, 2012 at 3:56 PM, Kamal Palei <palei.kamal at gmail.com>wrote:
>>>>
>>>>> Dear Néstor
>>>>> Now I am using Drupal 7 , and I see all .doc and .pdf are getting
>>>>> indexed.
>>>>> And finally, I was able to get .pdf and .doc files indexed in Druapl
>>>>> 6, but today I do not have that source code. Few things I can remember one
>>>>> need to do is
>>>>>
>>>>> 1. When somebody uploads a document, you can create a node and attach
>>>>> that document. In next cron, this will be indexed, if you want to see
>>>>> indexing quick, run cron manually.
>>>>>
>>>>> 2. Make sure in settings, you have ticked File and any new node types
>>>>> you have.
>>>>>
>>>>> 3. Any time if the document changes, you need to mark the node
>>>>> manually, so that in next cron the modified document will be indexed.
>>>>>
>>>>> If I remember right, the hooks in Druapl 6 and 7 are different, you
>>>>> need to find some right hooks to get things working.
>>>>>
>>>>> Thanks
>>>>> Kamal
>>>>> Net Cloud Systems, Bangalore
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> On Sat, Dec 15, 2012 at 3:22 AM, Néstor <rotsen at gmail.com> wrote:
>>>>>
>>>>>> HI Kamal,
>>>>>>
>>>>>> I have similar set up as yours but content of my pdf, doc files are
>>>>>> not getting indexed.
>>>>>>
>>>>>> I re-index but I do not see a change or any errors in solr log file.
>>>>>>
>>>>>> Tika does work from the command line..
>>>>>>
>>>>>> Drupal 6.25, apache-soler-3.6.1, tika-app-1.2.jar, drupal apachesolr
>>>>>> 6-x.1.7,
>>>>>> drupal_attachment 6.x-1.0-beta3, php 5.3
>>>>>>
>>>>>> Thanks,
>>>>>>
>>>>>> Néstor
>>>>>>
>>>>>> Any hints you can give me.
>>>>>>
>>>>>>
>>>>>> On Sat, Sep 1, 2012 at 10:13 PM, Kamal Palei <palei.kamal at gmail.com>wrote:
>>>>>>
>>>>>>> Hi All
>>>>>>> I am trying to integrate apache solr (3.6.x), tika with Drupal 6.26.
>>>>>>>
>>>>>>> I found normal site contents (such as nodes, page types, panel
>>>>>>> types) are getting indexed.
>>>>>>> I can go to apache solr home page and serach some phrases and can
>>>>>>> see the results.
>>>>>>>
>>>>>>> Even file attachment also getting indexed but only from defined node
>>>>>>> types.
>>>>>>> For example, if I attach a file while creating content from Stoy or
>>>>>>> Page or Panel type, that file is getting indexed.
>>>>>>>
>>>>>>> *This ENSURES the integration of drupal 6.26/SOLR-3.6/TIKA is
>>>>>>> working fine.*
>>>>>>>
>>>>>>> BUT, I have a custom module and it provides a custom form to users
>>>>>>> to register their details. and attach their resumes as well.
>>>>>>> When they submit the form, I call Drupal file_save_xx api to save
>>>>>>> the file in a custom path.
>>>>>>> In this case files are not getting indexed. Has anybody tried this
>>>>>>> thing previously. I searched quite a lot but could not get any
>>>>>>> proper documentation in this regard.
>>>>>>>
>>>>>>> Any suggestions/pointers highly appreciated.
>>>>>>>
>>>>>>> Thanks
>>>>>>> Kamal
>>>>>>> NECS, Bangalore
>>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>> [ Drupal support list | http://lists.drupal.org/ ]
>>>>>>>
>>>>>>
>>>>>>
>>>>>> --
>>>>>> [ Drupal support list | http://lists.drupal.org/ ]
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> [ Drupal support list | http://lists.drupal.org/ ]
>>>>>
>>>>
>>>>
>>>> --
>>>> [ Drupal support list | http://lists.drupal.org/ ]
>>>>
>>>
>>>
>>> --
>>> [ Drupal support list | http://lists.drupal.org/ ]
>>>
>>
>>
>> --
>> [ Drupal support list | http://lists.drupal.org/ ]
>>
>
>
> --
> [ Drupal support list | http://lists.drupal.org/ ]
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.drupal.org/pipermail/support/attachments/20130103/5e66b5de/attachment-0001.html 


More information about the support mailing list