Kamal,<br><br>I guest then there is no way to index pdf, docs and ppt that are in your directory structure unless those<br>documents are attached to a node, eh?<br><br>Thanks for your help.<br><div class="gmail_extra"><br>
<br><div class="gmail_quote">On Sat, Dec 15, 2012 at 3:29 AM, Austin Einter <span dir="ltr"><<a href="mailto:austin.einter@gmail.com" target="_blank">austin.einter@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Dear <span name="Néstor">Néstor</span> <span></span><br>In point 2, I mean in settings (of apache solr), you must enable File , so that apache will do indexing for file. Also if you have created any new content type that by default is not enabled for indexing, you need to enable it in apachesolr settings.<br>
<br>Thanks<br>Kamal<div class="HOEnZb"><div class="h5"><br><br><div class="gmail_extra"><br><br><div class="gmail_quote">On Sat, Dec 15, 2012 at 6:00 AM, Néstor <span dir="ltr"><<a href="mailto:rotsen@gmail.com" target="_blank">rotsen@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Kamal,<br><br>Thanks for your prompt reply.<br><br>I did not understand your 2 point?<br><br>Yes, I have the suspicion that in order for the documents to be indexed, they need to be attached to a node.<br>
<br>Thanks again,<br>
<br>Néstor<div><div><br><div class="gmail_extra"><br><br><div class="gmail_quote">On Fri, Dec 14, 2012 at 3:56 PM, Kamal Palei <span dir="ltr"><<a href="mailto:palei.kamal@gmail.com" target="_blank">palei.kamal@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Dear Néstor<br>Now I am using Drupal 7 , and I see all .doc and .pdf are getting indexed.<br>And finally, I was able to get .pdf and .doc files indexed in Druapl 6, but today I do not have that source code. Few things I can remember one need to do is<br>
<br>1. When somebody uploads a document, you can create a node and attach that document. In next cron, this will be indexed, if you want to see indexing quick, run cron manually.<br><br>2. Make sure in settings, you have ticked File and any new node types you have.<br>
<br>3. Any time if the document changes, you need to mark the node manually, so that in next cron the modified document will be indexed.<br><br>If I remember right, the hooks in Druapl 6 and 7 are different, you need to find some right hooks to get things working.<br>
<br>Thanks<br>Kamal<br>Net Cloud Systems, Bangalore<div><div><br><br><br><br><br><div class="gmail_quote">On Sat, Dec 15, 2012 at 3:22 AM, Néstor <span dir="ltr"><<a href="mailto:rotsen@gmail.com" target="_blank">rotsen@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">HI Kamal,<br><br>I have similar set up as yours but content of my pdf, doc files are not getting indexed.<br><br>I re-index but I do not see a change or any errors in solr log file.<br>
<br>Tika does work from the command line..<br>
<br>Drupal 6.25, apache-soler-3.6.1, tika-app-1.2.jar, drupal apachesolr 6-x.1.7,<br>drupal_attachment 6.x-1.0-beta3, php 5.3<br><br>Thanks,<br><br>Néstor<br><br>Any hints you can give me.<br><div class="gmail_extra"><br>
<br><div class="gmail_quote"><div><div>On Sat, Sep 1, 2012 at 10:13 PM, Kamal Palei <span dir="ltr"><<a href="mailto:palei.kamal@gmail.com" target="_blank">palei.kamal@gmail.com</a>></span> wrote:<br></div>
</div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><div>
<div>Hi All</div>
<div>I am trying to integrate apache solr (3.6.x), tika with Drupal 6.26.</div>
<div> </div>
<div>I found normal site contents (such as nodes, page types, panel types) are getting indexed.</div>
<div>I can go to apache solr home page and serach some phrases and can see the results.</div>
<div> </div>
<div>Even file attachment also getting indexed but only from defined node types.</div>
<div>For example, if I attach a file while creating content from Stoy or Page or Panel type, that file is getting indexed.</div>
<div> </div>
<div><b><font color="#000099">This ENSURES the integration of drupal 6.26/SOLR-3.6/TIKA is working fine.</font></b></div>
<div> </div>
<div>BUT, I have a custom module and it provides a custom form to users to register their details. and attach their resumes as well.</div>
<div>When they submit the form, I call Drupal file_save_xx api to save the file in a custom path.</div>
<div>In this case files are not getting indexed. Has anybody tried this thing previously. I searched quite a lot but could not get any</div>
<div>proper documentation in this regard.</div>
<div> </div>
<div>Any suggestions/pointers highly appreciated.</div>
<div> </div>
<div>Thanks</div>
<div>Kamal</div>
<div>NECS, Bangalore</div>
<div> </div>
<br></div></div><span><font color="#888888">--<br>
[ Drupal support list | <a href="http://lists.drupal.org/" target="_blank">http://lists.drupal.org/</a> ]<br></font></span></blockquote></div><br></div>
<br>--<br>
[ Drupal support list | <a href="http://lists.drupal.org/" target="_blank">http://lists.drupal.org/</a> ]<br></blockquote></div><br>
</div></div><br>--<br>
[ Drupal support list | <a href="http://lists.drupal.org/" target="_blank">http://lists.drupal.org/</a> ]<br></blockquote></div><br></div>
</div></div><br>--<br>
[ Drupal support list | <a href="http://lists.drupal.org/" target="_blank">http://lists.drupal.org/</a> ]<br></blockquote></div><br></div>
</div></div><br>--<br>
[ Drupal support list | <a href="http://lists.drupal.org/" target="_blank">http://lists.drupal.org/</a> ]<br></blockquote></div><br></div>