Have a look at https://issues.apache.org/jira/browse/SOLR-284
I've created a contrib module for Solr that integrates Tika. Feedback would be greatly appreciated by adding comments on the issue.
On Dec 4, 2008, at 11:56 AM, Steve Ruzila wrote:
I'm working on a project that would use Lucene/Solr as the backend for searching through thousands of MS Office and PDF files. I want to be able to do keyword searches on these files. I'm not quite clear as to how POI works with Lucene and what relationship the Tika project has with it....since Tika seems to use POI. Thanks. Steve
-------------------------- Grant Ingersoll Lucene Helpful Hints: http://wiki.apache.org/lucene-java/BasicsOfPerformance http://wiki.apache.org/lucene-java/LuceneFAQ
