Hi Angel,
I'm looking into it. Might need a new SolrRequest, but still playing
around and will let you know...
-Grant
On Sep 2, 2009, at 4:56 AM, Angel Ice wrote:
Hi everybody.
I hope it's the right place for questions, if not sorry.
I'm trying to index rich documents (PDF, MS docs etc) in SolR/Lucene.
I have seen a few examples explaining how to use tika to solve this.
But most of these examples are using curl to send documents to Solr
or an HTML POST with an input file.
But i'd like to do it in full java.
Is there a way to use Solrj to index the documents with the
ExtractingRequestHandler of SolR or at least to get the extracted
xml back (with the extract.only option) ?
Many thanks.
Laurent.
--------------------------
Grant Ingersoll
http://www.lucidimagination.com/
Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)
using Solr/Lucene:
http://www.lucidimagination.com/search