On Sun, 27 May 2012, Raphaël wrote:
I use Tika through the Solr ExtractingRequestHandler and I face a very
common use case namely: postprocessing Tika fields in order to normalize
some fields values or override them with explicitly passed
"literal" values.

I believe you'll need to ask on the SOLR list about this, as it's likely to be specific to ExtractingRequestHandler which is maintained by SOLR rather than Tika. Once metadata comes back from Tika you can do anything you want with it, the question is more what SOLR's ExtractingRequestHandler supports

I also would like to work at the API "field" level rather than working
with xpath on the raw Tika output.

Fields are entirely SOLR/Lucene specific. Tika outputs metadata and content (as XHTML or plain text)

Nick

Reply via email to