Tika analyzers

Tommaso Teofili Wed, 30 Jul 2014 06:01:58 -0700

Hi all,

while SolrCell works nicely when in need of indexing binary documents, I am
wondering about the possibility of having Lucene / Solr documents that have
binaries in specific Lucene fields, e.g. title="a nice doc",
name"blabla.doc", binary="0x1234...".


In that case the "binary" field should have an indexing analyzer which can
extract the text from the binary and index it.

Would it make sense to create a Tika based analyzer for that purpose?

Regards,
Tommaso

Tika analyzers

Reply via email to