Hi all,

while SolrCell works nicely when in need of indexing binary documents, I am
wondering about the possibility of having Lucene / Solr documents that have
binaries in specific Lucene fields, e.g. title="a nice doc",
name"blabla.doc", binary="0x1234...".

In that case the "binary" field should have an indexing analyzer which can
extract the text from the binary and index it.

Would it make sense to create a Tika based analyzer for that purpose?

Regards,
Tommaso

Reply via email to