Hi!

I'm importing BLOBs from an Oracle DB, and want to retrieve the textual
body/plaintext content for analyzing/indexing purposes. I'm using
TikaEntityProcessor to do the parsing of the documents, which works fine for
most of the documents. But in some cases , e.g. when a document is password
protected, the parsing fails, and Tika throws a Tika-198: IllegalIOException
(see stack trace at the end of post). This leads to the entire dataimport
being rollbacked, which really is an unfortunate behavior.

After finding the ignoreTikaException flag (Jira issue
https://issues.apache.org/jira/browse/SOLR-2480) patch, I thought my problem
was fixed, but adding this flag to my extractingRequestHandler doesn't seem
to do anything.

My requestHandler:


I've tried by adding ignoreTikaException=true as a custom parameter when
doing the dataImport as well, but it doesn't do anything.
Did I miss something, or has the ignoreTikaException mechanism changed in
later versions of Solr?

The ERROR stack trace:




--
View this message in context: 
http://lucene.472066.n3.nabble.com/gnoreTikaException-flag-not-working-tp4168526.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to