Hi! I'm importing BLOBs from an Oracle DB, and want to retrieve the textual body/plaintext content for analyzing/indexing purposes. I'm using TikaEntityProcessor to do the parsing of the documents, which works fine for most of the documents. But in some cases , e.g. when a document is password protected, the parsing fails, and Tika throws a Tika-198: IllegalIOException (see stack trace at the end of post). This leads to the entire dataimport being rollbacked, which really is an unfortunate behavior.
After finding the ignoreTikaException flag (Jira issue https://issues.apache.org/jira/browse/SOLR-2480) patch, I thought my problem was fixed, but adding this flag to my extractingRequestHandler doesn't seem to do anything. My requestHandler: I've tried by adding ignoreTikaException=true as a custom parameter when doing the dataImport as well, but it doesn't do anything. Did I miss something, or has the ignoreTikaException mechanism changed in later versions of Solr? The ERROR stack trace: -- View this message in context: http://lucene.472066.n3.nabble.com/gnoreTikaException-flag-not-working-tp4168526.html Sent from the Solr - User mailing list archive at Nabble.com.