Hi,

I've created OAK-7996 [0] to discuss letting us disable automatic text
extraction through configuration, rather than having to add a tika.config
to an index definition to do it.

This was originally proposed as a possible Oak change last November, but in
discussion we agreed not to attempt it then because the 1.10 release was so
close.  Now that 1.10 is out, I wonder if we could consider this for a
future release.

One use case for disabling text extraction would be if a user is performing
the text extraction for a new binary on their own, outside of Oak.

WDYT?


[0] - https://issues.apache.org/jira/browse/OAK-7996


-MR
