Hi, I've created OAK-7996 [0] to discuss allowing us to disable automatic text extraction by configuration instead of using a tika.config in an index definition to do it.
This was originally proposed as a possible Oak change last November, but in discussion we agreed not to attempt this change at that time due to the close proximity of the 1.10 release. Now that 1.10 is out, I wonder if we could consider this for a future release. One use case for disabling text extraction would be if a user is performing the text extraction for a new binary on their own, outside of Oak. WDYT? [0] - https://issues.apache.org/jira/browse/OAK-7996 -MR