Hi Tim, All, A simple Quarkus Tika extension I was working upon has made it into the latest Quarkus 0.19.1 release: https://github.com/quarkusio/quarkus/tree/0.19.1/extensions/tika
Happy about it and intend to grow it further. It is pretty basic at the moment (AutoDetectParser + ToTextCotentHandler). But what can be of interest is that it can be run in the GraalVM native image (so it will be all just a C compiled executable AFAIK). The basic test shows how to use it: https://github.com/quarkusio/quarkus/tree/master/integration-tests/tika (Simple PDF, ODT and text files are checked) I'll work on a demo in the next couple of months, and I will present about it at Apache Con EU in Oct. But if someone is interested to try to run this extension in the native mode, let me know please and I'll try to prepare some simple instructions on how to quickly try with some massive PDF, etc. Cheers, Sergey
