Hi Tim, All,

A simple Quarkus Tika extension I was working upon has made it into the
latest Quarkus 0.19.1 release:
https://github.com/quarkusio/quarkus/tree/0.19.1/extensions/tika

Happy about it and intend to grow it further. It is pretty basic at the
moment (AutoDetectParser + ToTextCotentHandler). But what can be of
interest is that it can be run in the GraalVM native image (so it will be
all just a C compiled executable AFAIK).
The basic test shows how to use it:

https://github.com/quarkusio/quarkus/tree/master/integration-tests/tika
(Simple PDF, ODT and text files are checked)

I'll work on a demo in the next couple of months, and I will present about
it at Apache Con EU in Oct. But if someone is interested to try to run this
extension in the native mode, let me know please and I'll try to prepare
some simple instructions on how to quickly try with some massive PDF, etc.

Cheers, Sergey

Reply via email to