Hello,

Oh! Thanks for pointing that out, my bad. My pdfbox dependency comes from quarkus-pdfbox 1.0.0 (which uses pdfbox 3.0.2), and it conflicted with the fontbox version used by the apache-tika dependency. During my tests, I didn't notice that I had changed only the fontbox version and not the pdfbox one. As a result, pdfbox and fontbox were out of sync, which explains the strange behavior and why the stack trace didn't make sense to you.
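
For anyone hitting the same kind of mismatch, here is a minimal sketch of a runtime check that compares the versions of the two jars actually loaded. It assumes both jars declare Implementation-Version in their manifests (I have not verified this for every release), and DependencyVersionCheck is a hypothetical helper, not something PDFBox provides.

```java
/** Hypothetical helper, not part of PDFBox: compares the manifest versions of two loaded jars. */
final class DependencyVersionCheck {

    /** Returns the Implementation-Version of the jar providing className, or null if unknown. */
    static String implementationVersion(String className) {
        try {
            // initialize=false: only locate the class, do not run its static initializers
            Class<?> type = Class.forName(className, false,
                    DependencyVersionCheck.class.getClassLoader());
            Package pkg = type.getPackage();
            return pkg == null ? null : pkg.getImplementationVersion();
        } catch (ClassNotFoundException | LinkageError e) {
            // Class is absent or could not be linked: treat the version as unknown
            return null;
        }
    }

    /** True only when both versions are known and identical. */
    static boolean sameKnownVersion(String classA, String classB) {
        String a = implementationVersion(classA);
        String b = implementationVersion(classB);
        return a != null && a.equals(b);
    }

    public static void main(String[] args) {
        // One well-known class from each artifact
        String pdfbox = "org.apache.pdfbox.pdmodel.PDDocument";
        String fontbox = "org.apache.fontbox.ttf.TrueTypeFont";
        System.out.println("pdfbox:  " + implementationVersion(pdfbox));
        System.out.println("fontbox: " + implementationVersion(fontbox));
        System.out.println("aligned: " + sameKnownVersion(pdfbox, fontbox));
    }
}
```

Run with the application's classpath, this prints the version each jar reports. A null means the class or its manifest entry was not found, so the check is only a hint and not a substitute for inspecting the resolved dependency tree.
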

I tried with pdfbox 3.0.5 (and fontbox 3.0.5) and it works fine.

Thanks very much for your time, Tilman (and sorry for the confusion).

Maxime

On 2025/05/27 03:08:10 Tilman Hausherr wrote:
> On 26.05.2025 20:55, Maxime WIEWIORA wrote:
> > Maybe not decreasing the value of lastIndexOfPrevMatch when the
> > CompoundCharacterTokenizer has been initialized via the
> > constructor CompoundCharacterTokenizer(Pattern pattern) could be a solution
>
> That constructor is no longer used in PDFBox, that was changed in the
> issue you mention. Also your stack trace shows lines that are from
> BEFORE the change you mention?!
>
> Tilman