Hello,

Oh! Thanks for pointing that out! My bad... my pdfbox dependency comes from
quarkus-pdfbox 1.0.0 (using pdfbox 3.0.2) and I had a conflict with fontbox
used by the apache-tika dependency. But during my tests, I didn't notice
that I had only changed the fontbox version and not the pdfbox one. As a
result, there was a desynchronization between pdfbox and fontbox, which
explains this strange behavior and the fact that the stack didn't make
sense to you.

I tried with pdfbox 3.0.5 (and fontbox 3.0.5) and it works fine.

Thanks very much for your time, Tilman (and sorry for the confusion).

Maxime

On 2025/05/27 03:08:10 Tilman Hausherr wrote:
> On 26.05.2025 20:55, Maxime WIEWIORA wrote:
> > Maybe not decreasing the value of lastIndexOfPrevMatch when the
> > CompoundCharacterTokenizer has been initialized via the
> > constructor CompoundCharacterTokenizer(Pattern pattern) could be a
solution
>
> That constructor is no longer used in PDFBox, that was changed in the
> issue you mention. Also your stack trace shows lines that are from
> BEFORE the change you mention?!
>
> Tilman
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
> For additional commands, e-mail: users-h...@pdfbox.apache.org
>
>

Reply via email to