Is it possible that this is due to extra whitespace in the PDF? On Sun, Jul 30, 2023 at 2:17 PM Keith Bennett <[email protected]> wrote:
> Hi, all. I am finally getting around to updating the "rika" Ruby gem for
> interacting with Tika in JRuby, and encountered something weird. When I
> test parsing a text file with max content length of 8, I get 8 characters
> ("Stopping"). When I test parsing a PDF file with max content length of 8,
> I only get 7 characters ("Stoppin"). Is this expected?
>
>
