On Sun, 30 Oct 2022, Christian Ribeaud wrote:
I am using the default configuration. I think, we could reduce my problem to following code snippet:

Is there a reason that you aren't using one of the built-in Tika content handlers? Generally they should be taking care of everything for you with paragraphs, plain text vs html etc

Nick

Reply via email to