Y, I agree with Nick. Tika appears to add a new line in the correct spot at least for IDEC-102...
On Mon, Oct 31, 2022 at 9:22 AM Nick Burch <[email protected]> wrote: > On Sun, 30 Oct 2022, Christian Ribeaud wrote: > > I am using the default configuration. I think, we could reduce my > > problem to following code snippet: > > Is there a reason that you aren't using one of the built-in Tika content > handlers? Generally they should be taking care of everything for you with > paragraphs, plain text vs html etc > > Nick >
