I've ran your shell script and got the same result, the first char is missing in the pdf.

It seems to be related to the way you are calling TextToPDF. You are simply print the text to the console and redirect it to TextToPDF.

I've changed that and echoed the text to a file and used that file as input for TextToPDF. Voila, everything works fine.

PDFBOX-5554 added support for a charset parameter and a leading UTF-8 BOM is removed automatically. I assume the latter is the issue here. It reads the input twice and somehow this doesn't work with a redirected input on linux

Andreas

Am 25.07.23 um 08:10 schrieb michael.a...@universa.de:
the question is, where does the char got lost, when creating the pdf or when 
extracting the text?

Sorry if i was not precise enough. The created pdf misses the first char. So 
the TextToPDF function has a problem.

Did you check the created pdf? Does it contain the whole text?

I tested/viewed it. The first char is missing.


Hinweise zur Datensicherheit und zur Vertraulichkeit von E-Mails finden Sie 
hier:
https://www.universa.de/e-mail-kommunikation

Informationen zum Datenschutz und zu den Betroffenenrechten können Sie 
nachlesen unter:
https://www.universa.de/datenschutz

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: users-h...@pdfbox.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: users-h...@pdfbox.apache.org

Reply via email to