[
https://issues.apache.org/jira/browse/PDFBOX-945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Martin Koegler updated PDFBOX-945:
----------------------------------
Attachment: 03-standard-lf.patch
Patch 03:
There are other newline charachters out there (eg. like NEL).
The PDF specification only talks about CR/LF, so its not safe to use
line.separator in PDFs
> PDFBOX may not depend on plattform encoding
> -------------------------------------------
>
> Key: PDFBOX-945
> URL: https://issues.apache.org/jira/browse/PDFBOX-945
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 1.5.0
> Reporter: Martin Koegler
> Attachments: 01-static-init-encoding.patch, 02-encoding.patch,
> 03-standard-lf.patch
>
>
> The pdf specification states, that PDFs use an ASCII compatible, 8 bit
> characterset.
> PDFBOX uses on various places the plattform encoding to convert bytes from/to
> strings.
> On JREs with an non ASCII compatible plattform encoding (there are such
> systems out there), this will break pdfbox.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.