[ https://issues.apache.org/jira/browse/PDFBOX-4728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matti Oinas reopened PDFBOX-4728: --------------------------------- The problem still exists. > Broken PDF after load and save > ------------------------------ > > Key: PDFBOX-4728 > URL: https://issues.apache.org/jira/browse/PDFBOX-4728 > Project: PDFBox > Issue Type: Bug > Components: Parsing, Writing > Affects Versions: 2.0.18, 3.0.0 PDFBox > Reporter: Matti Oinas > Priority: Major > > If read was done using WINDOWS-1252 charset and writing is done using > UTF-8 then resulting PDF will be broken after load and save operations. > {{PDDocument document = PDDocument.load(sourcePath);}} > {{document.save(targetPath);}} > If source PDF contains XObject dictionary reference whose name isn't > encoded in UTF-8. For example. > /L#f8vetann 16 0 R > That is read using WINDOWS-1252 encoding. Now if write operation is > using UTF-8 then the resulting name will be > /L#3Fvetann 16 0 R > And resulting PDF is broken and image is missing. > FIX in pull request: https://github.com/apache/pdfbox/pull/77 -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org