Tim Allison created PDFBOX-5358:
-----------------------------------

             Summary: Add support for UTF-8 in strings
                 Key: PDFBOX-5358
                 URL: https://issues.apache.org/jira/browse/PDFBOX-5358
             Project: PDFBox
          Issue Type: Improvement
            Reporter: Tim Allison
         Attachments: Screen Shot 2022-01-06 at 9.18.09 AM.png

Peter Wyatt recently published an article on UTF-8 strings in PDF 2.0: 
[https://www.pdfa.org/understanding-utf-8-in-pdf-2-0/]

The article includes a link to a test file he created: 
[https://github.com/pdf-association/pdf20examples/blob/master/pdf20-utf8-test.pdf]
 

Our debugger shows that we may need to add support for this (see attached).  
This was with PDFBox 2.0.25.  I didn't have a chance to test with 3.x or the 
2.x snapshot.

I don't think we're necessarily covering all the changes yet in PDF 2.0, but I 
thought I'd open this issue for at least discussion.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to