[ https://issues.apache.org/jira/browse/PDFBOX-283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14056388#comment-14056388 ]
Tilman Hausherr commented on PDFBOX-283:
----------------------------------------

Thanks for the test code and the test file. PDTextbox is gone in the trunk but is still in the 1.8 version, which confused me. Anyway, now that I have your code and PDF, I understand what was happening, and that there is no test for it at this time.

I committed your changes in rev 1609211 in the 1.8 version and rev 1609212 in the trunk. Please do visual tests on everything acroform-related as much as you can to be sure.

> Character encoding/appearance issues when filling forms
> -------------------------------------------------------
>
>                 Key: PDFBOX-283
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-283
>             Project: PDFBox
>          Issue Type: Bug
>          Components: AcroForm
>         Attachments: PDAppearance.diff, PDAppearance.patch,
> PDAppearance_bis.diff, acroform.pdf
>
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1735902
> Originally submitted by scop on 2007-06-12 10:23.
> When filling a text field with non-ASCII characters such as in my surname
> "Skyttä" and saving the document in a UTF-8 environment, something goes
> wrong with the appearance of the text.
> The value itself seems to be stored correctly, but when opening the doc, the
> appearance of "ä" is not that, but rather something which happens when UTF-8
> is mistakenly treated as ISO-8859-1 (two garbage characters).
> PDAppearance uses the platform default encoding in quite a few places which
> apparently has potential to mess things up. In particular,
> insertGeneratedAppearance() generates a PrintWriter from an OutputStream
> without specifying the encoding. In fact, if I hack that to use ISO-8859-1,
> the appearance of my "ä" case is correct, but that won't obviously work with
> anything else than chars that are valid ISO-8859-1.
> In which char encoding should the value be written to the appearance stream
> (at end of insertGeneratedAppearance())?
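
For readers who want to reproduce the symptom described in the quoted report, here is a minimal Java sketch. It is not PDFBox code: the class EncodingDemo and its write() helper are hypothetical names made up for illustration. It only shows how the charset handed to the writer that wraps an OutputStream changes the bytes produced for "Skyttä", and how UTF-8 bytes read back as ISO-8859-1 turn "ä" into two characters. It does not settle which encoding an appearance stream should use, since that depends on the encoding of the font in use.

```java
import java.io.ByteArrayOutputStream;
import java.io.OutputStreamWriter;
import java.io.PrintWriter;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not part of PDFBox.
class EncodingDemo {

    // Writes text through a PrintWriter whose charset is given explicitly,
    // instead of relying on the platform default encoding.
    static byte[] write(String text, Charset charset) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        PrintWriter writer = new PrintWriter(new OutputStreamWriter(out, charset));
        writer.print(text);
        writer.flush(); // push buffered characters into the byte stream
        return out.toByteArray();
    }

    public static void main(String[] args) {
        String name = "Skytt\u00e4"; // "Skyttä", escaped so the source file encoding does not matter

        byte[] utf8 = write(name, StandardCharsets.UTF_8);
        byte[] latin1 = write(name, StandardCharsets.ISO_8859_1);

        // UTF-8 needs two bytes (0xC3 0xA4) for U+00E4, ISO-8859-1 needs one (0xE4).
        System.out.println("UTF-8 bytes:      " + utf8.length);
        System.out.println("ISO-8859-1 bytes: " + latin1.length);

        // Reading the UTF-8 bytes as ISO-8859-1 yields one character per byte,
        // so the single "ä" becomes two unrelated characters.
        String misread = new String(utf8, StandardCharsets.ISO_8859_1);
        System.out.println("Chars after misreading: " + misread.length());
    }
}
```

Passing the charset explicitly makes the output independent of the environment the JVM runs in, which is the variable the report identifies. As the report also notes, ISO-8859-1 only helps for characters that exist in that charset.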