All,

I’m new to the list, but tried to search pdfbox-users.markmail.org 
<http://pdfbox-users.markmail.org/> before asking with no luck.

We are using pdfbox to fill in some form fields in an Adobe generated template 
but getting odd results when certain international characters are used (some, 
not all).  When the pdf is first opened, the characters shown are basically 
garbage.  Here is an example: þÿB D Aóz

However, when you click into the field (or in certain readers like Okular on 
Linux choose “Show Forms”), the correct characters are shown.  Here is what is 
inserted into the field and shown when field is selected: ł ń Ł ó ź

It is almost like the PDF has one font selected for a read-only view and the 
correct font for the view when editing a field.

This is happening with Polish, Russian, Chinese and other languages.

This is how I am populating the fields:

PDDocument pdfDoc = LoadPDF.load(cs, document);
PDDocumentCatalog docCatalog = pdfDoc.getDocumentCatalog();
PDAcroForm acroForm = docCatalog.getAcroForm();

if (acroForm != null) {
        for (PDField field : acroForm.getFieldTree()) {
                for (PdfField pdfField : pdfFields) {
                        if (field.getPartialName() != null && 
field.getPartialName().equalsIgnoreCase(pdfField.getName())) {
                                field.setValue(pdfField.getValue());
                        }
                }
        }
}
pdfDoc.save(tempPdf);
pdfDoc.close();


Thanks for any help,
Scott

Reply via email to