All,
I’m new to the list, but tried to search pdfbox-users.markmail.org
<http://pdfbox-users.markmail.org/> before asking with no luck.
We are using pdfbox to fill in some form fields in an Adobe generated template
but getting odd results when certain international characters are used (some,
not all). When the pdf is first opened, the characters shown are basically
garbage. Here is an example: þÿB D Aóz
However, when you click into the field (or in certain readers like Okular on
Linux choose “Show Forms”), the correct characters are shown. Here is what is
inserted into the field and shown when field is selected: ł ń Ł ó ź
It is almost like the PDF has one font selected for a read-only view and the
correct font for the view when editing a field.
This is happening with Polish, Russian, Chinese and other languages.
This is how I am populating the fields:
PDDocument pdfDoc = LoadPDF.load(cs, document);
PDDocumentCatalog docCatalog = pdfDoc.getDocumentCatalog();
PDAcroForm acroForm = docCatalog.getAcroForm();
if (acroForm != null) {
for (PDField field : acroForm.getFieldTree()) {
for (PdfField pdfField : pdfFields) {
if (field.getPartialName() != null &&
field.getPartialName().equalsIgnoreCase(pdfField.getName())) {
field.setValue(pdfField.getValue());
}
}
}
}
pdfDoc.save(tempPdf);
pdfDoc.close();
Thanks for any help,
Scott