philip wrote:
> Looking at the debug window I noticed that now the field names for the
> new PDF have a strange pattern like (notice the "þÿ" after each period
> and the spaces between characters):
>
> þÿ F [ 0 ].þÿ P 1 [ 0 ].þÿ c a n a d i a n _ p r o v i n c e s [ 0 ]
>
> instead of
> F[0].P1[0].canadian_provinces[0]

That's like saying: yesterday there were 25 pupils in my classroom,
today there are 25 students. What's different?

The þÿ should ring a bell. From the PDF Reference:

"for text strings encoded in Unicode, the first two bytes must be 254
followed by 255. These two bytes represent the Unicode byte order
marker, U+FEFF, indicating that the string is encoded in the UTF-16BE
(big-endian) encoding scheme specified in the Unicode standard. (This
mechanism precludes beginning a string using PDFDocEncoding with the two
characters thorn ydieresis, which is unlikely to be a meaningful
beginning of a word or phrase)."

In other words: the content of both strings is identical, but the
encoding is different. The debug window is showing the raw UTF-16BE
bytes one by one: the byte order marker appears as þÿ, and the zero
byte in front of each ASCII character appears as a space.

br,
Bruno
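
A minimal sketch of the point above, in plain Java and without iText. The class name PdfTextStringDemo and both helper methods are made up for this illustration and are not part of any library. ISO-8859-1 stands in for PDFDocEncoding, which is only an approximation that holds for the ASCII characters used here. The sketch encodes one partial field name as UTF-16BE with the leading bytes 254 and 255, shows what a byte-by-byte view of it looks like, and then decodes it properly. Each partial name in a fully qualified field name is a text string of its own, which would explain why the marker shows up again after every period.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; not part of iText.
class PdfTextStringDemo {

    // Encode a name the "Unicode" way: bytes 254, 255 followed by UTF-16BE.
    public static byte[] encodeUnicode(String s) {
        byte[] body = s.getBytes(StandardCharsets.UTF_16BE);
        byte[] out = new byte[body.length + 2];
        out[0] = (byte) 0xFE; // 254
        out[1] = (byte) 0xFF; // 255
        System.arraycopy(body, 0, out, 2, body.length);
        return out;
    }

    // Decode a text string: UTF-16BE if it starts with 254, 255;
    // otherwise single-byte (ISO-8859-1 as a stand-in for PDFDocEncoding).
    public static String decodeTextString(byte[] raw) {
        boolean hasBom = raw.length >= 2
                && (raw[0] & 0xFF) == 0xFE
                && (raw[1] & 0xFF) == 0xFF;
        if (hasBom) {
            return new String(raw, 2, raw.length - 2, StandardCharsets.UTF_16BE);
        }
        return new String(raw, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        String name = "F[0]";
        byte[] unicode = encodeUnicode(name);

        // Byte-by-byte view that ignores the marker: each byte becomes one
        // character, and the zero bytes are shown as spaces for readability.
        String naive = new String(unicode, StandardCharsets.ISO_8859_1)
                .replace('\0', ' ');
        System.out.println("naive view:   " + naive);

        // Proper decoding gives back the same name for both encodings.
        System.out.println("from Unicode: " + decodeTextString(unicode));
        System.out.println("from 1-byte:  "
                + decodeTextString(name.getBytes(StandardCharsets.ISO_8859_1)));
    }
}
```

Comparing field names after decoding, rather than comparing the raw bytes, is what makes the two forms equal.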