philip wrote:
> Looking at the debug window I noticed that now the field names for the 
> new PDF have a strange pattern like (notice the "þÿ" after each period 
> and the spaces bettween characters):
> 
> þÿ F [ 0 ].þÿ P 1 [ 0 ].þÿ c a n a d i a n _ p r o v i n c e s [ 0 ]
> 
> instead of
> F[0].P1[0].canadian_provinces[0]

That's like saying: yesterday there were 25 pupils in
my classroom, today there are 25 students. What's different?

The þÿ should ring a bell:

 From the PDF Reference: "for text strings encoded in Unicode,
the first two bytes must be 254 followed by 255. These two
bytes represent the Unicode byte order marker, U+FEFF, indicating
that the string is encoded in the UTF-16BE (big-endian) encoding
scheme specified in the Unicode standard. (This mechanism precludes
beginning a string using PDFDocEncoding with the two characters thorn
ydieresis, which is unlikely to be a meaningful beginning of a word
or phrase)."

In other words: the content of both Strings is identical,
but the encoding is different.

br,
Bruno

-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions
Buy the iText book: http://itext.ugent.be/itext-in-action/

Reply via email to