Hello, I came across a PDF file created using NeoOffice 3.0, which uses some special glyphs for ligatures. Copying text or text extraction with PDFBox (1.6) yields results like "…umstrien, was die Unsierheit…". The additional glyphs can be mapped using the table
ft;E039 ch;E03B tt;E03C Qu;E048 Th;E049 I want to ask if anybody knows anything about ligatures in the OpenOffice family: e.g. is this a standard behaviour or will every new PDF file create new numbers for these glyphs (the codes are in the "Private Use" area), or wether there is a table somewhere for the glyphs used in OpenOffice. Any help would be appreciated. Best regards Thomas Fischer

