https://bugs.documentfoundation.org/show_bug.cgi?id=62846

--- Comment #38 from [email protected] ---
I would like to point out that the same bug exists with OpenType fonts as well.
If an OT font forms a ligature, the glyphs are associated with the wrong
characters. A ToUnicode CMap alone cannot reliably ensure that text is
reconstructed correctly from a PDF file. If you want reliability, the only way
is to use ActualText, and generating that will require development work on the
PDF writer.

The reason we can't get the results we want is that the PDF writer uses the
same glyph-to-character information that cursor positioning requires. When it
comes to ligatures, the two needs are inherently at odds: the PDF writer needs
to ignore the glyph, while cursor tracking needs to map it to the underlying
character.

This is a wider bug than just the Graphite integration.

For example, if you test the string ពពកកឿ with an OT font, text extraction
gives you ពពកកកក.
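
To make the mismatch concrete, here is a rough Python sketch of the two
strategies. It is a toy model, not LibreOffice code: the glyph names and the
way the last cluster splits into three glyphs are assumptions chosen to match
the output above, and the function names are made up for illustration. The
first function maps every glyph to the first character of its cluster, which
is roughly what cursor tracking wants; the second emits each cluster's source
text once, which is the kind of information ActualText would carry.

```python
# Toy model only: glyph names and the cluster split are assumptions.
TEXT = "\u1796\u1796\u1780\u1780\u17BF"  # the five-character test string

# (hypothetical glyph name, index of the first character of its cluster)
SHAPED = [
    ("pho", 0),
    ("pho", 1),
    ("ka", 2),
    ("vowel_pre_part", 3),   # assumed: the vowel sign is drawn in two parts
    ("ka", 3),
    ("vowel_post_part", 3),
]


def per_glyph_extract(text, shaped):
    """Map every glyph to the first character of its cluster."""
    return "".join(text[cluster] for _, cluster in shaped)


def per_cluster_extract(text, shaped):
    """Emit the source text of each cluster exactly once."""
    starts = sorted({cluster for _, cluster in shaped})
    ends = starts[1:] + [len(text)]
    return "".join(text[start:end] for start, end in zip(starts, ends))
```

Under these assumptions, per_glyph_extract(TEXT, SHAPED) returns six
characters with the vowel sign lost, while per_cluster_extract(TEXT, SHAPED)
returns the original string.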

If you want a workaround for this particular case in Linux Libertine G, then I
would use kerning to introduce the space rather than trying to insert a glyph.
