On Fri, Jun 24, 2011 at 09:40:12AM +0200, Ulrike Fischer wrote: > The dvi's generated by luatex (if I inspect them with dv2dt) look > actually quite similar to dvi's generated by latex. E.g. they > contain - if I use an unicode font and input an € - the entry "s1 > 8364" (8364 = hex 20AC). "Normal" dvi have similar entries - only > the numbers are smaller.
There is also the problem of unencoded glyphs (e.g. smallcaps or oldstyle figures in most OpenType fonts), IIRC the dvi will contain the internal PUA code points that luatex maps these glyphs to, I'm not sure if it is easy/possible to do the reverse mapping outside luatex (and even getting glyph indices is not enough as you need to map them back to Unicode code points using some heuristics based on glyph names). Regards, Khaled -- Khaled Hosny Egyptian Arab
