Besides a few exceptions, the alternate representation of a unicode
character defaults to printing the number entity unaltered.  This isn't
very pretty to look at and happens fairly regularly in the e-book world,
especially with Greek.

There are quite a few characters that we could set better alternates
for.  I'm particularly interested in Greek and some Latin Extended-B
characters that Esperanto uses because it would be handy for
Pluckerbooks.  Both of these languages have standard transliteration
(ascii substitution) schemes.

What do you think to basing the unknown_charref function in
TextParser.py on a more comprehensive dictionary instead of a big
if/else block?  I'll code it...

-- 
Dave Maddock <[EMAIL PROTECTED]>

_______________________________________________
plucker-dev mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-dev

Reply via email to