On 25.07.08, Steve Litt wrote: ... > I know nothing about unicode, so I'd need to be brought up to speed on that > before writing the program.
References: * The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) http://www.joelonsoftware.com/articles/Unicode.html * On the Goodness of Unicode http://www.tbray.org/ongoing/When/200x/2003/04/06/Unicode * Wikipedia article on Unicode http://en.wikipedia.org/wiki/Unicode > I assume unicode is a 16 bit representation of characters. Actually, unicode is a character <--> number mapping without an upper limit to the numbers. Once upon a time, 16 bit where enough to represent all defined unicode characters, but even then several different encodings into a computer readable format existed. Programs that relied on unicode == 16 bit (including LaTeX) have problems now with higher unicode numbers. Günter