On 25.07.08, Steve Litt wrote:
...
> I know nothing about unicode, so I'd need to be brought up to speed on that
> before writing the program.
References:
* The Absolute Minimum Every Software Developer Absolutely,
Positively Must Know About Unicode and Character Sets (No Excuses!)
http://www.joelonsoftware.com/articles/Unicode.html
* On the Goodness of Unicode
http://www.tbray.org/ongoing/When/200x/2003/04/06/Unicode
* Wikipedia article on Unicode
http://en.wikipedia.org/wiki/Unicode
> I assume unicode is a 16 bit representation of characters.
Actually, unicode is a character <--> number mapping without an upper
limit to the numbers.
Once upon a time, 16 bit where enough to represent all defined unicode
characters, but even then several different encodings into a computer
readable format existed. Programs that relied on unicode == 16 bit
(including LaTeX) have problems now with higher unicode numbers.
Günter