> I have a Unicode file containing the word "coeur", where the "oe" is a > ligature character having the Unicode value 0x153. I can open this > file and retrieve the proper data using "fconfigure $infile -encoding > unicode". > > However, if I try to translate this data to iso8859-1 using either > "encoding convertto iso8859-1 $data" or "fconfigure $outfile -encoding > iso8859-1", the "oe" character is converted to an ASCII "?" (0x3f).
Tcl 8.3.4 has all you need for multilingual support. What you are seeing is also correct. The "oe" ligature is not in iso8859-1 (see http://www.unicode.org/charts/). It is part of iso8859-15 or iso8859-16, mapped to \xBD. If you try this on Windows, it will show you 1/2, because \xBD is 1/2 for cp1252 (default Windows charset). Jeff Hobbs The Tcl Guy Senior Developer http://www.ActiveState.com/ Tcl Support and Productivity Solutions
