Dear Lu Song, I don't have much to add to Art's comments. "characters" in Unicon are all single-byte at present, I think this is without a particular 8-bit character set specified; interpretation is up to font selection during output. It might be somewhat difficult to get UTF-8 support within source code string constants, but you could write library procedures or classes that work on UTF-8 or other multi-byte and variable-byte string data internally. Such a contribution to the language's libraries would be welcome.
At some point in the past, Windows Unicon was modified to be able to print out a mixed-byte character set, in support of a Japanese user. There has been talk and thought about extending Unicon's built-in string data type for more than current 8-bit characters, but so far no volunteer or sponsor has taken it on. Clint _______________________________________________ Unicon-group mailing list Unicon-group@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/unicon-group