FYI: String comparison

2004-12-01 Thread Mark Davis
I just wanted to let people know that we have a new web tool available for doing basic Unicode string comparison, at http://oss.software.ibm.com/icu/demo/scompare.html Mark

Marking Elongated script in scholarly editions

2004-12-01 Thread Georg Vogeler
Hello everybody, in medieval german charters there are parts highlighted by using an elongated script, the so called Elongata. In scholarly editions it is usual to mark those section by a triple St. Andrew's Cross (like an x). As I didn't find anything like that at unicode.org I wonder if this

Re: Marking Elongated script in scholarly editions

2004-12-01 Thread Otto Stolz
Hallo, Georg Vogeler schrieb: a triple St. Andrew's Cross (like an x). As I didn't find anything like that at unicode.org I wonder if this character - or something likely - is already part of the unicode standard? Alas, U+3E1A has one cross too much ;-) What do you mean by triple: How are those

Nicest UTF

2004-12-01 Thread Theodore H. Smith
Assuming you had no legacy code. And no handy libraries either, except for byte libraries in C (string.h, stdlib.h). Just a C++ compiler, a blank page to draw on, and a requirement to do a lot of Unicode text processing. Apart from that the real world would still apply, you may want to use

Re: Nicest UTF

2004-12-01 Thread Marcin 'Qrczak' Kowalczyk
Theodore H. Smith [EMAIL PROTECTED] writes: Assuming you had no legacy code. And no handy libraries either, [...] What would be the nicest UTF to use? For internals of my language Kogut I've chosen a mixture of ISO-8859-1 and UTF-32. Normalized, i.e. a string with chracters which fit in narrow