Re: All-kana documents

2002-03-05 Thread Martin Duerst
all-kana documents (like, say, if I decide to encode some old women's literature, not that I will, but you might), is there an extension of UTF-8 that will alow me to strip off the redundant "this is kana" byte from most of the kana? After the first few thousand kana, it might be like,

Re: All-kana documents

2002-03-05 Thread Doug Ewell
. This is exactly what SCSU does best. Question: Are there really all-kana documents in the real world (other than children's books)? Or is this one of those exercises like writing an English-language novel without the letter E? -Doug Ewell Fullerton, California

All-kana documents

2002-03-04 Thread ろ〇〇〇〇 ろ〇〇〇
If I have some all-kana documents (like, say, if I decide to encode some old women's literature, not that I will, but you might), is there an extension of UTF-8 that will alow me to strip off the redundant "this is kana" byte from most of the kana? After the first few thousand kana,

Re: All-kana documents

2002-03-04 Thread Markus Scherer
You could - use SCSU (UTR 6) - use BOCU-1 (http://oss.software.ibm.com/cvs/icu/~checkout~/icuhtml/design/conversion/bocu1/bocu1.html) - invent your own... markus

Re: All-kana documents

2002-03-04 Thread Kenneth Whistler
If I have some all-kana documents , is there an extension of UTF-8 that will alow me to strip off the redundant this is kana byte from most of the kana? No After the first few thousand kana, it might be like, Yeah, we get it already! It's kana! It's KANA!! You can stop reminding us

Re: All-kana documents

2002-03-04 Thread Michael \(michka\) Kaplan
, March 04, 2002 3:47 PM Subject: All-kana documents If I have some all-kana documents (like, say, if I decide to encode some old women's literature, not that I will, but you might), is there an extension of UTF-8 that will alow me to strip off the redundant "this is kana" byte f