all-kana documents (like, say, if I decide to encode some
old women's literature, not that I will, but you might), is there an
extension of UTF-8 that will alow me to strip off the redundant "this is
kana" byte from most of the kana? After the first few thousand kana, it
might be like,
. This is exactly what SCSU does best.
Question: Are there really all-kana documents in the real world (other
than children's books)? Or is this one of those exercises like writing
an English-language novel without the letter E?
-Doug Ewell
Fullerton, California
If I have some all-kana documents (like, say, if I decide to encode some
old women's literature, not that I will, but you might), is there an
extension of UTF-8 that will alow me to strip off the redundant "this is
kana" byte from most of the kana? After the first few thousand kana,
You could
- use SCSU (UTR 6)
- use BOCU-1
(http://oss.software.ibm.com/cvs/icu/~checkout~/icuhtml/design/conversion/bocu1/bocu1.html)
- invent your own...
markus
If I have some all-kana documents , is there an
extension of UTF-8 that will alow me to strip off the redundant this is
kana byte from most of the kana?
No
After the first few thousand kana, it
might be like, Yeah, we get it already! It's kana! It's KANA!! You can
stop reminding us
, March 04, 2002 3:47 PM
Subject: All-kana documents
If I have some all-kana documents (like, say, if I decide to encode some
old women's literature, not that I will, but you might), is there an
extension of UTF-8 that will alow me to strip off the redundant "this is
kana" byte f
6 matches
Mail list logo