Yitzchak Gale wrote:
> Sean Leather wrote:
> > Which one do you use for strings in HTML or XML in which UTF-8 has become
> > the commonly accepted standard encoding?
>
> UTF-8 is only becoming the standard for non-CJK languages.
> We are told by members of our community in CJK countries
> that UTF-8 is not widely adopted there, and there is no sign that
> it ever will be. And one should be aware that the proportion of
> CJK in global Internet traffic is growing quickly.
>
So then, what is the standard? Not being familiar with this area, I googled a bit, and I don't see a consensus. But I also notably don't see UTF-16.

So, if this is the case, then a similar question still arises for CJK text: which format/library should one use for it (assuming one doesn't want a performance penalty for translating between Data.Text's internal format and the target format)?

It appears that there are no ideal answers to such questions.

Regards,
Sean
_______________________________________________
Haskell-Cafe mailing list
Haskell-Cafe@haskell.org
http://www.haskell.org/mailman/listinfo/haskell-cafe