Re: [Dspam-devel] Quick comment on non Western languages

Stevan Bajić Wed, 18 Nov 2009 13:12:03 -0800

On Wed, 18 Nov 2009 14:29:53 -0600
Kenneth Marshall <[email protected]> wrote:


> > Alexander
> > 
> I thought that UTF8, UTF-16 and UTF-32 can represent all the characters.
> In that case, why wouldn't you use the UTF8 equivalent? At the least it
> would save space.
> 
* UTF-16 and UTF-32 are not widely used.
* Wrongly coded UTF-8 can not be 100% saved (without loss) in UTF-16.
* The UTF-16 systems I know do not always display stuff in UTF-16 but show the 
symbol with 8-bit encodings.
* I am not aware of much applications using really UTF-16 for encoding. Let 
alone UTF-32. Out of my mind I can't remember seen any mail application 
handling UTF-16/UTF-32.
* UTF-16 and UTF-32 are not byte oriented and this makes thing really a big 
pain. It's not easy to handle that everywhere the proper way if the application 
sending the message is not appending the proper MIME charset (UTF-16E for 
(big-)endian encoding and UTF-16LE for little-endian).

I think in our case we could easy stay on UTF-8. I would say that we should 
think global (UTF-16/UTF-32) but act local (UTF-8). I think that we could easy 
handle those non western languages in the UTF-8 character space. I THINK. I am 
not sure. We should invest more time for that issue.


> Ken
> 
-- 
Kind Regards from Switzerland,

Stevan Bajić

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Dspam-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspam-devel

Re: [Dspam-devel] Quick comment on non Western languages

Reply via email to