Frank Yung-Fong Tang <ytang0648 at aol dot com> wrote: >> UTF-16 6,634,430 bytes >> UTF-8 7,637,601 bytes >> SCSU 6,414,319 bytes >> BOCU-1 5,897,258 bytes >> Legacy encoding (*) 5,477,432 bytes >> (*) KS C 5601, KS X 1001, or EUC-KR) > > What is the size of gzip these? Just wonder > gzip of UTF-16 > gzip of UTF-8 > gzip of SCSU > gzip of BOCU-1 > gzip of Legacy encoding
I don't have gzip, but I can give you the PKZip sizes, which should be quite similar: UTF-16 2,685,232 bytes UTF-8 2,774,356 bytes SCSU 2,756,470 bytes BOCU-1 2,772,418 bytes EUC-KR 2,518,201 bytes Note that the largest of these is only 10.2% larger than the smallest. -Doug Ewell Fullerton, California http://users.adelphia.net/~dewell/