Re: DBCS and Unicode 3.1

Markus Scherer Mon, 17 Feb 2003 16:41:39 -0800

Michael (michka) Kaplan wrote:

Well, DBCS means "double byte character set" and thus it is always two
bytes. But its a theoretical definition since there are no actual DBCS
code pages -- all of the ones that exist are MBCS (multibyte character
set) since they support both one-byte and two-byte characters.

More or less. There are systems that use pure DBCS, for example in databases, to get a fixed-width encoding. Usually those DBCS codepages are the double-byte portions of some MBCS codepages though.

There are standards like the Chinese GB18030 which supports characters
of 1, 2, or 4 bytes -- definitely MBCS again.

Other examples: There are EUC-JP (1/2/3 bytes per character) and EUC-CN (1/2/4 BpC) which are quite "old" (much older than GB 18030).

But these code pages are generally owned by outside
governments/agencies, so there is no rule that they need to update
when Unicode does.

Right. No one codepage _has_ to be upgraded just because another one is.


markus

--
Opinions expressed here may not reflect my company's positions unless otherwise noted.

Re: DBCS and Unicode 3.1

Reply via email to