BOCU-1 is now an IANA-registered charset: 
http://www.iana.org/assignments/character-sets

I thought it might be useful and interesting to show the list of Unicode charsets that 
are registered:

Charset name, MIBenum, aliases (if any *)

UTF-7    (MIBenum 1012)
UTF-8    (MIBenum  106)
UTF-16   (MIBenum 1015)
UTF-16BE (MIBenum 1013)
UTF-16LE (MIBenum 1014)
UTF-32   (MIBenum 1017)
UTF-32BE (MIBenum 1018)
UTF-32LE (MIBenum 1019)
CESU-8   (MIBenum 1016)
SCSU     (MIBenum 1011)
BOCU-1   (MIBenum 1020)
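
For anyone who wants these values in a program, here is a minimal sketch of the list above as a lookup table. The names UNICODE_CHARSET_MIBENUMS, mibenum_for and charset_for are made up for illustration and are not part of any registry or library; the numbers are copied from the list above. Lookup ignores case because IANA charset names are case-insensitive.

```python
# MIBenum values copied from the list above; the table and helper
# names are hypothetical, chosen only for this illustration.
UNICODE_CHARSET_MIBENUMS = {
    "UTF-7": 1012,
    "UTF-8": 106,
    "UTF-16": 1015,
    "UTF-16BE": 1013,
    "UTF-16LE": 1014,
    "UTF-32": 1017,
    "UTF-32BE": 1018,
    "UTF-32LE": 1019,
    "CESU-8": 1016,
    "SCSU": 1011,
    "BOCU-1": 1020,
}


def mibenum_for(name):
    """Return the MIBenum for a charset name, or None if it is not listed.

    The comparison is case-insensitive, so "utf-8" and "UTF-8" are the same.
    """
    return UNICODE_CHARSET_MIBENUMS.get(name.upper())


def charset_for(mibenum):
    """Return the primary charset name for a MIBenum, or None if not listed."""
    for name, value in UNICODE_CHARSET_MIBENUMS.items():
        if value == mibenum:
            return name
    return None
```

For example, mibenum_for("bocu-1") gives 1020 and charset_for(106) gives "UTF-8".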

There are also some (obsolete) Unicode 1.1 charsets registered:

UNICODE-1-1-UTF-7 (MIBenum  103) csUnicode11UTF7
UNICODE-1-1       (MIBenum 1010) csUnicode11 [this uses the UCS-2BE encoding scheme]

This is even more obsolete:

ISO-10646-UTF-1   (MIBenum  27) csISO10646UTF1 [Unicode 1.0? Very obsolete encoding 
scheme.]

The list also contains some registrations that are:
- encoding forms, not charsets
- repertoires, not charsets
- subsets of Unicode, with UCS-2BE encoding scheme

* There is always one implicit alias: "cs" + the primary charset name.
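
A rough sketch of that alias rule, with one assumption that goes beyond the sentence above: the aliases actually listed (csUnicode11UTF7, csUnicode11, csISO10646UTF1) drop the hyphens from the primary name and use their own capitalization, so the hypothetical helper below drops hyphens too and compares without regard to case. It is an illustration inferred from those three examples, not a rule taken from the registry.

```python
def implicit_alias(primary_name):
    """Build a "cs" alias from a primary charset name.

    Assumption: hyphens are dropped, as in the aliases listed above.
    The capitalization of the result is NOT guaranteed to match the
    registry, so compare with same_charset_name() instead of ==.
    """
    return "cs" + primary_name.replace("-", "")


def same_charset_name(a, b):
    """Compare two charset names or aliases, ignoring case."""
    return a.lower() == b.lower()


# The three aliases listed above, checked against the sketch.
listed = {
    "UNICODE-1-1-UTF-7": "csUnicode11UTF7",
    "UNICODE-1-1": "csUnicode11",
    "ISO-10646-UTF-1": "csISO10646UTF1",
}
for primary, alias in listed.items():
    assert same_charset_name(implicit_alias(primary), alias)
```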


Best regards,
markus

-- 
Opinions expressed here may not reflect my company's positions unless otherwise noted.

