#159: charset attribute
----------------------------+------------------------------
Reporter: bob.simons | Owner: cf-conventions@…
Type: enhancement | Status: new
Priority: medium | Milestone:
Component: cf-conventions | Version:
Keywords: |
----------------------------+------------------------------
In order to specify the character set of char and string variables,
I propose that we append this paragraph to the end of CF section 2.2:
All char and string variables must include a charset attribute to
identify the character set (encoding) used by the variable. The
value of the attribute must be the "Preferred MIME Name" or "Name"
of one of the 8-bit encodings (so not UTF-16 or UTF-32, since CF
chars are 8-bits) listed at
http://www.iana.org/assignments/character-sets/character-sets.xhtml .
Charset names are case-insensitive.
The only recommended charset names are "ISO-8859-1" (which is
useful for European languages and for backwards compatibility
with 7-bit ASCII characters) and "UTF-8" (which is useful when
full Unicode is needed). (In older files with variables that
don't specify a charset, the character set being used remains
ambiguous.)
--
Ticket URL: <http://cf-trac.llnl.gov/trac/ticket/159>
CF Metadata <http://cf-convention.github.io/>
CF Metadata