#159: charset attribute
----------------------------+------------------------------
 Reporter:  bob.simons      |      Owner:  cf-conventions@…
     Type:  enhancement     |     Status:  new
 Priority:  medium          |  Milestone:
Component:  cf-conventions  |    Version:
 Keywords:                  |
----------------------------+------------------------------
 To specify the character set of char and string variables,
 I propose that we append this paragraph to the end of CF section 2.2:

   All char and string variables must include a charset attribute to
   identify the character set (encoding) used by the variable. The
   value of the attribute must be the "Preferred MIME Name" or "Name"
   of one of the 8-bit encodings (not UTF-16 or UTF-32, since CF
   chars are 8 bits wide) listed at
   http://www.iana.org/assignments/character-sets/character-sets.xhtml .
   Charset names are case-insensitive.
   The only recommended charset names are "ISO-8859-1" (which is
   useful for European languages and for backwards compatibility
   with 7-bit ASCII characters) and "UTF-8" (which is useful when
   full Unicode is needed). (In older files with variables that
   don't specify a charset, the character set being used remains
   ambiguous.)
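
 As a rough illustration of how a reader might apply the proposed
 attribute, here is a minimal Python sketch. The helper names
 (decode_chars, is_recommended) are hypothetical and not part of CF or
 any library. It assumes the charset value is a name that Python's
 codecs module also recognizes, which holds for "ISO-8859-1" and
 "UTF-8" but is not guaranteed for every IANA name.

```python
import codecs

# The two charset names the proposal recommends, lowercased for comparison.
RECOMMENDED = {"iso-8859-1", "utf-8"}


def is_recommended(charset: str) -> bool:
    """Return True if the charset is one of the two recommended names."""
    return charset.strip().lower() in RECOMMENDED


def decode_chars(raw: bytes, charset: str) -> str:
    """Decode raw 8-bit char data using a variable's charset attribute."""
    name = charset.strip().lower()  # charset names are case-insensitive
    # The proposal excludes UTF-16 and UTF-32 because CF chars are 8 bits.
    if name.startswith(("utf-16", "utf-32")):
        raise ValueError(f"{charset!r} is not an 8-bit encoding")
    codecs.lookup(name)  # raises LookupError if Python has no such codec
    return raw.decode(name)


# The same two bytes read differently depending on the declared charset,
# which is the ambiguity the attribute is meant to remove.
as_utf8 = decode_chars(b"\xc3\xa9", "UTF-8")
as_latin1 = decode_chars(b"\xc3\xa9", "ISO-8859-1")
```

 Here as_utf8 is the single character U+00E9, while as_latin1 is the
 two characters U+00C3 U+00A9, so a file without the attribute cannot
 be decoded reliably.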

--
Ticket URL: <http://cf-trac.llnl.gov/trac/ticket/159>
CF Metadata <http://cf-convention.github.io/>