@Dave-Allured and @DocOtak,

1) Most of the character/string attributes in the CF conventions contain a 
concatenation of sub-strings selected from a standardized vocabulary, variable 
names, and some numbers and separator symbols. It seems that for those 
attributes the discuss about the encoding is not so relevant as these 
sub-strings contain only a very basic set of characters (assuming that variable 
names are not allowed to contain extended characters). Even for flag_meanings 
the CF conventions state "Each word or phrase should consist of characters from 
the alphanumeric set and the following five: '_', '-', '.', '+', '@'." If the 
alphanumeric set doesn't include extended characters this again doesn't create 
any problems for encoding. The only attributes that might contain extended 
characters (and thus be influenced by this encoding choice) are attributes like 
long_name, institution, title, history, ... However CF inherits most of them 
from the NetCDF User Guide which explicitly states that they should be stored 
as **character arrays** (see [NUG Appendix 
A](https://www.unidata.ucar.edu/software/netcdf/docs/attribute_conventions.html))
 So, is it then up to CF to allow strings here? In short, I'm not sure the 
encoding is important for string/character attributes at this moment.

2) I initially raised the encoding topic in the related issue #139 because we 
want our model users to use local names for observation points and they will 
end up in label variables. In that context I would like to make sure that what 
I store is properly described.

-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/cf-convention/cf-conventions/issues/141#issuecomment-407678139

Reply via email to