Regarding the encoding, I agree that for attribute contents which are 
standardised by CF it is fine to restrict ourselves to ASCII, in both `char` 
and `string`. For these attributes, we prescribe the possible values (they have 
controlled vocabulary) and so we don't need to make a rule in the convention 
about it for the sake of the *users* of the convention. If we put it in the 
convention, it would be as guidance for future *authors* of the convention. I 
don't have a view about whether we should do this. It would be worth noting to 
users that whitespace, which often appears in a "black-separated list of 
words", should be ASCII space. I agree that UTF-8 is fine for contents which 
aren't standardised.

Regarding arrays of strings, I realise I wasn't thinking clearly yesterday, 
sorry. As we've agreed, `string` attributes will not be expected by much 
existing software. Hence software has to be rewritten to support the use of 
strings in any case, and support for arrays of strings could be added at the 
same time, if it's really valuable. I don't see the particular value for the 
use of `string` arrays for `comment` - do other people? For `flag_meanings`, 
the argument was that it would allow a meaning to be a string which contained 
spaces (instead of being joined up with underscores, as is presently 
necessary); that is, it would be an enhancement to functionality.

Happy weekend - Jonathan


-- 
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub:
https://github.com/cf-convention/cf-conventions/issues/141#issuecomment-408441376

Reply via email to