Am Mittwoch, 16. August 2006 18:01 schrieb Lars Gullik Bjønnes:
> Georg Baum <[EMAIL PROTECTED]> writes:
> | Here comes the next bit: I discovered that the result of
> | 
> | std::vector<char> ucs4_to_utf8(boost::uint32_t c)
> | 
> | was never used as a vector. I changed it to std::string, and that 
simplifies
> | the code.
> 
> Perhaps it simplifies... but imho it also makes it more confusing...
> 
> | In particular it removes manual fiddling with the terminating
> | '\0', which we should not do IMHO.
> | 
> | Is this change OK with you?
> 
> Can you wait with it?

Yes.

> I am not sure at all that we want to confuse a multi-byte encoding
> into std::string.

Then I think we should not use utf8 at all but for output to external 
files. We have docstring for storing unicode, and I don't think we should 
invent (or get from some library) another string class for utf8.

Sure that will mean to touch a lot of code, but much of that is mechanical, 
and the advantage is that we do not need to do everything at once.


Georg

Reply via email to