On Wednesday, 16 August 2006 18:01, Lars Gullik Bjønnes wrote:
> Georg Baum <[EMAIL PROTECTED]> writes:
>
> | Here comes the next bit: I discovered that the result of
> |
> |     std::vector<char> ucs4_to_utf8(boost::uint32_t c)
> |
> | was never used as a vector. I changed it to std::string, and that simplifies
> | the code.
>
> Perhaps it simplifies... but imho it also makes it more confusing...
>
> | In particular it removes manual fiddling with the terminating
> | '\0', which we should not do IMHO.
> |
> | Is this change OK with you?
>
> Can you wait with it?

Yes.

> I am not sure at all that we want to confuse a multi-byte encoding
> into std::string.

Then I think we should not use UTF-8 at all except for output to external files. We have docstring for storing Unicode, and I don't think we should invent (or get from some library) another string class for UTF-8. Sure, that will mean touching a lot of code, but much of that is mechanical, and the advantage is that we do not need to do everything at once.

Georg
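
For illustration only, here is a minimal sketch of what a std::string-returning ucs4_to_utf8 could look like. It is not the code from the patch under discussion: the body is a plain textbook UTF-8 encoder, std::uint32_t stands in for boost::uint32_t so the snippet is self-contained, and returning an empty string for values above 0x10FFFF is an assumption made for this example.

```cpp
#include <cstdint>
#include <string>

// Hypothetical sketch, not the actual patch: encode one UCS-4 code point
// as UTF-8 and return the bytes as a std::string.
// Assumption: values above 0x10FFFF yield an empty string.
// Surrogate code points (0xD800..0xDFFF) are not rejected here.
std::string ucs4_to_utf8(std::uint32_t c)
{
    std::string out;
    if (c < 0x80) {
        // 1 byte: 0xxxxxxx
        out += static_cast<char>(c);
    } else if (c < 0x800) {
        // 2 bytes: 110xxxxx 10xxxxxx
        out += static_cast<char>(0xC0 | (c >> 6));
        out += static_cast<char>(0x80 | (c & 0x3F));
    } else if (c < 0x10000) {
        // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        out += static_cast<char>(0xE0 | (c >> 12));
        out += static_cast<char>(0x80 | ((c >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (c & 0x3F));
    } else if (c <= 0x10FFFF) {
        // 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
        out += static_cast<char>(0xF0 | (c >> 18));
        out += static_cast<char>(0x80 | ((c >> 12) & 0x3F));
        out += static_cast<char>(0x80 | ((c >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (c & 0x3F));
    }
    return out;
}
```

A caller that needs a NUL-terminated buffer can use c_str() on the result, so nothing has to append '\0' by hand. Note that size() on such a string counts bytes, not characters, which is the multi-byte concern raised above.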
