Hi!
It can be done through conversion error handlers. You can append an encoded form of a codepoint for such unassigned characters to the buffer within the handler.
OK, if so we may want to add implementation of this behavior to our ICU support.
And yes, it's worth providing separate conversion system. You might not be aware of it, but there are several sets of different character sets, each of which is often represented with a specific encoding scheme. Shift_JIS is one of those.
I'm not sure I understand. There are tons of character sets, etc. but as I understand ICU conversion routines handle them, including Shift_JIS - isn't it true?
What I am mainly interested in is 5.4, or something that will come before 6. BTW, it would be much better if there had been a sort of coordination between the developers of mbstring and intl extension.
I'm not sure what will happen about 5.4 etc. but sure I'd be glad to help as much as I could with anything regarding intl extension. DO you have some specific things that need to be done?
-- Stanislav Malyshev, Zend Software Architect s...@zend.com http://www.zend.com/ (408)253-8829 MSN: s...@zend.com -- PHP Internals - PHP Runtime Development Mailing List To unsubscribe, visit: http://www.php.net/unsub.php