On Mar 11, 2006, at 11:23 AM, Trustin Lee wrote:
Yep I noticed it by myself and fixed it again:http://svn.apache.org/viewcvs?rev=385102&view=rev
excellent, ty :)
The core of my concern is that the expectedLength is only based onthe average bytes per character, which for UTF-8 is only 1.1. The max is 4, so its not that hard for the expected length to be way off if you have many wide characters.I thought the max is 6. It seems like there's a wrong documentation: http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8 The official site claims that it takes up to four octets: http://www.utf-8.com/
Odd, I was just going by what the UTF-8 encoder said was its maximum, figuring that it wouldn't lie about what it was going to do :)
-pete -- [EMAIL PROTECTED] - http://fotap.org/~osi
smime.p7s
Description: S/MIME cryptographic signature
