On Mar 11, 2006, at 11:23 AM, Trustin Lee wrote:
Yep I noticed it by myself and fixed it again:

http://svn.apache.org/viewcvs?rev=385102&view=rev

excellent, ty :)

The core of my concern is that the expectedLength is only based on
the average bytes per character, which for UTF-8 is only 1.1. The max
is 4, so its not that hard for the expected length to be way off if
you have many wide characters.


I thought the max is 6.  It seems like there's a wrong documentation:

http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8

The official site claims that it takes up to four octets:

http://www.utf-8.com/

Odd, I was just going by what the UTF-8 encoder said was its maximum, figuring that it wouldn't lie about what it was going to do :)
-pete

--
[EMAIL PROTECTED] - http://fotap.org/~osi


Attachment: smime.p7s
Description: S/MIME cryptographic signature

Reply via email to