Hm, can't quite recall why we do it that way. I agree that we should focus on getting 2.1 out the door and fix this for a 2.2 release. At that point I'd like us to lazy-load the various entity tables, such as HTML32, which is not used directly right now.
I'm out of town this week for work, so I can't get into the meat of this one.

Gary

-----Original Message-----
From: Henri Yandell [mailto:[EMAIL PROTECTED]
Sent: Tuesday, March 15, 2005 8:47 PM
To: Jakarta Commons Users List; [EMAIL PROTECTED]
Subject: Re: [lang] Bugs of StringEscapeUtils.escapeHtml()?

Relatively clueless about the code (Gary and Alex were the main authors, Alex isn't very active now).

If I'm understanding correctly, then the issue is that the code is escaping values that are not specifically listed in the spec listed below.

Scanning the code, it looks like the cause is on line 611 of the current version, if entityName is null (ie it is not specified) and it is > 0x7f, then it escapes the value as a number. The same issue I presume is in the unescape side.

Given that the code is pretty focused on display by a browser, failure to fit IE and Firefox's feature-set seems like a pretty critical problem.

Still. I think it's going to be better to work on a 2.1 release and then focus on this issue, than to try and squeeze this one in just before release. If you can provide a patch and unit test, then I'm sure we'll be able to apply it to head as soon as the release goes out. Otherwise a bug report would still be great and it'll be gotten to eventually.

Hen

On Wed, 9 Mar 2005 11:28:17 +0800, Carfield Yim <[EMAIL PROTECTED]> wrote:
> I guess so.... seem IE and firefox won't change the escaped multibyte
> characters back to human readable character to our user. And see it is
> not fit to the specification?
>
> http://www.w3.org/TR/REC-html40/sgml/entities.html
>
> On Tue, 8 Mar 2005 22:14:39 -0000, Stephen Colebourne
> <[EMAIL PROTECTED]> wrote:
> > Is that a problem?
> > Stephen
> >
> > ----- Original Message -----
> > From: "Carfield Yim" <[EMAIL PROTECTED]>
> > > Other than escapeHTML entities like space and < , all multibyte
> > > Characters like Chinese, Japanese and Russian will escaped.
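
For readers following the thread, the behavior Henri describes (no entity name found and the character is above 0x7f, so it is written as a numeric reference) can be sketched roughly as below. This is not the Commons Lang source: the class name `EscapeSketch`, the small `NAMES` table, and the `numericFallback` flag are all hypothetical, introduced only to illustrate the branch under discussion and what the output might look like if that branch were skipped.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; not the actual StringEscapeUtils code.
final class EscapeSketch {
    // Tiny stand-in for the entity table; the real table is much larger.
    private static final Map<Integer, String> NAMES = new HashMap<>();
    static {
        NAMES.put((int) '<', "lt");
        NAMES.put((int) '>', "gt");
        NAMES.put((int) '&', "amp");
        NAMES.put((int) '"', "quot");
        NAMES.put(0xE9, "eacute"); // e-acute, a named entity in the HTML 4.0 list
    }

    // numericFallback is an assumed switch: true mimics the behavior
    // described in the thread, false leaves unnamed characters untouched.
    static String escape(String in, boolean numericFallback) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < in.length(); i++) {
            char c = in.charAt(i);
            String name = NAMES.get((int) c);
            if (name != null) {
                // Character has a named entity: write &name;
                out.append('&').append(name).append(';');
            } else if (numericFallback && c > 0x7F) {
                // No name and above 0x7f: write a decimal numeric reference
                out.append("&#").append((int) c).append(';');
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String s = "<p>\u4E2D\u6587</p>"; // two Chinese characters inside a tag
        System.out.println(escape(s, true));  // numeric references for the CJK text
        System.out.println(escape(s, false)); // CJK text passed through unchanged
    }
}
```

With the fallback on, the two Chinese characters come out as `&#20013;&#25991;`; with it off, only the characters that have named entities are rewritten.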
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
