Hm, can't quite recall why we do it that way. I agree that we should
focus on getting 2.1 out the door and fix this for a 2.2 release. At
that point I'd like us to lazy load the various tables like HTML32 which
is not used directly right now.

I'm out of town this week for work, so I can't get into the meat of this
one.

Gary

-----Original Message-----
From: Henri Yandell [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, March 15, 2005 8:47 PM
To: Jakarta Commons Users List; [EMAIL PROTECTED]
Subject: Re: [lang] Bugs of StringEscapeUtils.escapeHtml()?

Relatively clueless about the code (Gary and Alex were the main
authors, Alex isn't very active now).

If I'm understanding correctly, then the issue is that the code is
escaping values that are not specifically listed in the spec listed
below. Scanning the code, it looks like the cause is on line 611 of
the current version, if entityName is null (ie it is not specified)
and it is > 0x7f, then it escapes the value as a number. The same
issue I presume is in the unescape side.

Given that the code is pretty focused on display by a browser, failure
to fit IE and Firefox's feature-set seems like a pretty critical
problem.

Still. I think it's going to be better to work on a 2.1 release and
then focus on this issue, than to try and squeeze this one in just
before release. If you can provide a patch and unit test, then I'm
sure we'll be able to apply it to head as soon as the release goes
out. Otherwise a bug report would still be great and it'll be gotten
to eventually.

Hen

On Wed, 9 Mar 2005 11:28:17 +0800, Carfield Yim <[EMAIL PROTECTED]>
wrote:
> I guess so.... seem IE and firefox won't change the escaped multibyte
> characters back to human readable character to our user. And see it is
> not fit to the specification?
> 
> http://www.w3.org/TR/REC-html40/sgml/entities.html
> 
> On Tue, 8 Mar 2005 22:14:39 -0000, Stephen Colebourne
> <[EMAIL PROTECTED]> wrote:
> > Is that a problem?
> > Stephen
> >
> > ----- Original Message -----
> > From: "Carfield Yim" <[EMAIL PROTECTED]>
> > > Other than escapeHTML entities like space and < , all multibyte
> > > Characters like Chinese, Japanese and Russian will escaped.
> >
> >
---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail:
[EMAIL PROTECTED]
> >
> >
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
>

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to