Hi,

From: Ian Hickson <[EMAIL PROTECTED]>
On Mon, 14 Aug 2006, Simon Pieters wrote:
>
> I guess that for compat with IE and the Web[1] we have to treat
> "R&eacutesum&eacute" as if it were "R&eacute;sum&eacute;". So how do we
> handle "&noti;"? When the parser has come as far as "&not" it can't
> return U+00AC yet because it could well be "&notin;". But when it has
> reached "&noti;" then it can't be "&notin;", thus it returns U+00AC, but
> then you also have to reparse the "i;", right? Unless I'm mistaken the
> spec doesn't say anything about that.

Section 8.2.3.1 "Tokenising entities", under "Anything else", covers this:
"Consume the maximum number of characters possible, with the consumed
characters case-sensitively matching one of the identifiers in the first
column of the entities table".

I've read that several times. But it doesn't say that "i;" in "&noti;" is going to be reparsed. ("noti" isn't an entity, but "not" and "notin" are.)

Regards,
Simon Pieters


Reply via email to