In the charset meta sniffing algorithm under "Attribute name:":

    If it is 0x2F (ASCII '/'), 0x3C (ASCII '<'), or 0x3E (ASCII '>')
        Stop looking for an attribute. The attribute's name is the
        value of attribute name, its value is the empty string.

More generally, it seems to me the algorithm does not clearly
distinguish between two cases: when to stop looking for the current
attribute, and when to stop looking for attributes on the current tag
altogether.
In this step, it seems to me that '/' should advance the pointer and
end getting the current attribute, after which the algorithm goes on
to get another attribute. '>' should end getting attributes for the
whole tag without changing the pointer. '<' should move the pointer
back one step and end getting attributes for the whole tag.
Correct?
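
To make the proposal concrete, here is a rough Python sketch of the
three cases as I read them. The function name, the outcome constants
and the (outcome, new position) return shape are hypothetical and are
not taken from the spec; only the pointer movements come from the
paragraph above.

```python
# Hypothetical outcome labels; these names are illustrative, not the spec's.
NEXT_ATTRIBUTE = "next attribute"  # finish this attribute, then get another one
END_OF_TAG = "end of tag"          # stop getting attributes for this tag


def end_attribute_name(data: bytes, pos: int):
    """Apply the proposed handling to the byte at data[pos].

    Returns (outcome, new_pos), or None when the byte is not one of the
    three terminators discussed above.
    """
    byte = data[pos]
    if byte == 0x2F:  # '/': advance past it and go look for another attribute
        return NEXT_ATTRIBUTE, pos + 1
    if byte == 0x3E:  # '>': the tag is finished; the pointer stays on '>'
        return END_OF_TAG, pos
    if byte == 0x3C:  # '<': the tag is finished; step back one byte
        return END_OF_TAG, pos - 1
    return None
```

In all three cases the attribute's name would be whatever has been
collected so far and its value the empty string, as the quoted step
already says. The sketch only covers what happens to the pointer and
whether attribute processing continues.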
--
Henri Sivonen
http://hsivonen.iki.fi/