For XHTML pages, which the Brighthand pages claim to be, JPluck uses the XML
parser, which explains the error. XML, whether it's XHTML, RSS or anything
else, must be well-formed, and every entity must be declared in the DTD
(apart from the five predefined ones such as &amp;). Otherwise the page
won't parse at all.

That's interesting. Wired is also an XHTML site, but it lets a bare & slip through in its news articles as well. I think we might need a "Parse as HTML" option, or else catch the XML exception and then automatically re-parse the page through JTidy?
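
To make the second idea concrete, here is a rough sketch of the catch-and-retry approach using only the JDK's own XML parser. It is not JPluck's actual code: the class name FallbackParser, the Result and Mode types, and the lenientParser argument are all made up for illustration, and the lenient parser is passed in as a plain function so that the sketch does not depend on any particular JTidy call.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.function.Function;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of JPluck: try strict XML first, then fall back.
final class FallbackParser {
    /** Records which parser actually produced the document. */
    public enum Mode { XML, HTML_FALLBACK }

    public static final class Result {
        public final Document document;
        public final Mode mode;

        Result(Document document, Mode mode) {
            this.document = document;
            this.mode = mode;
        }
    }

    /**
     * Parses the page as XML. If the page is not well-formed (for example a
     * bare '&' or an undeclared entity), hands the same bytes to lenientParser,
     * which is assumed to be an HTML-tolerant parser such as a JTidy wrapper.
     */
    public static Result parse(byte[] page, Function<byte[], Document> lenientParser) {
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            // DefaultHandler rethrows fatal errors instead of printing them to stderr.
            builder.setErrorHandler(new DefaultHandler());
            Document doc = builder.parse(new ByteArrayInputStream(page));
            return new Result(doc, Mode.XML);
        } catch (SAXException notWellFormed) {
            // Well-formedness errors surface here; retry with the lenient parser.
            return new Result(lenientParser.apply(page), Mode.HTML_FALLBACK);
        } catch (ParserConfigurationException e) {
            throw new IllegalStateException(e);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

In practice the lenientParser argument would presumably wrap JTidy, and the returned Mode could be logged so it is obvious which sites claim XHTML but only parse as HTML. An explicit "Parse as HTML" option would then just skip the first attempt.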

Ed


