Re: character sets in HTML files?

David A. Desrosiers Thu, 18 Oct 2001 20:38:11 -0700


> Should plucker just parse XML and feed non-xml stuff to tidy to
> reformat?  Just an idea to simplify things.  I think it simplifies
> things, at least.


        Remember, implementing an XML parser is no trivial matter. If the
XML page or application fails validation, the page is bitbucketed. In the
current scheme, Plucker tries to make sense of what's left of the broken
HTML, but with XML, that's not allowed.

> Indeed, but I thought XML was in unicode?  Or did I dream that? Probably
> did, as I'm sure I've seen encoding="iso-8859-1" in some files,
> actually.

        It is indeed unicode, however, you can override it.


/d

Re: character sets in HTML files?

Reply via email to