On Fri, 2003-02-14 at 13:54, David A. Desrosiers wrote:
> > is the current Python parser (from CVS) and the V1.2 viewer supposed to be
> > able to handle German umlauts in UTF-8 encoding correctly?
>
> Yes, except the webpage below doesn't properly encode them, so the
> parser fails (as does the three web validators I just pointed to the page).

Which validators have you tried? I tried http://validator.w3.org/ and
http://www.htmlhelp.com/tools/validator/, and neither of them complains
about the character encoding, "only" about bad HTML.
http://validator.w3.org/ had problems parsing/downloading the URL, but when
I gave it a copy of the page in a local file plus the character encoding,
it was more or less okay.

> I would email the maintainer/author of the page and suggest that they
> properly encode their entities[1], and then Plucker will support them.
>
> Example: they use ä for example, when they should be using &auml;

I haven't verified whether their encoding of the ä is correct, but it
seems to be, because it comes up correctly in Galeon unless I override the
encoding.

Although I really don't know much about valid HTML, I agree with MJR: my
gut feeling is that the whole purpose of choosing a character set is not
having to use entities for characters in the set.

-- 
Freundliche Gruesse / Best Regards

Patrick Ohly
Senior Software Engineer
--------------------------------------------------------------------
//// pallas
Pallas GmbH / Hermuelheimer Str. 10 / 50321 Bruehl / Germany
[EMAIL PROTECTED] / www.pallas.com
Tel +49-2232-1896-30 / Fax +49-2232-1896-29
--------------------------------------------------------------------
_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

