On Wed, May 03, 2006 at 02:29:29PM +0200, Olivier Sirven wrote:
> Le Mardi 2 Mai 2006 23:51, A. Pagaltzis a écrit :
> > You don’t. But if it’s data on the English or French part of the
> > web, then invalid bytes are ISO-8859-1 with 99.999% certainty.
> Yes but I have to handle xml feeds in various languages (english, french, 
> japanese, ...) so I can not rely on this postulate.
> My problem is that a well known blog software like wordpress can generate 
> invalid xml feeds so I guess I will simply reject them as Daniel told me...

  It's a terribly hard battle to fight unfortunately, but it's the right
thing to do :-\

Daniel

-- 
Daniel Veillard      | Red Hat http://redhat.com/
[EMAIL PROTECTED]  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml

Reply via email to