In my older work image, the following just works:
XMLDOMParser parse:
('http://forum.world.st/file/n4908531/illegal-UTF-sms.xml' asUrl
retrieveContents).
But I guess that is because my (older) XML parser version ignores the encoding,
or is more lenient.
You could try to edit the incoming file, or have a look at #decodesCharacters:
(XMLDOMParser on:
('http://forum.world.st/file/n4908531/illegal-UTF-sms.xml' asUrl
retrieveContents) readStream) decodesCharacters: false; parseDocument.
But I am no expert in the deeper aspects of XML Support.
> On 28 Jul 2016, at 22:29, Sean P. DeNigris <[email protected]> wrote:
>
> Sven Van Caekenberghe-2 wrote
>> Your XML file is not UTF-8 encoded, it is plain Unicode. At least the way
>> it is served from the URL you gave.
>> ..
>> You see ?
>
> Unfortunately, no! ha ha. I didn't generate the file and I took it's
> assertion that it was UTF-8 at face value. How do I properly feed the file
> into XMLParser?
>
>
>
> -----
> Cheers,
> Sean
> --
> View this message in context:
> http://forum.world.st/XMLParser-Claims-U-00A0-is-Invalid-UTF-8-tp4908525p4908539.html
> Sent from the Pharo Smalltalk Users mailing list archive at Nabble.com.
>