Re: [MBS] Check and replacing non valid UTF8 characters

Christian Schmitz Tue, 12 Feb 2013 04:52:05 -0800

Am 12.02.2013 um 12:14 schrieb Lee Badham <[email protected]>:

> More Info…
> 
> To be clear, I want to keep as much of the original string as possible, so if 
> most of the string is in Chinese (but valid UTF-8) that is ok.
> 
> Some of the files I get have Various Chinese encodings and sometimes the 
> universal parser gueses the encoding wrong.
>


Well, it should be possible to develop an algorithm which checks UTF-8 validity 
and replaces invalid sequences with a filler character.

Greetings
Christian

-- 
Read our blog about news on our plugins:

http://www.mbsplugins.de/

_______________________________________________
Mbsplugins_monkeybreadsoftware.info mailing list
[email protected]
https://ml01.ispgateway.de/mailman/listinfo/mbsplugins_monkeybreadsoftware.info

Re: [MBS] Check and replacing non valid UTF8 characters

Reply via email to