On 12.02.2013 at 12:14, Lee Badham <[email protected]> wrote:

> More info…
>
> To be clear, I want to keep as much of the original string as possible, so if
> most of the string is in Chinese (but valid UTF-8), that is OK.
>
> Some of the files I get have various Chinese encodings, and sometimes the
> universal parser guesses the encoding wrong.
>
Well, it should be possible to develop an algorithm that checks UTF-8 validity and replaces invalid sequences with a filler character.

Greetings
Christian

-- 
Read our blog about news on our plugins:
http://www.mbsplugins.de/
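
A minimal sketch of such an algorithm, written in Python purely for illustration (the thread names no language, and this is not code from the MBS plugins). The function name `sanitize_utf8` and the `filler` parameter are hypothetical. It walks the bytes, keeps every well-formed UTF-8 sequence (including Chinese text), and replaces each byte that cannot start or continue a valid sequence with the filler character:

```python
def sanitize_utf8(data: bytes, filler: str = "?") -> str:
    """Decode bytes as UTF-8, replacing each invalid byte with `filler`.

    Hypothetical helper for illustration only.
    """
    out = []
    i = 0
    n = len(data)
    while i < n:
        b = data[i]
        # Determine expected sequence length from the lead byte.
        if b < 0x80:
            length, cp, min_cp = 1, b, 0x0
        elif 0xC2 <= b <= 0xDF:
            length, cp, min_cp = 2, b & 0x1F, 0x80
        elif 0xE0 <= b <= 0xEF:
            length, cp, min_cp = 3, b & 0x0F, 0x800
        elif 0xF0 <= b <= 0xF4:
            length, cp, min_cp = 4, b & 0x07, 0x10000
        else:
            # Stray continuation byte or a byte that is never a valid lead.
            out.append(filler)
            i += 1
            continue

        tail = data[i + 1:i + length]
        # Truncated sequence, or a tail byte that is not 10xxxxxx.
        if len(tail) != length - 1 or any((t & 0xC0) != 0x80 for t in tail):
            out.append(filler)
            i += 1
            continue

        for t in tail:
            cp = (cp << 6) | (t & 0x3F)

        # Reject overlong forms, surrogates, and values beyond U+10FFFF.
        if cp < min_cp or cp > 0x10FFFF or 0xD800 <= cp <= 0xDFFF:
            out.append(filler)
            i += 1
            continue

        out.append(chr(cp))
        i += length
    return "".join(out)


if __name__ == "__main__":
    mixed = "中文".encode("utf-8") + b"\xff" + b"abc"
    print(sanitize_utf8(mixed))
```

In Python itself the built-in `data.decode("utf-8", errors="replace")` performs a similar job, substituting U+FFFD for malformed input; the hand-written loop is shown only because it can be ported to other languages.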
