On Sat, Nov 01, 2008 at 01:51:42AM +0800, Ray Chuan wrote:
> using an edonkey client, which has a function to convert file names to
> url-friendly strings (aka ed2k links), i was able to see that "é"
> showed up as %C3%83%C2%A9, while the more complex "专辑"
> (专辑) would be %C3%A4%C2%B8%C2%93%C3%A8%C2%BE%C2%91.

You converted twice to UTF-8, so you have to go back once.

(é is U+00E9, which is 11000011 10101001 (C3 A9) in UTF-8. If you read
those two bytes as Latin-1 and convert them to UTF-8 once more, you get
11000011 10000011 11000010 10101001, that is, %C3%83%C2%A9 as you reported.)

-- 
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/
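
The double conversion described above, and the single step back that undoes it, can be sketched in Python. This is only an illustration under stated assumptions: the helper names double_encode and undo_double_encoding are made up for this example, and nothing here claims to reflect what the edonkey client does internally. It assumes the intermediate misreading was plain Latin-1 (ISO-8859-1), which is consistent with the reported %C2%93 and %C2%91 bytes.

```python
from urllib.parse import quote, unquote_to_bytes


def double_encode(text: str) -> bytes:
    """Reproduce the bug: encode to UTF-8, misread the bytes as Latin-1,
    then encode the misread text to UTF-8 a second time.
    (Hypothetical helper, named for this example only.)"""
    utf8_once = text.encode("utf-8")
    misread = utf8_once.decode("latin-1")  # each byte becomes one character
    return misread.encode("utf-8")


def undo_double_encoding(data: bytes) -> str:
    """Go back once: decode the outer UTF-8 layer, map each character
    back to a single byte via Latin-1, then decode the real UTF-8.
    (Hypothetical helper, named for this example only.)"""
    outer = data.decode("utf-8")
    original_bytes = outer.encode("latin-1")
    return original_bytes.decode("utf-8")


if __name__ == "__main__":
    for sample in ("é", "专辑"):
        broken = double_encode(sample)
        # Percent-encode the doubly encoded bytes, as a URL-style link would.
        print(sample, "->", quote(broken))
        # Start from the percent-encoded form and recover the original text.
        recovered = undo_double_encoding(unquote_to_bytes(quote(broken)))
        print("recovered:", recovered)
```

If the intermediate step had been Windows-1252 rather than Latin-1, bytes such as 0x93 would map to different characters and the "latin-1" codec name above would need to change accordingly.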