Francois Piette wrote:
>> Yes, if someone has Apache or a newer IIS installed he could help.
>> Create a file name with characters not in current ANSI code page by
>> copy those characters from the Windows application charmap.exe.
>> Than start a packet sniffer and log a directory listing.
> 
> Using IIS6 on W2K3.

Thanks!

> The twothird character (U+2154) is sent in the dirlist as 3
> characters : 0xE2 0x85 0x94. In the href link, the 3 characters are
> expressed as %e2%85%94 

That's UTF-8 URL-encoded.

> while they are binary in the text itself.

The twothird character is not 'encoded' either as "⅔" (decimal) or
as "⅔" (hex)? If so, IIS sends plain UTF-16! 

> There is nothing in the html header to tell which code page or
> charset is used. --

Browsers seem to be very good in detecting the correct character set
nowadays.

--
Arno Garrels
-- 
To unsubscribe or change your settings for TWSocket mailing list
please goto http://lists.elists.org/cgi-bin/mailman/listinfo/twsocket
Visit our website at http://www.overbyte.be

Reply via email to