Re: [twsocket] URL encoding

2008-09-28 Thread Francois PIETTE
Can somebody confirm that characters above #127 have to be encoded UTF-8 first before they are percent-encoded? If that's correct, Url.pas was and is currently buggy. When I use IE to get the url http://www.myhost.com/Fête (note the lowercase e with circumflex), it sends GET /F%C3%AAte to the

Re: [twsocket] URL encoding

2008-09-28 Thread Arno Garrels
Francois PIETTE wrote: Can somebody confirm that characters above #127 have to be encoded UTF-8 first before they are percent-encoded? If that's correct, Url.pas was and is currently buggy. When I use IE to get the url http://www.myhost.com/Fête (note the lowercase e with circumflex), it

Re: [twsocket] URL encoding

2008-09-28 Thread DZ-Jay
On Sep 27, 2008, at 12:14, Arno Garrels wrote: Can somebody confirm that characters above #127 have to be encoded UTF-8 first before they are percent-encoded? If that's correct, Url.pas was and is currently buggy. I can't find anything specific in the HTTP and URI RFCs regarding this

Re: [twsocket] URL encoding

2008-09-28 Thread Arno Garrels
DZ-Jay wrote: I've seen UTF-8 used all the time (and that's what I've used, too), and in fact that's probably what IE uses--but I can't find it anywhere specified as the HTTP protocol character set--unless I'm missing something. It may be that UTF-8, by convention or tradition, is the de

Re: [twsocket] URL encoding

2008-09-28 Thread Fastream Technologies
I can confirm that both browsers also encode non-ANSI Turkish characters as UTF-8: ğ = %C4%9F This is the soft g used in Turkish. Regards, SZ On Sun, Sep 28, 2008 at 4:49 PM, Arno Garrels [EMAIL PROTECTED] wrote: DZ-Jay wrote: I've seen UTF-8 used all the time (and that's what

Re: [twsocket] URL encoding

2008-09-28 Thread DZ-Jay
On Sep 28, 2008, at 09:49, Arno Garrels wrote: It doesn't seem to be mandatory, but using UTF-8 has been suggested since January 2005 (RFC 3986). Thank you! For some reason I missed that 3986 obsoletes 2396. dZ. -- DZ-Jay [TeamICS]
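
For readers following the thread, here is a minimal sketch of the behaviour being discussed: convert the text to UTF-8 bytes first, then percent-encode every byte that is not an unreserved character. It is written in Python purely for illustration and is not the code in Url.pas or OverbyteIcsUrl.pas; the function name `percent_encode` and its `charset` parameter are made up for this example. The two inputs are the ones quoted in the thread (Fête and ğ).

```python
from urllib.parse import quote

# Unreserved characters per RFC 3986 section 2.3; these are never escaped.
UNRESERVED = frozenset(
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    "abcdefghijklmnopqrstuvwxyz"
    "0123456789-._~"
)


def percent_encode(text: str, charset: str = "utf-8") -> str:
    """Hypothetical helper: encode text to bytes, then escape each byte.

    The charset decides which bytes get escaped; UTF-8 is what the
    browsers in the thread were observed to send.
    """
    out = []
    for byte in text.encode(charset):
        ch = chr(byte)
        if ch in UNRESERVED:
            out.append(ch)               # plain ASCII letter, digit or -._~
        else:
            out.append(f"%{byte:02X}")   # everything else becomes %XX
    return "".join(out)


if __name__ == "__main__":
    print(percent_encode("Fête"))              # UTF-8 first, then escape
    print(percent_encode("ğ"))                 # the Turkish soft g
    print(percent_encode("Fête", "latin-1"))   # single-byte charset, for contrast
    print(quote("Fête", safe=""))              # standard library equivalent
```

With UTF-8 the ê becomes the two bytes C3 AA, so the path matches the GET /F%C3%AAte request reported for IE. Escaping the single Latin-1 byte instead yields F%EAte, which is the kind of mismatch the original question is about.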

Re: [twsocket] URL encoding

2008-09-28 Thread Arno Garrels
DZ-Jay wrote: On Sep 28, 2008, at 09:49, Arno Garrels wrote: It doesn't seem to be mandatory, but using UTF-8 has been suggested since January 2005 (RFC 3986). Thank you! For some reason I missed that 3986 obsoletes 2396. If you are interested, I just checked in my UTF-8 changes (v7). The

Re: [twsocket] URL encoding

2008-09-27 Thread Arno Garrels
Arno Garrels wrote: Hi, Can somebody confirm that characters above #127 have to be encoded UTF-8 first before they are percent-encoded? If that's correct, Url.pas was and is currently buggy. Same or similar functions are used in the HTTP server. I have a fix for OverbyteIcsUrl.pas but won't

Re: [twsocket] URL encoding

2008-09-27 Thread Arno Garrels
Fastream Technologies wrote: I think this bug could be a reason why our web server customers have been complaining about HTML file manipulation in non-ANSI... Can you confirm that it's a bug? -- Arno Garrels On Sat, Sep 27, 2008 at 7:14 PM, Arno Garrels [EMAIL PROTECTED] wrote:

Re: [twsocket] URL encoding

2008-09-27 Thread Fastream Technologies
I am not sure there is a test scenario with 100% coverage. Our server is now free, so why don't you test it yourself? (http://www.fastream.com/iqwebftpserver.php) Regards, Gorkem On Sat, Sep 27, 2008 at 7:38 PM, Arno Garrels [EMAIL PROTECTED] wrote: Fastream Technologies wrote: I think this bug could be a