Re: ap_unescape_url can't escape %uXXXX

2003-01-28 Thread IKEBE Tomohiro

I have written a little patch for libapreq which unescapes
%u style URI-escaped strings.
The unescape algorithm is based on CGI.pm.
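
For readers without the attachment, here is a rough sketch of the general idea in C. It is not the attached patch and not the real ap_unescape_url(); the names parse_hex and sketch_unescape_url are hypothetical. It assumes each %uXXXX holds one 16-bit code unit that is written out as UTF-8, it does not combine surrogate pairs, and it leaves %00 and %u0000 untouched so the result stays a valid C string.

```c
#include <assert.h>
#include <ctype.h>
#include <string.h>

/* Parse exactly n hex digits at s. Returns 1 and stores the value on success. */
static int parse_hex(const char *s, int n, unsigned *out)
{
    unsigned v = 0;
    int i;

    for (i = 0; i < n; i++) {
        unsigned char c = (unsigned char)s[i];
        if (!isxdigit(c))
            return 0;               /* also stops at the terminating NUL */
        v = v * 16 + (isdigit(c) ? (unsigned)(c - '0')
                                 : (unsigned)(tolower(c) - 'a') + 10);
    }
    *out = v;
    return 1;
}

/* Hypothetical helper: decode %XX and %uXXXX in place, emitting UTF-8 for %u.
 * The output never outgrows the input: "%uXXXX" is 6 bytes, while a 16-bit
 * value needs at most 3 bytes of UTF-8. Surrogate pairs are NOT combined. */
static void sketch_unescape_url(char *url)
{
    char *r = url, *w = url;
    unsigned cp;

    while (*r) {
        if (r[0] == '%' && (r[1] == 'u' || r[1] == 'U')
            && parse_hex(r + 2, 4, &cp) && cp != 0) {
            if (cp < 0x80) {
                *w++ = (char)cp;
            } else if (cp < 0x800) {
                *w++ = (char)(0xC0 | (cp >> 6));
                *w++ = (char)(0x80 | (cp & 0x3F));
            } else {
                *w++ = (char)(0xE0 | (cp >> 12));
                *w++ = (char)(0x80 | ((cp >> 6) & 0x3F));
                *w++ = (char)(0x80 | (cp & 0x3F));
            }
            r += 6;
        } else if (r[0] == '%' && parse_hex(r + 1, 2, &cp) && cp != 0) {
            *w++ = (char)cp;        /* classic %XX: one raw byte */
            r += 3;
        } else {
            *w++ = *r++;            /* ordinary or malformed input: copy as-is */
        }
        assert(w <= r);             /* write cursor never passes read cursor */
    }
    *w = '\0';
}
```

Calling sketch_unescape_url() on a writable buffer holding "%u3042" should leave the three UTF-8 bytes E3 81 82 in it; malformed sequences such as "%zz" are copied through unchanged.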

At Fri, 29 Nov 2002 06:55:54 +0900,
Tatsuhiko Miyagawa wrote:
 
 It seems that Apache's ap_unescape_url() can't handle %u style
 URI-escaped Unicode strings, hence Apache::Request cannot either,
 while CGI.pm can.
 
 Is this a known issue?
 
 
 -- 
 Tatsuhiko Miyagawa [EMAIL PROTECTED]
 

-- 
IKEBE Tomohiro [EMAIL PROTECTED]




apache_request_unicode.patch
Description: Binary data


Re[2]: ap_unescape_url can't escape %uXXXX

2003-01-28 Thread Lee Goddard

Hi IKEBE,

On Tuesday, January 28, 2003 at 1:32:43 PM, you wrote:


IT> I have written a little patch for libapreq which unescapes
IT> %u style URI-escaped strings.
IT> The unescape algorithm is based on CGI.pm.

Any idea if/when that'll be incorporated into the distribution?

I currently have to have a handler check every incoming request
for failure, and then convert with Unicode::String.
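
The kind of check meant here might look roughly like the C sketch below. It is only an illustration and assumes that "failure" means a %uXXXX sequence survived unescaping; the name sketch_has_percent_u is hypothetical and is not part of Apache, libapreq or Unicode::String.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>

/* Hypothetical helper: return 1 if s contains "%u" (or "%U") followed by
 * four hex digits, i.e. an escape that was left undecoded; otherwise 0. */
static int sketch_has_percent_u(const char *s)
{
    assert(s != NULL);

    for (; *s; s++) {
        int i;

        if (s[0] != '%' || (s[1] != 'u' && s[1] != 'U'))
            continue;
        /* Count hex digits after "%u"; the loop stops at the first
         * non-hex character, including the terminating NUL. */
        for (i = 2; i < 6 && isxdigit((unsigned char)s[i]); i++)
            ;
        if (i == 6)
            return 1;
    }
    return 0;
}
```

A handler could run this over the raw query string and only take the slower conversion path when it returns 1.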

-- 
Cheers
Lee Goddard
Of course, if everyone used Lynx, this wouldn't be a problem





Re: ap_unescape_url can't escape %uXXXX

2002-11-28 Thread Joe Schaefer
Tatsuhiko Miyagawa [EMAIL PROTECTED] writes:

 It seems that Apache's ap_unescape_url() can't handle %u style
 URI-escaped Unicode strings, hence Apache::Request cannot either,
 while CGI.pm can.

You may want to take this issue up on [EMAIL PROTECTED]
Personally I've never seen this kind of character encoding, 
and my reading of

  Section 8 at http://www.w3.org/TR/charmod/ 
  and RFC 2718, Section 2.2.5, 

seems to indicate that this isn't a recommended practice. OTOH, IIRC the 
apache source claims to support utf8 extension(s) of www-urlencoded
ASCII, so if people really are using such encodings, supporting 
%u in ap_unescape_url shouldn't hurt server performance at all.

In any case, putting together a patch of ap_unescape_url along the lines 
of CGI::Util's utf8_chr() can't hurt :-).

-- 
Joe Schaefer



Re: ap_unescape_url can't escape %uXXXX

2002-11-28 Thread Tatsuhiko Miyagawa
At 29 Nov 2002 02:17:31 -0500,
Joe Schaefer wrote:
  It seems that Apache's ap_unescape_url() can't handle %u style
  URI-escaped Unicode strings, hence Apache::Request cannot either,
  while CGI.pm can.

My WinIE 5.5 / WinIE 6.0 use this style of URI escaping when you use
JavaScript to submit a page's content. (Well, I'm talking about
MovableType's bookmarklet, if you're interested.)
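
To make the wire format concrete, here is a rough C sketch of how such strings get produced. It assumes the bookmarklet relies on something like JavaScript's escape(), which passes ASCII letters, digits and @*_+-./ through, writes other code units below 256 as %XX, and writes everything else as %uXXXX. The names js_escape_safe and sketch_js_escape are hypothetical; this is not browser code.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Characters that an escape()-style encoder passes through unchanged. */
static int js_escape_safe(uint16_t u)
{
    static const char safe[] =
        "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789@*_+-./";
    return u != 0 && u < 128 && strchr(safe, (int)u) != NULL;
}

/* Hypothetical helper: encode n UTF-16 code units into out (capacity cap).
 * Returns 0 on success, -1 if the buffer is too small. */
static int sketch_js_escape(const uint16_t *units, size_t n,
                            char *out, size_t cap)
{
    size_t used = 0, i;

    assert(out != NULL);
    if (cap == 0)
        return -1;
    out[0] = '\0';

    for (i = 0; i < n; i++) {
        unsigned u = units[i];
        int len;

        if (cap - used < 7)         /* room for "%uXXXX" plus the NUL */
            return -1;
        if (js_escape_safe((uint16_t)u))
            len = snprintf(out + used, cap - used, "%c", (int)u);
        else if (u < 256)
            len = snprintf(out + used, cap - used, "%%%02X", u);
        else
            len = snprintf(out + used, cap - used, "%%u%04X", u);
        used += (size_t)len;
    }
    return 0;
}
```

Under these assumptions, the code units U+3042, 'a', space and U+00E9 come out as "%u3042a%20%E9", which is the form the server then has to unescape.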
 
 seems to indicate that this isn't a recommended practice. OTOH, IIRC the 
 apache source claims to support utf8 extension(s) of www-urlencoded
 ASCII, so if people really are using such encodings, supporting 
 %u in ap_unescape_url shouldn't hurt server performance at all.
 
 In any case, putting together a patch of ap_unescape_url along the lines 
 of CGI::Util's utf8_chr() can't hurt :-).

Yep ;-)


-- 
Tatsuhiko Miyagawa [EMAIL PROTECTED]