Unfortunately we have to support wireless (mobile phone) browser software
which is not Unicode enabled, uses only codepage, and sends non-ASCII in
URLs that it generates for form input etc. as % hex escaped bytes
representing codepage not UTF-8. These would be indistinguishable from % hex
encoded bytes in the Xalan output, and we would not know whether to decode
them from codepage or from UTF-8 at the server side.

No browser I've encountered unconditionally implements the IETF mandate to
always represent non-ASCII in URLs as hex escaped UTF-8. IE does have an
option checkbox "Always send URLs as UTF-8", but other than that, browsers
normally send non-ASCII in the encoding of the currently displayed page
where the form or link occurs. Hex UTF-8 encoded URLs work when we send our
pages as UTF-8, but this is not possible in the cases where the browser has
no Unicode support.

Joseph

-----Original Message-----
From: David N Bertoni/Cambridge/IBM [mailto:[EMAIL PROTECTED]]
Sent: Friday, August 02, 2002 11:47 AM
To: [EMAIL PROTECTED]
Subject: Re: URL escaping in Xalan-C






No, there's no interface to do that.  This was in there for experimental
purposes, but the APIs to modify it were never publically exposed.

Is there some reason why you want to generate URLs which are not properly
escaped?  I don't see it as behavior we support in the normal release, but
you could always modify the sources yourself to do what you want.

Dave



 

                      "Joseph Boyle"

                      <Boyle@siebel.           To:
"'[EMAIL PROTECTED]'" <[EMAIL PROTECTED]>

                      com>                     cc:      (bcc: David N
Bertoni/Cambridge/IBM)                                           
                                               Subject: URL escaping in
Xalan-C                                                        
                      08/02/2002 10:24

                      AM

                      Please respond

                      to xalan-dev

 

 




Is there a way to turn off escaping of URLs in output as hex encoded UTF-8?
I can see setting m_escapeURLs in FormatterToHTML.cpp to false would do
this, but m_escapeURLs is initialized to true and I do not see an interface
to change it. Thanks, Joseph






Reply via email to