Unfortunately we have to support wireless (mobile phone) browser software which is not Unicode enabled, uses only codepage, and sends non-ASCII in URLs that it generates for form input etc. as % hex escaped bytes representing codepage not UTF-8. These would be indistinguishable from % hex encoded bytes in the Xalan output, and we would not know whether to decode them from codepage or from UTF-8 at the server side.
No browser I've encountered unconditionally implements the IETF mandate to always represent non-ASCII in URLs as hex escaped UTF-8. IE does have an option checkbox "Always send URLs as UTF-8", but other than that, browsers normally send non-ASCII in the encoding of the currently displayed page where the form or link occurs. Hex UTF-8 encoded URLs work when we send our pages as UTF-8, but this is not possible in the cases where the browser has no Unicode support. Joseph -----Original Message----- From: David N Bertoni/Cambridge/IBM [mailto:[EMAIL PROTECTED]] Sent: Friday, August 02, 2002 11:47 AM To: [EMAIL PROTECTED] Subject: Re: URL escaping in Xalan-C No, there's no interface to do that. This was in there for experimental purposes, but the APIs to modify it were never publically exposed. Is there some reason why you want to generate URLs which are not properly escaped? I don't see it as behavior we support in the normal release, but you could always modify the sources yourself to do what you want. Dave "Joseph Boyle" <Boyle@siebel. To: "'[EMAIL PROTECTED]'" <[EMAIL PROTECTED]> com> cc: (bcc: David N Bertoni/Cambridge/IBM) Subject: URL escaping in Xalan-C 08/02/2002 10:24 AM Please respond to xalan-dev Is there a way to turn off escaping of URLs in output as hex encoded UTF-8? I can see setting m_escapeURLs in FormatterToHTML.cpp to false would do this, but m_escapeURLs is initialized to true and I do not see an interface to change it. Thanks, Joseph
