Re: [PHP-DEV] Re: PHP Unicode support design document

Andrei Zmievski Wed, 24 Aug 2005 16:24:21 -0700

Hi,

On Aug 23, 2005, at 7:30 PM, Makoto Tozawa wrote:

"HTTP Input Encoding
...
If the HTTP request contains the encoding specification in the headers,
then it will be used instead of this setting."

With my best knowledge there isn't such http request header which
specifies the encoding of the request. In case the intent is to honor
the ACCEPT-CHARSET, it may cause a problem because browsers don't
gurantee the encoding in the ACCEPT-CHARSET is same as the encoding
used to escape characters in the URL query string. After all, the
ACCEPT-CHARSET is to specify the character encodings acceptable for
the response.

I took a closer look at this today and RFC 2616 does not specifywhether user agents are supposed to send a charset parameter in theContent-Type header of the POST request. I did not see any of mybrowsers doing so. I think we can safely disregard this and rely onhttp_input_encoding and output_encoding settings. We are not going touse Accept-Charset for the reasons you mention.

Is there any way to keep the byte semantics (in oppose to unicodesemantics)only for the existing functions? For example, the Oracle 8 functionscan beconfigured to use utf-8 for the character encoding of strings. Inorder forthem to work properly, fundamental functions, which Oracle 8 functioncall,have to behave in byte samentics. And if they work properly when theunicodesemantics switch is turned on, by setting the runtime_encoding toutf-8,
they can be called by uncode applications.


I couldn't parse this on the first try. Could you restate this?

-Andrei

--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php

Re: [PHP-DEV] Re: PHP Unicode support design document

Reply via email to