Hi,
Randal L. Schwartz wrote:
"BÁRTHÁZI" == BÁRTHÁZI András <[EMAIL PROTECTED]> writes:
Did I miss the memo where anything outside the list of valid URI characters needed to be hexified, hence there's no need for such a URL encoding scheme? Where is this memo?
BÁRTHÁZI> Can you write it again with other words? Both Stevan and me are not BÁRTHÁZI> understand.
URLs are only 7 bit ASCII, according to the RFCs. Did I miss a new RFC where non-7-bit URLs are permitted? If so, please point to that.
You are right, in URLs just 7 bit ASCII is allowed. But you can store any character in an URL, if you encode it with "URL encoding". For example UTF-8 "á" is coded as "%C3%A1".
RFC 1738 [1], part 2.2 is writing about it (just about iso-8859-1 encoding). Or you can read a short tutorial about it at Blooberry[2]. Don't tell me, that you never heard this before. :)
Anyway, it's not just about URL encoding (the URL and the GET parameters), but POST parameters working the same way.
Bye, Andras
[1] http://www.rfc-editor.org/rfc/rfc1738.txt [2] http://www.blooberry.com/indexdot/html/topics/urlencoding.htm