... Does anybody know the default codepage in URIs of HTTP?
US-ASCII. Don't count on high-bit-set bytes resembling Latin-1 or
even working at all on some platforms. However, there is a proposal
to incorporate non-ASCII UTF-8 as (multi-)bytes as %xx :
Hello Florian,
You are right that currently, there is no well-defined way to
include arbitrary characters into URIs, or to interpret URIs
and find out which characters they contain. So if you have
a file with an a-umlaut and an Euro sign in it, to construct
an http URI for it, you have to make