Hello vitalie,

On Fri, 12 May 2006 00:20:39 +0300 GMT (12/05/2006, 04:20 +0700 GMT),
vitalie vrabie wrote:

vv> correct. the URI's umlauts MUST be %-encoded, so it'll be
vv> recognized properly. this is FIRST.

You refer to section 2.1 of RFC 3986:

vv> 2.1.  Percent-Encoding
vv> 
vv>    A percent-encoding mechanism is used to represent a data octet in a
vv>    component when that octet's corresponding character is outside the
vv>    allowed set or is being used as a delimiter of, or within, the
vv>    component. [...]

Note that it says "outside the allowed set". But what is the "allowed
set"? Section 2 gives the answer:

vv> 2.  Characters
vv> 
vv>    The URI syntax provides a method of encoding data, presumably for the
vv>    sake of identifying a resource, as a sequence of characters.  The URI
vv>    characters are, in turn, frequently encoded as octets for transport
vv>    or presentation.  This specification does not mandate any particular
vv>    character encoding for mapping between URI characters and the octets
vv>    used to store or transmit those characters.  When a URI appears in a
vv>    protocol element, the character encoding is defined by that protocol;
vv>    without such a definition, a URI is assumed to be in the same
vv>    character encoding as the surrounding text.

We learn that no particular character encoding is mandated. The
surrounding text of a German email (or any email containing umlauts)
uses an encoding that allows umlauts.
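
To make the encoding question concrete, here is a small Python sketch
(an illustration only, not anything The Bat! does). It assumes the
standard urllib.parse module; the helper name percent_encode and the
sample word are made up for this example. It shows that the same
umlaut yields different percent-encoded octets depending on the
character encoding chosen:

```python
from urllib.parse import quote, unquote

def percent_encode(text: str, charset: str) -> str:
    """Percent-encode everything except unreserved ASCII characters.

    The octets that get escaped depend on `charset` (hypothetical helper).
    """
    return quote(text, safe="", encoding=charset)

word = "Müller"  # sample data, not taken from the thread

latin9 = percent_encode(word, "iso-8859-15")  # Latin-9: one octet for ü
utf8 = percent_encode(word, "utf-8")          # UTF-8: two octets for ü

print(latin9)  # M%FCller
print(utf8)    # M%C3%BCller

# Decoding only round-trips when the same charset is assumed again.
print(unquote(latin9, encoding="iso-8859-15"))  # Müller
```

So whoever percent-encodes the umlaut has to pick an encoding first,
and the receiver has to assume the same one to get the umlaut back.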

vv> the URI per se must NOT contain any umlauts (again, please read
vv> that rfc thoroughly). it just can be PRESENTED to the user with those.

I've read the RFC, and I understand the opposite: a URI may indeed
contain umlauts if it appears in text whose charset allows umlauts,
for example in an email encoded in Latin-9.

In any case, it should not be too difficult to implement what the user
needs: Clickable URIs, even though they may contain umlauts. Why do
you not want to implement that?
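
As a rough idea of how little is needed, here is a Python sketch under
stated assumptions. The pattern URI_PATTERN and the function names are
hypothetical, the URI detection is deliberately loose, and the charset
used for the octets is a parameter because that is exactly the point
under discussion. It finds URIs in a text and percent-encodes only
their non-ASCII characters before handing them on:

```python
import re
from urllib.parse import quote

# Hypothetical, deliberately loose pattern: scheme plus any run of
# characters up to whitespace, angle brackets or a double quote.
URI_PATTERN = re.compile(r"https?://[^\s<>\"]+")

def encode_non_ascii(uri: str, charset: str = "utf-8") -> str:
    """Percent-encode only non-ASCII characters; ASCII is left untouched,
    so delimiters and existing %XX escapes survive as they are."""
    return "".join(
        ch if ord(ch) < 128 else quote(ch, encoding=charset)
        for ch in uri
    )

def find_clickable_uris(text: str, charset: str = "utf-8"):
    """Return (as_displayed, as_passed_to_browser) pairs for each URI."""
    return [
        (m.group(0), encode_non_ascii(m.group(0), charset))
        for m in URI_PATTERN.finditer(text)
    ]

sample = "See http://example.org/wiki/Käse for details"
print(find_clickable_uris(sample))
# [('http://example.org/wiki/Käse', 'http://example.org/wiki/K%C3%A4se')]
print(find_clickable_uris(sample, "iso-8859-15"))
# [('http://example.org/wiki/Käse', 'http://example.org/wiki/K%E4se')]
```

The sketch only covers umlauts in the path. An umlaut in the host name
would need separate handling, and a character that the chosen charset
cannot represent raises UnicodeEncodeError here.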

-- 

Cheers,
Thomas.

A man walks into a bank: "Hands out, money up, I'm a hostage!" -
"Right, you want it in schillings?"
http://thomas.fernandez.hat-gar-keine-homepage.de/

Message reply created with The Bat! 3.80.03
under Windows XP 5.1 Build 2600 Service Pack 2



________________________________________________________
 Current beta is 3.80.03 | 'Using TBBETA' information:
http://www.silverstones.com/thebat/TBUDLInfo.html
IMPORTANT: To register as a Beta tester, use this link first -
http://www.ritlabs.com/en/partners/testers/