https://issues.apache.org/ooo/show_bug.cgi?id=121362

--- Comment #7 from orcmid <[email protected]> ---
(In reply to comment #6)

> > The ten ASCII excluded URI graphic characters (ever only allowed via
> > %-encoding) are
> > 
> >  SP (space)
> >  "  %  < > \ ^ { | }
> ***Where is that in IETF RFC 3986? I mean, in what page?

The excluded characters are simply never listed in the grammar for URIs in RFC
3986.  In older versions they were specifically listed.  In 3986 they are
simply never allowed to appear directly in a URI (but of course they can be
%-encoded).

The way to do it is to list the printable characters for the ASCII codes 20
through 7E.   Then strike out the ones that are mentioned in any of the 3986
BNF rules.  The excluded characters are the ones that are left.  (That's what I
did to make sure I had the correct list.)

Web servers can have many ways of mapping incoming URLs to the actual character
set of their web pages on the surver.  Are you saying that a NIST web page has
a link that is not a well-formed URL?  Or do you have a link into a NIST web
page that is not a well-formed URL?

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to