This is to highlight issue 2167.

https://github.com/apache/jena/issues/2167

What do if asked to print a URI string that has bad characters in it when outputting Turtle-family syntax.

[18]    IRIREF  ::=     '<' ([^#x00-#x20<>"{}|^`\] | UCHAR)* '>'

https://www.w3.org/TR/turtle/#grammar-production-IRIREF

Parsing also requires passing RFC 3986 in addition to the IRIREF rule.
There is no "fix the URI".

Percent encoding "encodes" - it changes the URI (the output URI string would not match the input).

The current PR - for discussion - puts in UCHAR (which is an escape mechanism). That at least then passes the IRREF rule but it is not a legal URI; it has a bad character in it.

    Andy

Reply via email to