afs commented on issue #2062:
URL: https://github.com/apache/jena/issues/2062#issuecomment-1784999301
The fragment part is unaffected. RFC 8141 is following the URI spec.
`r-component = pchar *( pchar / "/" / "?" )`
and `pchar` includes `sub-delims` which includes `"="`.
Tokenizing is "greedy" (longest match, regex style), so if there are two
possible parses, the first, longer one is taken. This may be intentional, so
that an r-component containing q-component syntax gets passed to a resolver.
Fortunately, for syntax validation it does not matter! The outcome is
pass/fail, regardless of which way the components were split.
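
To make the greedy behaviour concrete, here is a minimal Python sketch. It is not Jena's parser: `split_rq`, `rq_pattern` and the regexes are hypothetical helpers written only for illustration, built from the `pchar` rule in RFC 3986 and the `rq-components = [ "?+" r-component ] [ "?=" q-component ]` rule in RFC 8141. It compares a greedy split with a lazy one on the part of a URN that follows the NSS.

```python
import re

# pchar per RFC 3986: unreserved / pct-encoded / sub-delims / ":" / "@"
PCHAR = r"(?:[A-Za-z0-9\-._~!$&'()*+,;=:@]|%[0-9A-Fa-f]{2})"


def rq_pattern(greedy):
    # r-component and q-component have the same shape:
    #   pchar *( pchar / "/" / "?" )
    star = "*" if greedy else "*?"
    comp = rf"{PCHAR}(?:{PCHAR}|[/?]){star}"
    # rq-components = [ "?+" r-component ] [ "?=" q-component ]
    return re.compile(rf"(?:\?\+(?P<r>{comp}))?(?:\?=(?P<q>{comp}))?\Z")


def split_rq(tail, greedy=True):
    """Return (r-component, q-component), or None if the syntax is invalid."""
    m = rq_pattern(greedy).match(tail)
    return None if m is None else (m.group("r"), m.group("q"))


print(split_rq("?+res?=query"))                # ('res?=query', None)
print(split_rq("?+res?=query", greedy=False))  # ('res', 'query')
print(split_rq("?+"))                          # None (empty r-component)
```

Both splits accept `?+res?=query` and both reject `?+`, so a pure syntax check gives the same answer either way. Only the boundary between the two components differs.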
There is no "get r-component" method on IRIx.
As for tools - I think tools that try to parse/check UUIDs, and many that
parse/check URNs (which isn't many in linked data land), will not handle
r- and q-components. If they treat them blindly as IRIs, they will pass them,
including internationalization issues.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]