afs commented on issue #2062:
URL: https://github.com/apache/jena/issues/2062#issuecomment-1784999301

   The fragment part is unaffected. RFC 8141 follows the URI spec (RFC 3986) here.
   
       r-component   = pchar *( pchar / "/" / "?" )
   
   and `pchar` includes `sub-delims` which includes `"="`.
   
   Tokenizing is "greedy" (longest match, regex style), so if there are two 
possibilities, the first/longer one wins. This may be intentional, so that the 
r-component, with any q-component syntax inside it, gets passed to a resolver.
   
   Fortunately, for syntax validation it does not matter: the result is 
pass/fail, not the reason.
   There is no "get r-component" method on IRIx.
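
A minimal sketch of the greedy behaviour described above, using Python regular expressions rather than Jena's own parser. The names `PCHAR`, `rq_pattern` and `split_rq` are hypothetical and exist only for this illustration; the character classes are transcribed from the RFC 3986 `pchar` rule and the RFC 8141 `rq-components` rule. It assumes the input is just the part of the URN after the assigned name and before any fragment.

```python
import re

# pchar (RFC 3986): unreserved / pct-encoded / sub-delims / ":" / "@"
# Note that sub-delims contains "=", so "=" is legal inside an r-component.
PCHAR = r"(?:[A-Za-z0-9\-._~!$&'()*+,;=:@]|%[0-9A-Fa-f]{2})"


def rq_pattern(greedy=True):
    """Build a regex for: [ "?+" r-component ] [ "?=" q-component ]."""
    star = "*" if greedy else "*?"  # longest match vs. shortest match
    r_comp = rf"{PCHAR}(?:{PCHAR}|[/?]){star}"
    q_comp = rf"{PCHAR}(?:{PCHAR}|[/?])*"
    return re.compile(rf"(?:\?\+(?P<r>{r_comp}))?(?:\?=(?P<q>{q_comp}))?")


def split_rq(tail, greedy=True):
    """Return (r-component, q-component), or None if the syntax is invalid."""
    m = rq_pattern(greedy).fullmatch(tail)
    return None if m is None else (m.group("r"), m.group("q"))


print(split_rq("?+r?=q"))                # ('r?=q', None)
print(split_rq("?+r?=q", greedy=False))  # ('r', 'q')
print(split_rq("?=q"))                   # (None, 'q')
```

Both readings of `?+r?=q` accept the string, so a pass/fail validator gives the same answer either way; only the split between the components differs.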
   
   As for tools - I think tools that try to parse/check UUIDs, and many that 
parse/check URNs (which isn't many in linked data land), will not handle 
r- and q-components. If they treat them blindly as IRIs, they will pass them, 
internationalization issues included.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

