afs commented on issue #2732:
URL: https://github.com/apache/jena/issues/2732#issuecomment-2368685026

   > While trying to speed up parsing of RDF/XML
   
   Which parser?  RRX (SAX)?
   
   > PS: I could not find any other Java IRI parser library, except for RDF4J.
   
   This is my proposed replacement for `jena-iri`.
   
   The big issue really is that the error messages will change. I hope users 
find IRI3986 error messages somewhat less cryptic :-)
   
   https://github.com/afs/x4ld/tree/main/iri4ld - specifically 
[IRI3986.java](https://github.com/afs/x4ld/blob/main/iri4ld/src/main/java/org/seaborne/rfc3986/IRI3986.java).
   
   It does not depend on Jena except for its tests - it runs itself against 
jena-iri to check.
   
   It parses/validates RFC 3986 syntax at the cost of a single Java object 
allocation. It splits the string only if the caller asks for parts of an IRI.
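
   To illustrate the "one allocation, split on demand" idea, here is a minimal 
sketch. It is not the IRI3986 code or its API: the class `LazyIri` and its 
method names are hypothetical, and it only locates the RFC 3986 components 
without validating them. A single pass records offsets, and a substring is 
created only when the caller asks for that part.

```java
// Hypothetical illustration only: LazyIri is NOT the IRI3986 API.
// It locates components by offset and does no character validation.
final class LazyIri {
    private final String str;
    // Offsets into str. A start offset of -1 means the component is absent.
    private final int schemeEnd;            // index of the ':' after the scheme, or -1
    private final int authStart, authEnd;
    private final int pathStart, pathEnd;   // path always exists, possibly empty
    private final int queryStart, queryEnd;
    private final int fragStart;

    private LazyIri(String str, int schemeEnd, int authStart, int authEnd,
                    int pathStart, int pathEnd, int queryStart, int queryEnd,
                    int fragStart) {
        this.str = str;
        this.schemeEnd = schemeEnd;
        this.authStart = authStart;
        this.authEnd = authEnd;
        this.pathStart = pathStart;
        this.pathEnd = pathEnd;
        this.queryStart = queryStart;
        this.queryEnd = queryEnd;
        this.fragStart = fragStart;
    }

    /** Single pass over the string: records offsets, creates no substrings. */
    static LazyIri parse(String s) {
        int n = s.length();
        int i = 0;

        // scheme: text before the first ':' that precedes any '/', '?' or '#'
        int schemeEnd = -1;
        for (int k = 0; k < n; k++) {
            char c = s.charAt(k);
            if (c == ':') { if (k > 0) schemeEnd = k; break; }
            if (c == '/' || c == '?' || c == '#') break;
        }
        if (schemeEnd >= 0) i = schemeEnd + 1;

        // authority: after "//" up to the next '/', '?' or '#'
        int authStart = -1, authEnd = -1;
        if (s.startsWith("//", i)) {
            authStart = i + 2;
            i = authStart;
            while (i < n && "/?#".indexOf(s.charAt(i)) < 0) i++;
            authEnd = i;
        }

        // path: up to '?' or '#'
        int pathStart = i;
        while (i < n && s.charAt(i) != '?' && s.charAt(i) != '#') i++;
        int pathEnd = i;

        // query: after '?' up to '#'
        int queryStart = -1, queryEnd = -1;
        if (i < n && s.charAt(i) == '?') {
            queryStart = ++i;
            while (i < n && s.charAt(i) != '#') i++;
            queryEnd = i;
        }

        // fragment: after '#' to the end of the string
        int fragStart = (i < n && s.charAt(i) == '#') ? i + 1 : -1;

        return new LazyIri(s, schemeEnd, authStart, authEnd,
                           pathStart, pathEnd, queryStart, queryEnd, fragStart);
    }

    // Accessors allocate a substring only when called; absent parts return null.
    String scheme()    { return schemeEnd < 0 ? null : str.substring(0, schemeEnd); }
    String authority() { return authStart < 0 ? null : str.substring(authStart, authEnd); }
    String path()      { return str.substring(pathStart, pathEnd); }
    String query()     { return queryStart < 0 ? null : str.substring(queryStart, queryEnd); }
    String fragment()  { return fragStart < 0 ? null : str.substring(fragStart); }

    @Override public String toString() { return str; }
}
```

   In this sketch, a caller that only needs to know whether a string parses 
pays for one `LazyIri` object, and `LazyIri.parse(s).path()` is the point where 
a new string is created.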
   
   Jena has `IRIx` as a plugin extension point. `iri4ld` was passing Jena's 
`IRIx` test suite the last time I checked.
   
   With the associated IRI3986 adapter for Jena, it worked the last time I 
tried in the Jena build. I haven't had time to get back to it and it missed 
Jena 5.0.
   
   IRI3986 has URI scheme-specific checking support as well, including DIDs.
   
   ----
   
   Background/general:
   
   In several places where there is generated code, Jena ships the generated 
source so that it can be built without having to set up tool chains. Things 
have got better over the years, and maybe plugins now execute the tool 
themselves and do not depend on external setup.
   
   > I don't know if the "maven jflex plugin" was available in 2012 
   
   The earliest version in Maven Central is from 2014 :-)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

