afs commented on issue #2732: URL: https://github.com/apache/jena/issues/2732#issuecomment-2368685026
> While trying to speed up parsing of RDF/XML Which parser? RRX (SAX)? > PS: I could not find any other Java IRI parser library, except for RDF4J. This is my proposed replacement for `jena-iri`. The big issue really is that the error messages will change. I hope uses find IRI3986 error messages somewhat less cryptic :-) https://github.com/afs/x4ld/tree/main/iri4ld - specifically [IRI3986.java](https://github.com/afs/x4ld/blob/main/iri4ld/src/main/java/org/seaborne/rfc3986/IRI3986.java). It does not depend on Jena except for its tests - it runs itself against jena-iri to check. It parses/validates RFC 3986 syntax at the cost of a single java object allocation. It's only if the caller asks for parts of a IRI does it split the string. Jena has `IRIx` as a plugin extension point. `iri4ld` is currently passing Jena's `IRIx` test suite and last time I checked With the associated IRI3986 adapter for Jena, it worked the last time I tried in the Jena build. I haven't had time to get back to it and it missed Jena 5.0. IRI3986 has URI scheme specific checking support as well, including DID's. ---- Background/general: For several of the places where there is generated code, Jena ships the generated source code in several places so it can be built without having to setup tools chains. Things have got better over the years and maybe plugins now execute the tool and not depend on external setup. > I don“t know if the "maven jflex plugin" was available in 2012 The earliest version in maven central is 2014 :-) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
