arne-bdt opened a new issue, #2732:
URL: https://github.com/apache/jena/issues/2732

   ### Change
   
   While trying to speed up parsing of RDF/XML, I profiled the parsing process.
   It seems that resolving/parsing IRIs is one of the most time consuming parts 
in the whole process.
   When trying to look into `org.apache.jena.iri.impl.LexerPath#yylex`,  I 
found hat the whole file has been" generated by JFlex 1.4.3 on 04/03/12".
   Fortunately, JFlex is still actively developed and the latest release is 
JFex 1.9.1 from 2023-03-11.
   `NotesAFS.txt` explains how to genereate the code with JFlex. But maybe no 
one has tried the process for 12 years now.
   I don“t know if the "maven jflex plugin" was available in 2012 or if there 
is a reason, why JFlex has not been integrated in the build process,
   Since Jena-Iri is a core part of the Apache Jena libraries, I think it may 
be wise to ensure, that the generated code is reproducable.
   
   Would it be a good idea to integrate the "maven jflex plugin" instead of the 
manual process?
   
   PS: I could not find any other Java IRI parser library, except for RDF4J.
   
   ### Are you interested in contributing a pull request for this task?
   
   Perhaps?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to