mcb5637 opened a new issue, #2766: URL: https://github.com/apache/jena/issues/2766
### Version

Apache Jena RIOT version 5.1.0

### What happened?

I am trying to validate a Turtle file with the `turtle` command described here: https://jena.apache.org/documentation/io/

The result is a number of warnings, which I reduced to cases similar to the following.

Trying to parse the triple `<http://en.wikipedia.org/wiki/𝁰> a <http://xmlns.com/foaf/0.1/Document> .` (the last character of the subject is Unicode code point U+1D070, in case it does not render properly), the `turtle` command issues the following warnings:

```
11:04:40 WARN riot :: [line: 1, col: 32] Illegal character in IRI (Not a ucschar: 0xD834): <http://en.wikipedia.org/wiki/[U+D834]...>
11:04:40 WARN riot :: [line: 1, col: 33] Illegal character in IRI (Not a ucschar: 0xDC70): <http://en.wikipedia.org/wiki/?[U+DC70]...>
```

Note that 0xD834 and 0xDC70 are the two halves (the surrogate pair) of U+1D070 when it is represented in UTF-16. I also validated the input: it does contain the correct UTF-8 representation of U+1D070.

In case it is relevant:

OS: `Ubuntu 24.04.1 LTS`

Java version:

```
openjdk 21.0.4 2024-07-16
OpenJDK Runtime Environment (build 21.0.4+7-Ubuntu-1ubuntu224.04)
OpenJDK 64-Bit Server VM (build 21.0.4+7-Ubuntu-1ubuntu224.04, mixed mode, sharing)
```

Command run: `cat test.ttl | apache-jena-5.1.0/bin/turtle --output=TURTLE > /dev/null`, where `test.ttl` contains only the relevant triple. Other ways of supplying the input (such as loading the file directly with `turtle`) result in the same warnings.

### Relevant output and stacktrace

_No response_

### Are you interested in making a pull request?

None
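The note about the two UTF-16 halves can be checked independently of Jena. The following is a minimal Python sketch of the standard surrogate-pair arithmetic; the helper name `utf16_surrogates` is made up for illustration and is not part of Jena or RIOT. It only confirms that the two values in the warnings are the surrogate pair of U+1D070 and shows the UTF-8 bytes expected in the input file. It does not reproduce the RIOT check itself.

```python
def utf16_surrogates(code_point):
    """Split a supplementary code point (U+10000..U+10FFFF) into its
    UTF-16 high and low surrogate code units.

    Hypothetical helper for illustration only; not a Jena API.
    """
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("not a supplementary code point")
    offset = code_point - 0x10000          # 20-bit value
    high = 0xD800 + (offset >> 10)         # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)        # bottom 10 bits
    return high, low


if __name__ == "__main__":
    high, low = utf16_surrogates(0x1D070)
    print(f"U+1D070 -> 0x{high:04X} 0x{low:04X}")   # U+1D070 -> 0xD834 0xDC70

    # Cross-check against Python's own UTF-16 encoder.
    encoded = chr(0x1D070).encode("utf-16-be")
    units = (int.from_bytes(encoded[:2], "big"), int.from_bytes(encoded[2:], "big"))
    print(units == (high, low))                     # True

    # The UTF-8 bytes one would expect to find in test.ttl for this character.
    print(chr(0x1D070).encode("utf-8").hex(" "))    # f0 9d 81 b0
```

U+1D070 lies in the supplementary range U+10000 to U+1FFFD, which the `ucschar` production of RFC 3987 permits, so the character itself is allowed in an IRI; the warnings name the individual surrogate code units rather than the combined code point.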
