mcb5637 opened a new issue, #2766:
URL: https://github.com/apache/jena/issues/2766

   ### Version
   
   Apache Jena RIOT version 5.1.0
   
   ### What happened?
   
   I am trying to validate a turtle file with the turtle command described 
here: https://jena.apache.org/documentation/io/
   The result are a bunch of warnings, which i reduced down to cases simmilar 
to the following:
   
   Trying to parse the triple `<http://en.wikipedia.org/wiki/𝁰> a 
<http://xmlns.com/foaf/0.1/Document> .` (the last character of the subject is 
unicode codepoint 1D070, in case it does not get rendered properly). The turtle 
command issues the following warnings:
   ```
   11:04:40 WARN  riot            :: [line: 1, col: 32] Illegal character in 
IRI (Not a ucschar: 0xD834): <http://en.wikipedia.org/wiki/[U+D834]...>
   11:04:40 WARN  riot            :: [line: 1, col: 33] Illegal character in 
IRI (Not a ucschar: 0xDC70): <http://en.wikipedia.org/wiki/?[U+DC70]...>
   ```
   (Note that 0xD834 and 0xDC70 are the 2 halves of 1D070, if you represent it 
in utf-16)
   (I also validated the input and it does contain the correct representation 
of 1D070 in utf-8)
   
   In case it is relevant:
   OS: `Ubuntu 24.04.1 LTS`
   Java version:
   ```
   openjdk 21.0.4 2024-07-16
   OpenJDK Runtime Environment (build 21.0.4+7-Ubuntu-1ubuntu224.04)
   OpenJDK 64-Bit Server VM (build 21.0.4+7-Ubuntu-1ubuntu224.04, mixed mode, 
sharing)
   ```
   Command run: `cat test.ttl | apache-jena-5.1.0/bin/turtle --output=TURTLE > 
/dev/null`, where test.tll only contains the relevant triple. (Although 
different types of input (like loading the file directly with turtle) result in 
the same warnings)
   
   ### Relevant output and stacktrace
   
   _No response_
   
   ### Are you interested in making a pull request?
   
   None


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to