[ https://issues.apache.org/jira/browse/JENA-1924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17143845#comment-17143845 ]
Andy Seaborne commented on JENA-1924:
-------------------------------------
Hi [~jj-author],
Thanks for the report. Yes, there is a bug.
U+FFFC is a strange character: it is legal Turtle syntax (not sure that was
intended) but illegal in an IRI.
Turtle rule 163s includes {{[#xFDF0-#xFFFD]}}, so the parser accepts it, but it
should be caught by the IRI checker and that isn't happening.
To add to the weirdness, it is legal in RDF 1.0, which uses things called "RDF
URI References" that predate IRIs.
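To make the mismatch concrete, here is a minimal sketch in plain Java. It is not Jena code and not the fix. The class and method names are made up for illustration, and it only covers the BMP part of the {{ucschar}} rule as quoted in the report, leaving out the supplementary-plane ranges.

```java
public class UcscharCheck {
    // BMP part of the IRI ucschar production, as quoted in the report.
    // Assumption: supplementary-plane ranges are left out of this sketch.
    private static final int[][] BMP_UCSCHAR = {
        {0x00A0, 0xD7FF},
        {0xF900, 0xFDCF},
        {0xFDF0, 0xFFEF},
    };

    // True if the code point falls in one of the BMP ucschar ranges.
    static boolean isBmpUcschar(int codePoint) {
        for (int[] range : BMP_UCSCHAR) {
            if (codePoint >= range[0] && codePoint <= range[1]) {
                return true;
            }
        }
        return false;
    }

    // True if the code point falls in the [#xFDF0-#xFFFD] range
    // of Turtle rule 163s mentioned above.
    static boolean inTurtle163sRange(int codePoint) {
        return codePoint >= 0xFDF0 && codePoint <= 0xFFFD;
    }

    public static void main(String[] args) {
        int cp = 0xFFFC;
        System.out.printf("U+%04X turtle=%b ucschar=%b%n",
                cp, inTurtle163sRange(cp), isBmpUcschar(cp));
    }
}
```

The two ranges share a lower bound but differ at the top: the Turtle range ends at U+FFFD and the IRI range at U+FFEF, so U+FFF0 to U+FFFD (which includes U+FFFC) is accepted by one and not the other.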
> Invalid IRI (with U+FFFC) parsed as valid
> -----------------------------------------
>
> Key: JENA-1924
> URL: https://issues.apache.org/jira/browse/JENA-1924
> Project: Apache Jena
> Issue Type: Bug
> Components: IRI
> Affects Versions: Jena 3.13.1, Jena 3.15.0
> Reporter: Johannes Frey
> Assignee: Andy Seaborne
> Priority: Major
>
> We found an IRI which seems invalid according to IRI grammar
> ucschar = %xA0-D7FF / %xF900-FDCF / %xFDF0-FFEF
> but is reported as valid. The IRI is <http://pl.dbpedia.org/resource/>
> (containing the [U+FFFC|https://en.wiktionary.org/wiki/%EF%BF%BC] char at
> the end).
> The original issue can be found here
> [https://github.com/dbpedia/extraction-framework/issues/634]
>
>