[
https://issues.apache.org/jira/browse/JENA-1892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17096542#comment-17096542
]
Gilles Sérasset commented on JENA-1892:
---------------------------------------
Regardless of the bug,
org.apache.jena.atlas.lib.StrUtils.decodeHex (StrUtils.java) does not make sense
at all. Algorithmically, it is broken as soon as the URI contains a non-ASCII
character.
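
To illustrate the kind of failure described above, here is a minimal sketch.
It is not Jena's code: the class HexDecodeSketch and both methods are
hypothetical, and it assumes an encoding in which each escaped byte is written
as a marker character followed by two hex digits. The first method turns every
escaped byte into one char, which garbles multi-byte UTF-8 sequences and fails
on a truncated escape; the second collects consecutive escaped bytes and
decodes them together as UTF-8.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; not the Jena implementation.
class HexDecodeSketch {

    // Naive decoder: each escaped byte becomes one char.
    // Multi-byte UTF-8 sequences come out as separate, wrong characters,
    // and a truncated escape makes substring() throw
    // StringIndexOutOfBoundsException.
    static String decodePerByte(String s, char marker) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char ch = s.charAt(i);
            if (ch != marker) {
                out.append(ch);
                continue;
            }
            int b = Integer.parseInt(s.substring(i + 1, i + 3), 16);
            out.append((char) b);
            i += 2; // skip the two hex digits
        }
        return out.toString();
    }

    // UTF-8 aware decoder: buffers runs of escaped bytes and decodes
    // each run as a whole, after checking that the escape is complete.
    static String decodeUtf8(String s, char marker) {
        StringBuilder out = new StringBuilder();
        ByteArrayOutputStream pending = new ByteArrayOutputStream();
        for (int i = 0; i < s.length(); i++) {
            char ch = s.charAt(i);
            if (ch == marker) {
                if (i + 2 >= s.length()) {
                    throw new IllegalArgumentException("Incomplete escape at index " + i);
                }
                pending.write(Integer.parseInt(s.substring(i + 1, i + 3), 16));
                i += 2; // skip the two hex digits
            } else {
                flush(pending, out);
                out.append(ch);
            }
        }
        flush(pending, out);
        return out.toString();
    }

    // Decode any buffered bytes as UTF-8 and append them to the output.
    private static void flush(ByteArrayOutputStream pending, StringBuilder out) {
        if (pending.size() > 0) {
            out.append(new String(pending.toByteArray(), StandardCharsets.UTF_8));
            pending.reset();
        }
    }

    public static void main(String[] args) {
        String encoded = "caf%C3%A9"; // UTF-8 bytes of U+00E9, hex-escaped
        System.out.println(decodeUtf8(encoded, '%').equals("caf\u00E9"));          // prints true
        System.out.println(decodePerByte(encoded, '%').equals("caf\u00C3\u00A9")); // prints true
    }
}
```

With the per-byte variant, an input that ends in the middle of an escape (for
example "ab%4") raises StringIndexOutOfBoundsException, whereas the UTF-8
aware variant rejects it with an explicit error.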
> Non ASCII chars in URIs leads to StringIndexOutOfBoundException
> ---------------------------------------------------------------
>
> Key: JENA-1892
> URL: https://issues.apache.org/jira/browse/JENA-1892
> Project: Apache Jena
> Issue Type: Bug
> Components: Base, TDB
> Affects Versions: Jena 3.14.0
> Reporter: Gilles Sérasset
> Priority: Major
> Attachments: JenaIRITest.java
>
>
> When a URI contains a non-ASCII character, the StrUtils.decodeHex() method
> decodes the URI incorrectly when it is read back from TDB and eventually
> throws a StringIndexOutOfBoundsException.
> The attached JUnit class contains a method that raises the exception.
> Note that if the dataset is not closed and reopened (forcing TDB to store the
> data on disk and retrieve it), the code works correctly.