[ 
https://issues.apache.org/jira/browse/JENA-785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andy Seaborne closed JENA-785.
------------------------------

> exception getting substr() from Deseret string
> ----------------------------------------------
>
>                 Key: JENA-785
>                 URL: https://issues.apache.org/jira/browse/JENA-785
>             Project: Apache Jena
>          Issue Type: Bug
>         Environment: Ubuntu 14.04 amd64
> $ java -version
> java version "1.7.0_65"
> OpenJDK Runtime Environment (IcedTea 2.5.1) (7u65-2.5.1-4ubuntu1~0.14.04.2)
> OpenJDK 64-Bit Server VM (build 24.65-b04, mixed mode)
>            Reporter: Osma Suominen
>            Assignee: Rob Vesse
>             Fix For: Jena 2.12.1
>
>
> I stumbled on a problem with a SPARQL query, run against Fuseki 1.1.0 and the 
> latest 1.1.1 snapshot. This is a minimal example demonstrating the problem.
> I have the following single triple (based on Lexvo.org) as my dataset:
> {noformat}
> <http://lexvo.org/id/iso3166/AQ> <http://www.w3.org/2000/01/rdf-schema#label> 
> "\U00010408\U0001044C\U0001043B\U0001042A\U00010449\U0001043F\U0001043B\U0001042E\U0001043F\U00010432"@en-Dsrt
>  .
> {noformat}
> (this is a label in the rather obscure Deseret alphabet - see wikipedia for 
> details about the script)
> Then I ran this query:
> {noformat}
> SELECT ?label (SUBSTR(?label, 1, 1) as ?l)
> WHERE { ?s <http://www.w3.org/2000/01/rdf-schema#label> ?label }
> {noformat}
> With the text output, I get this:
> {noformat}
> ------------------------------------------------
> | label                          | l           |
> ================================================
> | "𐐈𐑌𐐻𐐪𐑉𐐿𐐻𐐮𐐿𐐲"@en-Dsrt | " |
> ------------------------------------------------
> {noformat}
> (you may need to install a Deseret font if you can't see the letters above. I 
> use DeseretBee from here: 
> http://copper.chem.ucla.edu/~jericks/Fonts/Bee%20Fonts/Sans%20Serif/_DeseretBee2.ttf)
> The result is not completely wrong, but there's something fishy about the ?l 
> binding - it only shows one quote.
> With output set to JSON, XML, CSV or TSV, I get this error instead:
> {noformat}
> Error 500: java.nio.charset.MalformedInputException: Input length = 1
> Fuseki - version 1.1.1-SNAPSHOT (Build date: 2014-09-16T13:16:19+0000)
> {noformat}
> As you can see I'm using the latest Fuseki snapshot from yesterday.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to