Lewis John McGibbney created ANY23-214:
------------------------------------------
Summary: ë (e-umlaut or diaeresis) not decoded in RDF output
Key: ANY23-214
URL: https://issues.apache.org/jira/browse/ANY23-214
Project: Apache Any23
Issue Type: Improvement
Affects Versions: 1.0
Reporter: Lewis John McGibbney
Fix For: 1.1
{
"quads": [
[
{
"type": "uri",
"value": "http:\/\/any23.org\/tmp\/"
},
"http:\/\/purl.org\/dc\/terms\/title",
{
"type": "literal",
"value": "Ti\u00c3\u00absto - Wikipedia, the free encyclopedia",
"lang": null,
"datatype": null
},
null
],
[
{
"type": "bnode",
"value": "node57ae60d4152c8125e46a2ce59389333b"
},
"http:\/\/www.w3.org\/2006\/vcard\/ns#fn",
{
"type": "literal",
"value": "Ti\u00c3\u00absto",
"lang": null,
"datatype": null
},
null
],
...
The above is the output from extracting triples from the following page with
the Service.
https://en.wikipedia.org/wiki/Tiesto
You can easily view that we are not decoding the e-umlaut.
We should fix this.
--
This message was sent by Atlassian JIRA
(v6.2#6252)