Hi,

I get a couple of question related to the newest dumps as i published a howto 
for setting up a local DBpedia mirror quite some time ago on my blog.
One I can't answer is related to the encoding of URIs or IRIs in the new dump 
files:


From the DBpedia 3.6 dump: de/labels_de.nt.bz2 and en/labels_en.nt.bz2
> <http://dbpedia.org/resource/Gerhard_Schr%C3%B6der> 
> <http://www.w3.org/2000/01/rdf-schema#label> "Gerhard Schr\u00F6der"@en .

From the DBpedia 3.7 (no i18n) 
3.7/data/HardDrive2/DBpedia/3.7/data-enUris-compressed/en/labels_en.nt.bz2 
(btw. maybe someone could recreate the provided all_languages.tar not to 
include the absolute path on your server?):
> <http://dbpedia.org/resource/Gerhard_Schr%C3%B6der> 
> <http://www.w3.org/2000/01/rdf-schema#label> "Gerhard Schr\u00F6der"@en .


From the DBpedia 3.7 (no i18n) 
3.7/data/HardDrive2/DBpedia/3.7/data-enUris-compressed/de/labels_de.nt.bz2
> <http://dbpedia.org/resource/Gerhard_Schröder> 
> <http://www.w3.org/2000/01/rdf-schema#label> "Gerhard Schr\u00F6der"@de .

From the DBpedia 3.7 (i18n) 3.7-i18n/all_languages-i18n/en/labels_en.nt.bz2
> <http://dbpedia.org/resource/Gerhard_Schr%C3%B6der> 
> <http://www.w3.org/2000/01/rdf-schema#label> "Gerhard Schr\u00F6der"@en .

From the DBpedia 3.7 (i18n) 3.7-i18n/all_languages-i18n/de/labels_de.nt.bz2
> <http://de.dbpedia.org/resource/Gerhard_Schröder> 
> <http://www.w3.org/2000/01/rdf-schema#label> "Gerhard Schr\u00F6der"@de .


Are the 3.7 .nt files with non ASCII chars in the URIs valid ntriples files?
http://www.w3.org/TR/rdf-testcases/#ntriples says quite clearly:
> The Internet media type / MIME type of N-Triples is text/plain and the 
> character encoding is 7-bit US-ASCII.

and there's no ö or ō in ASCII (the .nt files actually seem to be UTF-8 
encoded).


Aside from finding this inconsistent and inconvenient I ask myself: is it 
planned to use IRIs where we used URIs before?

Another question arising from this:
Will DBpedia now use the IRI <http://de.dbpedia.org/resource/Gerhard_Schröder>? 
If so how can i request it in sparql?

select *
where {
  <http://dbpedia.org/resource/Gerhard_Schr%C3%B6der> ?p ?o.
}

or

select *
where {
  <http://dbpedia.org/resource/Gerhard_Schr\u00F6der> ?p ?o.
}

or 

select *
where {
  <http://dbpedia.org/resource/Gerhard_Schröder> ?p ?o.
}


cheers,
Jörn


------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure contains a
definitive record of customers, application performance, security
threats, fraudulent activity and more. Splunk takes this data and makes
sense of it. Business sense. IT sense. Common sense.
http://p.sf.net/sfu/splunk-d2dcopy1
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to