In DBpedia there are two resources - one for Hokkaido Prefecture 
[http://dbpedia.org/resource/Hokkaido] and one for Hokkiado Island 
[http://dbpedia.org/resource/Hokkaido__island__1] - extracted from the very 
same Wikipedia page [https://en.wikipedia.org/wiki/Hokkaido].
I assume this results from the two infoboxes - Infobox Prefecture Japan and 
Infobox islands - contained in the page. So far, the intention is clear, 
because these two infoboxes (might) describe two separate entities in the real 
world: ontologically Hokkaido Prefecture and Hokkiado Island are two separate 
entities. They also have different values for the same properties, e.g. 
dbo:areaTotal 83453570000 (including water area and smaller islands) vs. 
77981870000 (islands landmass) respectively.

But this causes several problems:
 (1) [http://dbpedia.org/resource/Hokkaido__island__1] misses some essential 
properties such as rdfs:label, prov:wasDerivedFrom, dbo:wikiPageID, 
dbo:wikiPageRevisionID, rdfs:comment, owl:sameAs.
 (2) [http://dbpedia.org/resource/Hokkaido__island__1] has no incoming links. 
Obviously, every link actually referring to the island will point to the 
prefecture resource.
 (3) Yago types for the prefecture include 
[http://dbpedia.org/class/yago/Island109316454] and 
[http://dbpedia.org/class/yago/IslandsOfJapan], which most probably results 
from (2).
 (4) the wikidata identifier/link for [http://dbpedia.org/page/Hokkaido] is 
incorrect, it points to [http://wikidata.org/entity/Q35581], which is 
describing the island. The prefecture being 
[https://www.wikidata.org/wiki/Q1037393]. There is no wikidata link for 
[http://dbpedia.org/resource/Hokkaido__island__1].
 (5) There is no explicit link from [http://dbpedia.org/resource/Hokkaido] to 
[http://dbpedia.org/resource/Hokkaido__island__1].

In other language chapters the distinction between prefecture and island has 
been made, e.g. [https://simple.wikipedia.org/wiki/Hokkaid%C5%8D_Prefecture] 
and [https://simple.wikipedia.org/wiki/Hokkaid%C5%8D]: "Hokkaidō Prefecture 
(北海道 Hokkaidō?) is a special Japanese prefecture in Hokkaidō region on the 
island Hokkaidō."
But, since that is not the case for the english chapter, I think it is 
misleading to introduce this distinction.

Similar holds for other pages containing multiple infoboxes, e.g. 
[http://dbpedia.org/resource/Masayuki_Kono] vs. 
[http://dbpedia.org/resource/Masayuki_Kono__Masayuki_Kono__1].
Maybe, I am missing something and the understanding and handling of resources 
containing "__“ (two underscores) is explained or defined somewhere.

Since we rely on OWA, I would assume for each resource matching 
http://dbpedia.org/resource/([\d\w_]+)__([\d\w_]+)__([\d]+): 
<http://dbpedia.org/resource/$1> owl:sameAs 
<http://dbpedia.org/resource/$1__$2__$3> .

Best regards
Magnus

-- 
Magnus Knuth

Hasso-Plattner-Institut für Softwaresystemtechnik GmbH
Prof.-Dr.-Helmert-Str. 2-3
14482 Potsdam

Amtsgericht Potsdam, HRB 12184
Geschäftsführung: Prof. Dr. Christoph Meinel

tel:     +49 331 5509 547
email:   magnus.kn...@hpi.de
web:     http://www.hpi.de/
webID:   http://magnus.13mm.de/


------------------------------------------------------------------------------
_______________________________________________
Dbpedia-developers mailing list
Dbpedia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-developers

Reply via email to