In DBpedia there are two resources - one for Hokkaido Prefecture [http://dbpedia.org/resource/Hokkaido] and one for Hokkiado Island [http://dbpedia.org/resource/Hokkaido__island__1] - extracted from the very same Wikipedia page [https://en.wikipedia.org/wiki/Hokkaido]. I assume this results from the two infoboxes - Infobox Prefecture Japan and Infobox islands - contained in the page. So far, the intention is clear, because these two infoboxes (might) describe two separate entities in the real world: ontologically Hokkaido Prefecture and Hokkiado Island are two separate entities. They also have different values for the same properties, e.g. dbo:areaTotal 83453570000 (including water area and smaller islands) vs. 77981870000 (islands landmass) respectively.
But this causes several problems: (1) [http://dbpedia.org/resource/Hokkaido__island__1] misses some essential properties such as rdfs:label, prov:wasDerivedFrom, dbo:wikiPageID, dbo:wikiPageRevisionID, rdfs:comment, owl:sameAs. (2) [http://dbpedia.org/resource/Hokkaido__island__1] has no incoming links. Obviously, every link actually referring to the island will point to the prefecture resource. (3) Yago types for the prefecture include [http://dbpedia.org/class/yago/Island109316454] and [http://dbpedia.org/class/yago/IslandsOfJapan], which most probably results from (2). (4) the wikidata identifier/link for [http://dbpedia.org/page/Hokkaido] is incorrect, it points to [http://wikidata.org/entity/Q35581], which is describing the island. The prefecture being [https://www.wikidata.org/wiki/Q1037393]. There is no wikidata link for [http://dbpedia.org/resource/Hokkaido__island__1]. (5) There is no explicit link from [http://dbpedia.org/resource/Hokkaido] to [http://dbpedia.org/resource/Hokkaido__island__1]. In other language chapters the distinction between prefecture and island has been made, e.g. [https://simple.wikipedia.org/wiki/Hokkaid%C5%8D_Prefecture] and [https://simple.wikipedia.org/wiki/Hokkaid%C5%8D]: "Hokkaidō Prefecture (北海道 Hokkaidō?) is a special Japanese prefecture in Hokkaidō region on the island Hokkaidō." But, since that is not the case for the english chapter, I think it is misleading to introduce this distinction. Similar holds for other pages containing multiple infoboxes, e.g. [http://dbpedia.org/resource/Masayuki_Kono] vs. [http://dbpedia.org/resource/Masayuki_Kono__Masayuki_Kono__1]. Maybe, I am missing something and the understanding and handling of resources containing "__“ (two underscores) is explained or defined somewhere. Since we rely on OWA, I would assume for each resource matching http://dbpedia.org/resource/([\d\w_]+)__([\d\w_]+)__([\d]+): <http://dbpedia.org/resource/$1> owl:sameAs <http://dbpedia.org/resource/$1__$2__$3> . Best regards Magnus -- Magnus Knuth Hasso-Plattner-Institut für Softwaresystemtechnik GmbH Prof.-Dr.-Helmert-Str. 2-3 14482 Potsdam Amtsgericht Potsdam, HRB 12184 Geschäftsführung: Prof. Dr. Christoph Meinel tel: +49 331 5509 547 email: magnus.kn...@hpi.de web: http://www.hpi.de/ webID: http://magnus.13mm.de/ ------------------------------------------------------------------------------ _______________________________________________ Dbpedia-developers mailing list Dbpedia-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dbpedia-developers