Hi Johann,
I just had a closer look at the German example.
In this case the foaf:name property is used very liberally as a mapping:
http://mappings.dbpedia.org/index.php/Mapping_de:Taxobox
{{PropertyMapping | templateProperty = Taxon_Name | ontologyProperty =
foaf:name }}
{{PropertyMapping | templateProperty = Taxon2_Name | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon3_Name | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon4_Name | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon5_Name | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon6_Name | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Bild | ontologyProperty =
foaf:depiction }}
{{PropertyMapping | templateProperty = Bildbeschreibung |
ontologyProperty = depictionDescription }}
{{PropertyMapping | templateProperty = Taxon_WissName |
ontologyProperty = scientificName }}
{{PropertyMapping | templateProperty = Taxon2_WissName |
ontologyProperty = scientificName }}
{{PropertyMapping | templateProperty = Taxon3_WissName |
ontologyProperty = scientificName }}
{{PropertyMapping | templateProperty = Taxon4_WissName |
ontologyProperty = scientificName }}
{{PropertyMapping | templateProperty = Taxon5_WissName |
ontologyProperty = scientificName }}
{{PropertyMapping | templateProperty = Taxon6_WissName |
ontologyProperty = scientificName }}
{{PropertyMapping | templateProperty = Taxon_Rang | ontologyProperty =
foaf:name }}
{{PropertyMapping | templateProperty = Taxon2_Rang | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon3_Rang | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon4_Rang | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon5_Rang | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon6_Rang | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon2_LinkName |
ontologyProperty = foaf:name }}
{{PropertyMapping | templateProperty = Taxon3_LinkName |
ontologyProperty = foaf:name }}
{{PropertyMapping | templateProperty = Taxon4_LinkName |
ontologyProperty = foaf:name }}
{{PropertyMapping | templateProperty = Taxon5_LinkName |
ontologyProperty = foaf:name }}
{{PropertyMapping | templateProperty = Taxon6_LinkName |
ontologyProperty = foaf:name }}
{{PropertyMapping | templateProperty = Taxon_Autor | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon2_Autor | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon3_Autor | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon4_Autor | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Taxon5_Autor | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = Modus | ontologyProperty =
foaf:name }}
{{PropertyMapping | templateProperty = ErdzeitalterVon |
ontologyProperty = foaf:name }}
{{PropertyMapping | templateProperty = Fundorte | ontologyProperty =
foaf:name }}
{{PropertyMapping | templateProperty = MioVon | ontologyProperty =
foaf:name }}
{{PropertyMapping | templateProperty = MioBis | ontologyProperty =
foaf:name }}
{{PropertyMapping | templateProperty = Subtaxa | ontologyProperty =
foaf:name }}
{{PropertyMapping | templateProperty = Subtaxa_Rang | ontologyProperty
= foaf:name }}
{{PropertyMapping | templateProperty = ErdzeitalterBis |
ontologyProperty = foaf:name }}
{{PropertyMapping | templateProperty = Subtaxa_Plural |
ontologyProperty = foaf:name }}
{{PropertyMapping | templateProperty = Rangunterdrückung |
ontologyProperty = foaf:name }}
{{PropertyMapping | templateProperty = TausendBis | ontologyProperty =
foaf:name }}
{{PropertyMapping | templateProperty = TausendVon | ontologyProperty =
foaf:name }}
{{PropertyMapping | templateProperty = Name | ontologyProperty =
foaf:name }}
This causes this behavior, obviously.
Please feel free to change this mapping to a more precise depiction of
Taxobox template in DBpedia.
I can see that a lot of valid data is neglected (or passed with foaf:name)
which could be useful for the community, when portrayed with fitting
properties.
Best,
Markus Freudenberg
Release Manager, DBpedia <http://wiki.dbpedia.org>
On Tue, Jan 17, 2017 at 7:58 PM, Paul Houle <paul.ho...@ontology2.com>
wrote:
> In the case of the first two, (KBE) and (DBE) are abbreviations of
> chivalric titles which aren't that different from putting PhD or MD after
> the name, just I think it's more exclusive. You could make the case that
> "Sir Alfred Hitchcock (KBE)" is a valid name, but KBE itself is not.
> "Lady Mallowan" is a totally appropriate name for Agatha Christie because
> she was married to Max Mallowan who was himself a knight before she became
> a Dame.
>
> In general when you are harvesting names like this there is the problem of
> finding the valid variant forms and not finding invalid forms. There is
> also the issue that you should be more liberal about what you recognize
> than what you generate. For instance you can find some racist slurs in
> Wikipedia redirects which are names you probably should recognize if
> somebody tries to use them but that will probably cause you trouble if you
> try to use them.
>
> --
> Paul Houle
> paul.ho...@ontology2.com
>
>
>
> On Tue, Jan 17, 2017, at 12:39 PM, Johann Petrak wrote:
>
> I have noticed that the files "Mappingbased literals" provided for
> download here
> http://wiki.dbpedia.org/downloads-2016-04
> contain rather odd values, and rather a lot of those.
>
> For example for English there are the following entries for Hitchcock:
>
> <http://dbpedia.org/resource/Alfred_Hitchcock> <http://xmlns.com/foaf/0.1/
> name> "Sir Alfred Hitchcock"@en .
> <http://dbpedia.org/resource/Alfred_Hitchcock> <http://xmlns.com/foaf/0.1/
> name> "(KBE)"@en .
>
> where "(KBE" as a name looks odd to me.
>
> or
> <http://dbpedia.org/resource/Agatha_Christie> <http://xmlns.com/foaf/0.1/
> name> "Dame Agatha Christie"@en .
> <http://dbpedia.org/resource/Agatha_Christie> <http://xmlns.com/foaf/0.1/
> name> "(Lady Mallowan)"@en .
> <http://dbpedia.org/resource/Agatha_Christie> <http://xmlns.com/foaf/0.1/
> name> "(DBE)"@en .
>
> where both the name in parentheses and the "(DBE)" looks odd to me.
>
> Other literals contain several entries or additional descriptive text, e.g.
> <http://dbpedia.org/resource/Alp_Arslan> <http://xmlns.com/foaf/0.1/name>
> "Laqab: Diya ad-Din (shortly), Adud ad-Dawlah"@en .
> <http://dbpedia.org/resource/Alp_Arslan> <http://xmlns.com/foaf/0.1/name>
> "Kunya: Abu Shuja"@en .
> <http://dbpedia.org/resource/Alp_Arslan> <http://xmlns.com/foaf/0.1/name>
> "Given name: Muhammad"@en .
> <http://dbpedia.org/resource/Alp_Arslan> <http://xmlns.com/foaf/0.1/name>
> "Turkic nickname: Alp Arslan"@en .
> <http://dbpedia.org/resource/Alp_Arslan> <http://xmlns.com/foaf/0.1/name>
> "Nasab: Alp ArslanibnChaghri-Beg ibnMikailibnSeljuqibnDuqaq"@en .
>
> Sometimes the language id for the literal appears to be incorrect, e.g.
> <http://dbpedia.org/resource/Athens> <http://xmlns.com/foaf/0.1/name>
> "Athens"@en .
> <http://dbpedia.org/resource/Athens> <http://xmlns.com/foaf/0.1/name>
> "Αθήνα"@en .
>
>
> In the German language file, some entries are still worse, e.g.:
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "Akeleien"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "Hahnenfußgewächse"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "Hahnenfußartige"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "Eudikotyledonen"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "Gattung"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "Tribus"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "Unterfamilie"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "Familie"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "Ordnung"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "ohne"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "nein"@de .
> <http://de.dbpedia.org/resource/Akeleien> <http://xmlns.com/foaf/0.1/name>
> "L."@de .
>
> Clearly, only the first of these triples is correct.
>
> The template for this entry is not filled incorrectly or otherwise broken
> though:
> {{Taxobox
> | Taxon_Name = Akeleien
> | Taxon_WissName = Aquilegia
> | Taxon_Rang = Gattung
> | Taxon_Autor = [[Carl von Linné|L.]]
> | Taxon2_LinkName = nein
> | Taxon2_WissName = Isopyreae
> | Taxon2_Rang = Tribus
> | Taxon3_WissName = Isopyroideae
> | Taxon3_Rang = Unterfamilie
> | Taxon4_Name = Hahnenfußgewächse
> | Taxon4_WissName = Ranunculaceae
> | Taxon4_Rang = Familie
> | Taxon5_Name = Hahnenfußartige
> | Taxon5_WissName = Ranunculales
> | Taxon5_Rang = Ordnung
> | Taxon6_Name = Eudikotyledonen
> | Taxon6_Rang = ohne
> | Bild = Aquilegia ottonis amaliae2UME.jpg
> | Bildbeschreibung = [[Balkanische Akelei]] (''[[Aquilegia ottonis]]''
> subsp. ''amaliae'')
> }}
>
> I do not know too much about how the triples get extracted from the
> original Wikimedia text or templates, but
> it seems that the extraction is too lenient instead of too strict:
> personally I would rather have no entries
> for http://de.dbpedia.org/resource/Akeleien and foaf:name at all than all
> those wrong ones.
>
> Although the semantics of foaf:name are pretty loose, I do not think that
> for organisms, all the higher level taxon names
> should be seen as the name of that organism, clearly "Eukaryote" is not a
> name of humans even though humans
> belong to this group.
> That the names of the taxon group is also included (e.g. the value for
> Taxon4_Rang) appears to be a bug.
>
> These are all examples only, but there are many more entries where the
> same errors appear in a systematic way.
>
> Is there any way to accomplish an extraction of the triples with more
> importance of precision than recall?
>
>
> ------------------------------------------------------------
> ------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, SlashDot.org! http://sdm.link/slashdot
> *_______________________________________________*
> DBpedia-discussion mailing list
> DBpedia-discussion@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>
>
> ------------------------------------------------------------
> ------------------
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, SlashDot.org! http://sdm.link/slashdot
> _______________________________________________
> DBpedia-discussion mailing list
> DBpedia-discussion@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
DBpedia-discussion mailing list
DBpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion