Sometimes it helps to look at the state of the article at the time of
extraction:
http://en.wikipedia.org/w/index.php?title=Elvis_Presley&action=edit&oldid=606258011
DBpedia assigns a single type to each resource and creates separate
resources for subsequently mapped templates if they are not direct
subclasses/superclasses of the first mapped template.
In this case we have an Infobox person followed by an Infobox military
person.
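
A rough sketch of that rule in Python, only to make the behaviour concrete.
It is not the extraction framework's code: the class table is a toy subset
assumed for illustration, `assign_types` is a hypothetical name, and the
`__1` suffix for the separate resource is likewise an assumption.

```python
# Toy class table: class -> direct superclass (assumed for illustration).
SUPERCLASS = {
    "Agent": None,
    "Person": "Agent",
    "MilitaryPerson": "Person",
    "Artist": "Person",
    "MusicalArtist": "Artist",
}


def assign_types(title, mapped_classes):
    """Map resource name -> its single assigned class.

    mapped_classes lists the classes of the mapped templates in page order.
    """
    assigned = {}
    if not mapped_classes:
        return assigned
    first = mapped_classes[0]
    assigned[title] = first          # the first mapped template sets the type
    extra = 0
    for cls in mapped_classes[1:]:
        if SUPERCLASS.get(cls) == first:
            assigned[title] = cls    # direct subclass: keep the more specific class
        elif cls == first or SUPERCLASS.get(first) == cls:
            continue                 # same class or direct superclass: nothing new
        else:
            extra += 1               # unrelated class: separate resource (hypothetical naming)
            assigned[f"{title}__{extra}"] = cls
    return assigned


print(assign_types("Elvis_Presley", ["Person", "MilitaryPerson"]))
# {'Elvis_Presley': 'MilitaryPerson'}
```

Under this reading, Person followed by MilitaryPerson stays on one resource,
which matches the types reported for Elvis_Presley.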
One problem is that we do not process embedded templates (Infobox musical
artist), which is mainly a design issue. I do not know who made that
decision in the past. It is quite easy to change, but I am not sure of the
implications of such a change.
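
To illustrate what handling embedded templates would involve, here is a
minimal sketch that walks nested `{{...}}` pairs and reports each template
name with its nesting depth. It is not how the framework parses wikitext;
`template_names` is a hypothetical helper, the sample is an abbreviated,
closed version of the Elvis_Presley infobox, and triple-brace parameters
such as `{{{1}}}` are not handled.

```python
def template_names(wikitext):
    """Return (depth, name) for each {{template}} in order of appearance."""
    found = []
    depth = 0
    i = 0
    while i < len(wikitext) - 1:
        pair = wikitext[i:i + 2]
        if pair == "{{":
            depth += 1
            i += 2
            # The template name runs up to the first '|', '}' or newline.
            j = i
            while j < len(wikitext) and wikitext[j] not in "|}\n":
                j += 1
            found.append((depth, wikitext[i:j].strip()))
            i = j
        elif pair == "}}":
            depth -= 1
            i += 2
        else:
            i += 1
    return found


SAMPLE = """{{Infobox person
| occupation = Singer, actor
| module = {{Infobox military person
}}
| module2 = {{Infobox musical artist
| instrument = Vocals, guitar, piano
}}
}}"""

for depth, name in template_names(SAMPLE):
    print(depth, name)
```

Both modules show up at depth 2, inside the depth-1 Infobox person.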
The current extraction can be seen at
http://mappings.dbpedia.org/server/extraction/en/extract?title=Elvis_Presley&revid=&format=turtle-triples
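
A small sketch for building that URL from its parts. The parameter names
are taken from the URL above; passing a specific revision id in `revid`
(for example the `oldid` from the Wikipedia link) is an assumption about
how the server treats that parameter, and `extraction_url` is a
hypothetical helper.

```python
from urllib.parse import urlencode

# Base path copied from the extraction URL in this thread.
BASE = "http://mappings.dbpedia.org/server/extraction/en/extract"


def extraction_url(title, revid="", fmt="turtle-triples"):
    """Build an extraction URL; an empty revid reproduces the link above."""
    query = urlencode({"title": title, "revid": revid, "format": fmt})
    return f"{BASE}?{query}"


print(extraction_url("Elvis_Presley"))
# Assumption: revid accepts a Wikipedia revision id such as the oldid above.
print(extraction_url("Elvis_Presley", revid="606258011"))
```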
On Wed, Feb 18, 2015 at 3:24 PM, Vladimir Alexiev <
[email protected]> wrote:
> > For Elvis_Presley, the DBpedia types are just
> > http://dbpedia.org/ontology/Agent
> > http://dbpedia.org/ontology/MilitaryPerson
> > http://dbpedia.org/ontology/Person
>
> Wikipedia has this:
> https://en.wikipedia.org/w/index.php?title=Elvis_Presley&action=edit
>
> {{Infobox person
> | occupation = Singer, actor
> | module = {{Infobox military person
> | module2 = {{Infobox musical artist
> | instrument = Vocals, guitar, piano
> | background = solo_singer
> | genre = {{flat list|
> *[[Rock and roll]]
> *[[Pop music|pop]]
> ...
>
> You can find the newest extraction here:
>
> http://mappings.dbpedia.org/server/extraction/en/extract?title=Elvis_Presley&revid=&format=turtle-triples&extractors=custom
>
> Unfortunately DBpedia processes only the first two infoboxes (Person and
> Military person) but not Musical artist.
> It even skips the instrument, background and genre fields from the third
> infobox (Musical artist).
> Gerard Kuys has remarked that DBpedia picks only one leaf class "to avoid
> contradictions".
> I can understand that various infoboxes scattered throughout the article
> could contribute nonsensical classes,
> especially if they have nonsense mappings as described here:
> http://vladimiralexiev.github.io/pres/20150209-dbpedia/dbpedia-problems-long.html#sec-1-3
>
> However:
> - The above two "modules" are not randomly scattered, they are embedded in
> the main infobox template
> - How is "contradiction" defined? Definitely the subclasses of Person are
> *not* disjoint, there are numerous examples.
>
> I posted issue https://github.com/dbpedia/extraction-framework/issues/341
>
> Also: it is not currently possible to examine one field (like "occupation"
> above) and emit two classes: see here
>
> http://vladimiralexiev.github.io/pres/20150209-dbpedia/dbpedia-problems-long.html#sec-3-1
>
> _______________________________________________
> Dbpedia-discussion mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
--
Kontokostas Dimitris