Freebase has done a pretty good job of recognizing Wikipedia articles
that are about people.

It's pretty easy to write an MQL query that gets the Wikipedia article
names for all of the topics of type Person in Freebase.
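
As a rough illustration, such a query might look something like the
sketch below. This is not the exact query we ran. It assumes that
"/people/person" is the id of the Person type and that English
Wikipedia article names are stored as keys in the "/wikipedia/en"
namespace. The helper names (build_person_query, extract_article_names)
are made up for this example, and the code only builds the query and
unpacks a result list; it does not contact Freebase.

```python
import json

# Assumptions (not taken from the original post): the Person type id is
# "/people/person" and Wikipedia article names live in the
# "/wikipedia/en" key namespace.
PERSON_TYPE = "/people/person"
WIKIPEDIA_NAMESPACE = "/wikipedia/en"


def build_person_query(limit=100):
    """Build an MQL query for Person topics and their Wikipedia keys.

    In MQL a null value (None in Python) asks the service to fill in
    that property in each result.
    """
    return [{
        "type": PERSON_TYPE,
        "id": None,
        "key": [{
            "namespace": WIKIPEDIA_NAMESPACE,
            "value": None,
        }],
        "limit": limit,
    }]


def extract_article_names(results):
    """Collect Wikipedia article names from a list of result dicts
    shaped like the query above."""
    names = []
    for topic in results:
        for key in topic.get("key", []):
            if key.get("namespace") == WIKIPEDIA_NAMESPACE:
                names.append(key["value"])
    return names


if __name__ == "__main__":
    # Print the query as the JSON text that would be sent to the service.
    print(json.dumps(build_person_query(limit=10), indent=2))
```

The query is kept as plain Python data so it can be handed to whatever
client library is in use, or serialized with json.dumps as shown.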

When we did this we got over 550K Wikipedia articles.  There are still
errors of course, but these are slowly being corrected.  For example,
the Wikipedia article 'List_of_assassinated_people_from_Turkey' [1] is
still classified as a Person topic in Freebase [2].

I saw a number of other false positives like this that were lists of
people, e.g. List_of_Lieutenant_Governors_of_British_Columbia, but many
of these are easy to filter out after the fact, for example by dropping
articles whose names match 'List_of_*'.
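
A minimal sketch of that kind of after-the-fact filter, assuming the
article names have already been collected into a Python list. The
function names are made up for this example, and 'Alan_Turing' is just
an illustrative non-list entry.

```python
LIST_PREFIX = "List_of_"


def is_list_article(name):
    """Return True for article names that follow the 'List_of_*' pattern."""
    return name.startswith(LIST_PREFIX)


def drop_list_articles(names):
    """Return the names with 'List_of_*' articles removed, order preserved."""
    return [name for name in names if not is_list_article(name)]


if __name__ == "__main__":
    candidates = [
        "List_of_assassinated_people_from_Turkey",
        "Alan_Turing",
        "List_of_Lieutenant_Governors_of_British_Columbia",
    ]
    print(drop_list_articles(candidates))
```

A prefix check like this only catches articles that follow the
'List_of_' naming pattern; other kinds of false positives would need
their own rules.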

We used freebase-python [3] to run the query and process the results.

We also extracted lists of organizations and places.  Getting Freebase
topics that are Organizations was a bit more difficult because there
is no single type that matches what we wanted, so we had to identify
the relevant subtypes Freebase uses.


[1] http://en.wikipedia.org/wiki/List_of_assassinated_people_from_Turkey
[2] http://www.freebase.com/view/en/laicist_turks_who_are_murdered
[3] http://code.google.com/p/freebase-python/
