Freebase has done a pretty good job of recognizing Wikipedia articles that are about people.
It's pretty easy to write a MQL query that gets the Wikipedia article names for all of the topics of type Person in Freebase. When we did this we got over 550K Wikipedia articles. There are still errors of course, but these are slowly being corrected. For example, the Wikipedia article 'List_of_assassinated_people_from_Turkey' [1] is still classified as a Person topic in Freebase [2]. I saw a number of other false positives like this which were lists of people, e.g. List_of_Lieutenant_Governors_of_British_Columbia but many of these are easy to filter out after the fact -- like articles with names like 'List_of_*'. We used freebase-python [2] to run the query and process the results. We also extracted lists of organizations and places. Getting Freebase topics that are Organizations was a bit more difficult because there is no single type that matches what we wanted, so we had to identify the relevant subtypes Freebase uses. [1] http://en.wikipedia.org/wiki/List_of_assassinated_people_from_Turkey [2] http://www.freebase.com/view/en/laicist_turks_who_are_murdered [3] http://code.google.com/p/freebase-python/ ------------------------------------------------------------------------------ Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july _______________________________________________ Dbpedia-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
