Hi, 2011/1/14 Roberto García <[email protected]>: > Dear all, > > I'm trying to derive a tree of categories from DBPedia but as I get so > many top categories I have taken a deeper look into the data and > compared it with the Wikipedia counterpart. > > It seems that there are many missing broader links among categories, > at least from what there is in Wikipedia. > > For instance, compare: > http://dbpedia.org/resource/Category:1910s_births and > http://en.wikipedia.org/wiki/Category:1910s_births > > In DBPedia, there is just a broader link from > http://dbpedia.org/resource/Category:Births_of_the_last_123_years and > no outgoing broader or narrower link. > In Wikipedia, there are 10 subcategories (from 1910 births to 1919 > births) and two supercategories (1910s and 20th-century births).
The DBpedia extraction framework operates on the wiki source code of Wikipedia. For category pages, it extracts skos:broader relations using links to other category pages. This works well for example in this case: http://en.wikipedia.org/wiki/Category:Futurama Unfortunately, it is difficult for the extraction framework to deal with all the different templates that Wikipedia offers its contributers. Looking at the wiki source code of http://en.wikipedia.org/wiki/Category:1910s_births, you can see that almost the complete page is created by two templates ({{birthdecade|...}} and {{Commons cat|...}}). Therefore, the extraction does not find many links to other categories and cannot produce the desired data. I know this does not provide a solution to your problem, but perhaps a better understanding. Cheers, Max ------------------------------------------------------------------------------ Protect Your Site and Customers from Malware Attacks Learn about various malware tactics and how to avoid them. Understand malware threats, the impact they can have on your business, and how you can protect your company and customers by using code signing. http://p.sf.net/sfu/oracle-sfdevnl _______________________________________________ Dbpedia-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
