Hi,

Some people have asked for it privately, so here is the subset of DBpedia SKOS categories that I use to build the topic classification index [1] that I am currently integrating into Stanbol:

http://dl.dropbox.com/u/5743203/data/topics_abstracts.tsv.gz (> 500MB I think)

It's a big tab-separated values file. The identifier of the category should be in the first column; the file also contains the materialized paths to the ancestors and an aggregate text from a random selection of articles categorized under those topics.

[1] https://issues.apache.org/jira/browse/STANBOL-197

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel
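
PS: for anyone who wants to peek at the file without unpacking it, here is a minimal Python sketch. It relies only on what is described above (tab-separated, category identifier in the first column). The UTF-8 encoding, the local file name and the helper names iter_topics / open_topics are assumptions for illustration, and the remaining columns are returned untouched since their exact layout is not spelled out here.

```python
import gzip


def iter_topics(lines):
    """Yield (category_id, other_fields) for each non-empty TSV line.

    Only the first column is interpreted (as the category identifier);
    the other columns (ancestor paths, aggregated text) are passed
    through as a list of raw strings.
    """
    for line in lines:
        line = line.rstrip("\r\n")
        if not line:
            continue  # skip blank lines
        fields = line.split("\t")
        yield fields[0], fields[1:]


def open_topics(path="topics_abstracts.tsv.gz"):
    # Assumption: the archive has been downloaded locally and is UTF-8 text.
    return gzip.open(path, "rt", encoding="utf-8")


if __name__ == "__main__":
    # Stream the file and show the first few categories only,
    # so the whole archive is never loaded into memory.
    with open_topics() as f:
        for i, (category_id, rest) in enumerate(iter_topics(f)):
            print(category_id, len(rest))
            if i >= 4:
                break
```

Streaming line by line keeps memory usage flat, which matters for a file of this size.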
