Thanks, Rupert, for the update. Meanwhile, I am reading the page on generating a custom vocabulary index, http://incubator.apache.org/stanbol/docs/trunk/customvocabulary.html, and trying to work out which files I need from the DBpedia Chinese download at http://downloads.dbpedia.org/3.8/zh/
The DBpedia download for Chinese has article categories, labels, short/long abstracts, and inter-language links. I do not know which of these to use with the Stanbol Entityhub custom vocabulary indexing tool.

-harish

On Thu, Aug 9, 2012 at 11:08 AM, Rupert Westenthaler <[email protected]> wrote:
> Hi
>
> the dbpedia 3.7 index was built by ogrisel so I do not know the details.
>
> I think Chinese (zh) labels are included, but the index only contains
> Entities for Wikipedia pages with 5 or more incoming links.
>
> In addition while the English DBpedia contains zh labels it will not
> contain Entities that do not have a counterpart in the English
> Wikipedia.
>
> best
> Rupert
>
> On Thu, Aug 9, 2012 at 1:00 AM, harish suvarna <[email protected]> wrote:
> > I received a USB in IKS conf which contained the 1.19GB of dbpedia full
> > solr index. Does it contain the data from the chinese dump (available in
> > the dbpedia.org download server under zh folder)?
> >
> > I do get some dbpedia entries for chinese text in stanbol enhancements. I
> > am using the 1.19GB dump. I am expecting some more enhancements which are
> > present in wikipedia chinese. Just wondering if chinese dump is not
> > utilized.
> >
> > -harish
>
> --
> | Rupert Westenthaler             [email protected]
> | Bodenlehenstraße 11             ++43-699-11108907
> | A-5500 Bischofshofen
