Hi, On Sun, Jun 26, 2011 at 17:31, Alexander Sidorov <[email protected]> wrote: > By which principle are languages chosen for Downloads page at dbpedia.org? I > was surprised not to find russian datasets there. It is not very important > as datasets for all languages are available at downloads.dbpedia.org... so > I'm just curious.
To list exactly twelve languages under Other Datasets is an arbitrary convention that keeps the table readable. The table lists all languages for which infobox mappings exist (en, de, hu, sl, hr, el). The remaining spots are filled by the languages whose Wikipedias have the most articles [1]. By the time of the extraction, the Dutch Wikipedia was larger than the Russian one. Russian was in fact the first language to drop out of the table. Sorry ;) Cheers, Max [1] http://s23.org/wikistats/wikipedias_html.php ------------------------------------------------------------------------------ All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 _______________________________________________ Dbpedia-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
