Hello everyone,

Are there any tools, or at least guidelines, for getting descriptive
statistics about a particular Wikipedia dump with regard to category sizes
and the usage of infoboxes and their properties?
I mean, if I am going to add more mappings for the Russian DBpedia, it would
be worth knowing the following in order to write the mappings that will
yield the most data:
1) what are the biggest categories by number of articles
(including subcategories),
2) which infobox templates are used most often within a particular category,
3) which infobox properties are usually filled in,
4) and so on.
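
To make item 1 concrete, here is a minimal sketch of counting articles per
category including subcategories. It assumes the article-to-category and
subcategory-to-parent links have already been extracted into plain pairs;
the function name and the sample category names are hypothetical. The
traversal keeps a visited set because the Wikipedia category graph may
contain cycles.

```python
from collections import defaultdict


def category_sizes(article_categories, broader):
    """Count distinct articles per category, including all subcategories.

    article_categories: iterable of (article, category) pairs.
    broader: iterable of (subcategory, parent_category) pairs.
    """
    direct = defaultdict(set)      # category -> articles placed directly in it
    for article, category in article_categories:
        direct[category].add(article)

    children = defaultdict(set)    # category -> its immediate subcategories
    for sub, parent in broader:
        children[parent].add(sub)

    def articles_under(root):
        # Iterative depth-first walk; `seen` guards against category cycles.
        seen, stack, found = {root}, [root], set()
        while stack:
            cat = stack.pop()
            found |= direct.get(cat, set())
            for child in children.get(cat, ()):
                if child not in seen:
                    seen.add(child)
                    stack.append(child)
        return found

    all_categories = set(direct) | set(children)
    for subs in children.values():
        all_categories |= subs
    return {cat: len(articles_under(cat)) for cat in all_categories}


# Made-up sample data, for illustration only.
article_categories = [
    ("Moscow", "Cities_in_Russia"),
    ("Kazan", "Cities_in_Russia"),
    ("Volga", "Rivers_of_Russia"),
]
broader = [
    ("Cities_in_Russia", "Geography_of_Russia"),
    ("Rivers_of_Russia", "Geography_of_Russia"),
]

sizes = category_sizes(article_categories, broader)
# Largest categories first, ties broken by name.
ranking = sorted(sizes.items(), key=lambda kv: (-kv[1], kv[0]))
print(ranking)
```

This walks the subtree once per category, which is simple but may be too
slow for a full dump; it is meant only to pin down what "size including
subcategories" would mean.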

I guess that this information can be derived by querying the following
DBpedia datasets:
Raw Infobox Properties,
Articles Categories, Categories (Labels), Categories (Skos).
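
As a rough illustration of the kind of query meant here, the sketch below
counts how often each property appears in a triples file. It assumes the
Raw Infobox Properties dataset is available as N-Triples with one triple per
line; the function name and the example.org URIs are made up for
illustration, and the line splitting is deliberately naive rather than a
full N-Triples parser.

```python
from collections import Counter


def property_usage(lines):
    """Count occurrences of each predicate in N-Triples-style lines."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue                      # skip blank lines and comments
        parts = line.split(None, 2)       # subject, predicate, rest of line
        if len(parts) < 3:
            continue                      # skip malformed lines
        counts[parts[1].strip("<>")] += 1
    return counts


# Made-up sample triples, for illustration only.
sample_lines = [
    '<http://example.org/resource/Moscow> <http://example.org/property/population> "11500000" .',
    '<http://example.org/resource/Kazan> <http://example.org/property/population> "1140000" .',
    '<http://example.org/resource/Moscow> <http://example.org/property/mayor> "Some Name" .',
]

usage = property_usage(sample_lines)
for prop, n in usage.most_common():
    print(n, prop)
```

For a real dump the same function could be fed an open file object instead
of a list, for example one returned by bz2.open(path, "rt", encoding="utf-8")
if the file is bzip2-compressed.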

But is there a better (or simpler) way to do that?

---
Rinat Gareev
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
