[Wiki-research-l] Constructing sensible baselines for Wikipedia language development analytics

2014-07-08 Thread Han-Teng Liao (OII)
Dear all, Your suggestions are needed on the ways in which one can construct some sensible baselines, most likely based on data sets *external* to Wikipedia projects, of *expected* Wikipedia language versions development. Such baselines should ideally indicate, given the availability

Re: [Wiki-research-l] Constructing sensible baselines for Wikipedia language development analytics

2014-07-08 Thread Laura Hale
I more or less tried to have a go at this on http://wikinewsreporter.wordpress.com/2014/06/30/determining-the-relative-quality-of-one-wikipedia-project-to-another-one-approach-with-english-spanish-catalan-galician-argonese-and-euskera-wikipedias/ using both internal and external criteria for

Re: [Wiki-research-l] Constructing sensible baselines for Wikipedia language development analytics

2014-07-08 Thread Stuart A. Yeates
Web browser language settings are an obvious place to start this. This will give you an approximation of user's preferred language (more likely the preferred language of those who configured their software). See http://www.w3.org/International/questions/qa-lang-priorities.en.php for the gory

Re: [Wiki-research-l] Constructing sensible baselines for Wikipedia language development analytics

2014-07-08 Thread Gerard Meijssen
Hoi, At the WMF language committee, the question if a language is viable for a Wikimedia project is a practical one. It is also very much a political one. One vitally important difference with your approach is that the distinction is between a first project and a subsequent project. In the latest

Re: [Wiki-research-l] Constructing sensible baselines for Wikipedia language development analytics

2014-07-08 Thread h
(user language log: e.g. Accept-Language parameter) Yes Stuart, locale data could be a nice source to look at, including the HTTP headers of the Accept-Language to find locale such as zh-TW,zh;q=0.8,en;q=0.6 Do you or anyone have suggestions on the external or global datasets that can be used

Re: [Wiki-research-l] Constructing sensible baselines for Wikipedia language development analytics

2014-07-08 Thread h
Indeed, GerardM, I agree with you that a few good women or men with passions can kick start some Wikimedia projects, and different Wikimedia projects have different barriers or paths of development. I also agree with you that the direction that I am pursuing may not be helpful to those languages

Re: [Wiki-research-l] Constructing sensible baselines for Wikipedia language development analytics

2014-07-08 Thread Jane Darnell
han-teng liao, Sorry but I had to read your answer a couple times before I understood what you were getting at. I missed the previous conversation also. For information about the 10,000 things, I would just go to GerardM because he knows all about that stuff. As far as page stats on all the