Dear all,
Your suggestions are needed on the ways in which one can construct
some sensible baselines, most likely based on data sets *external* to
Wikipedia projects, of *expected* Wikipedia language versions development.
Such baselines should ideally indicate, given the availability
I more or less tried to have a go at this on
http://wikinewsreporter.wordpress.com/2014/06/30/determining-the-relative-quality-of-one-wikipedia-project-to-another-one-approach-with-english-spanish-catalan-galician-argonese-and-euskera-wikipedias/
using both internal and external criteria for
Web browser language settings are an obvious place to start this. This
will give you an approximation of user's preferred language (more
likely the preferred language of those who configured their software).
See http://www.w3.org/International/questions/qa-lang-priorities.en.php
for the gory
Hoi,
At the WMF language committee, the question if a language is viable for a
Wikimedia project is a practical one. It is also very much a political one.
One vitally important difference with your approach is that the distinction
is between a first project and a subsequent project. In the latest
(user language log: e.g. Accept-Language parameter)
Yes Stuart, locale data could be a nice source to look at, including the
HTTP headers of the Accept-Language to find locale such as
zh-TW,zh;q=0.8,en;q=0.6
Do you or anyone have suggestions on the external or global datasets that
can be used
Indeed, GerardM, I agree with you that a few good women or men with
passions can kick start some Wikimedia projects, and different Wikimedia
projects have different barriers or paths of development.
I also agree with you that the direction that I am pursuing may not be
helpful to those languages
han-teng liao,
Sorry but I had to read your answer a couple times before I understood what
you were getting at. I missed the previous conversation also. For
information about the 10,000 things, I would just go to GerardM because he
knows all about that stuff. As far as page stats on all the