The one major caveat, I think, is that the danger of proportionate data is that it makes small projects very vulnerable to artificial traffic spikes. I'd go out on a limb and say that some of the massive bumps in popularity we see in particular combinations are likely due to either undetected automata or simply a project having so little traffic that a small number of people can sway the results outlandishly.
On 25 February 2015 at 16:32, Andrew Lih <andrew....@gmail.com> wrote: > Great job. > > Who knew Esperanto was big in Japan and China at #2 and #3? > > > > On Wed, Feb 25, 2015 at 4:06 PM, Oliver Keyes <oke...@wikimedia.org> wrote: >> >> Hey all! >> >> We've released a highly-aggregated dataset of readership data - >> specifically, data about where, geographically, traffic to each of our >> projects (and all of our projects) comes from. The data can be found >> at http://dx.doi.org/10.6084/m9.figshare.1317408 - additionally, I've >> put together an exploration tool for it at >> https://ironholds.shinyapps.io/WhereInTheWorldIsWikipedia/ >> >> Hope it's useful to people! >> >> -- >> Oliver Keyes >> Research Analyst >> Wikimedia Foundation >> >> _______________________________________________ >> Wiki-research-l mailing list >> Wiki-research-l@lists.wikimedia.org >> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > > > > _______________________________________________ > Wiki-research-l mailing list > Wiki-research-l@lists.wikimedia.org > https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > -- Oliver Keyes Research Analyst Wikimedia Foundation _______________________________________________ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l