The one major caveat, I think, is that the danger of proportionate data is that it makes small projects very vulnerable to artificial traffic spikes. I'd go out on a limb and say that some of the massive bumps in popularity we see in particular combinations are likely due to either undetected automata or simply a project having so little traffic that a small number of people can sway the results outlandishly.
On 25 February 2015 at 16:32, Andrew Lih <[email protected]> wrote: > Great job. > > Who knew Esperanto was big in Japan and China at #2 and #3? > > > > On Wed, Feb 25, 2015 at 4:06 PM, Oliver Keyes <[email protected]> wrote: >> >> Hey all! >> >> We've released a highly-aggregated dataset of readership data - >> specifically, data about where, geographically, traffic to each of our >> projects (and all of our projects) comes from. The data can be found >> at http://dx.doi.org/10.6084/m9.figshare.1317408 - additionally, I've >> put together an exploration tool for it at >> https://ironholds.shinyapps.io/WhereInTheWorldIsWikipedia/ >> >> Hope it's useful to people! >> >> -- >> Oliver Keyes >> Research Analyst >> Wikimedia Foundation >> >> _______________________________________________ >> Wiki-research-l mailing list >> [email protected] >> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > > > > _______________________________________________ > Wiki-research-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wiki-research-l > -- Oliver Keyes Research Analyst Wikimedia Foundation _______________________________________________ Analytics mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/analytics
