Re: [Wiki-research-l] [Release]

2015-03-04 Thread Cristian Consonni
2015-03-04 8:44 GMT+01:00 Dario Taraborelli dtarabore...@wikimedia.org: yay, shiny! The map is a pretty compelling way to show how dominant traffic from the US is, even for very minor languages (say bi.wikipedia.org), I wonder how many requests from US-based bots/automata we’re still failing

Re: [Wiki-research-l] [Release]

2015-03-04 Thread Oliver Keyes
That is the question, and I agree with your conclusion. I'm hoping to do more research into this; getting buyin internally has been tough, but I'm confident of making progress on that front over the next few weeks and months. On 4 March 2015 at 04:13, Cristian Consonni kikkocrist...@gmail.com

Re: [Wiki-research-l] [Release]

2015-03-04 Thread Oliver Keyes
On 4 March 2015 at 04:28, Pine W wiki.p...@gmail.com wrote: I'm not sure how much influence I have, but I would be happy to make whispers in appropriate places to try to get more support, if that's helpful. I think I'm probably good, but thank you. Perhaps you could show your work at the

Re: [Wiki-research-l] [Release]

2015-03-04 Thread Oliver Keyes
'Lots, but that's not currently anyone's job' On Wednesday, 4 March 2015, Dario Taraborelli dtarabore...@wikimedia.org wrote: yay, shiny! The map is a pretty compelling way to show how dominant traffic from the US is, even for very minor languages (say bi.wikipedia.org), I wonder how many

Re: [Wiki-research-l] [Release]

2015-03-04 Thread Pine W
I'm not sure how much influence I have, but I would be happy to make whispers in appropriate places to try to get more support, if that's helpful. Perhaps you could show your work at the next Research and Data showcase? I for one would be interested in seeing a presentation. Pine *This is an

Re: [Wiki-research-l] [Release]

2015-03-04 Thread Scott Hale
Oliver: Scott Hale and I have been working on a paper looking at global reach and how it tracks with internet access growth, in the context of editing, particularly looking at the mobile web. That, we should be done with by then; presenting it could be highly useful (Scott? ;p) I see what

Re: [Wiki-research-l] [Release]

2015-03-03 Thread Dario Taraborelli
yay, shiny! The map is a pretty compelling way to show how dominant traffic from the US is, even for very minor languages (say bi.wikipedia.org), I wonder how many requests from US-based bots/automata we’re still failing to detect. On Mar 3, 2015, at 9:29 PM, Oliver Keyes oke...@wikimedia.org

Re: [Wiki-research-l] [Release]

2015-03-03 Thread Oliver Keyes
Update: the original Shiny instance went down due to server load soon after release. It's now up again at http://datavis.wmflabs.org/where/ on a dedicated Labs machine, where we hope to put...many more visualisations. It also now has mapping, largely thanks to Sarah Laplante

Re: [Wiki-research-l] [Release]

2015-03-02 Thread Oliver Keyes
Indeed! Orienting it that way (pivoting on language rather than project) is something several people have asked for; I plan to spend a chunk of my spare time (that is, recreational time) trying to make it work. Should be fairly trivial. On 2 March 2015 at 09:55, h hant...@gmail.com wrote: Hello

Re: [Wiki-research-l] [Release]

2015-03-02 Thread h
Hello Finn, I do not have a specific answer to your question. However, it might be worthwhile to add Finnish in to the comparison as according to the CLDR 26 T-L information http://www.unicode.org/cldr/charts/26/supplemental/territory_language_information.html You have some sizable Finnish

Re: [Wiki-research-l] [Release]

2015-03-02 Thread Finn Årup Nielsen
Hi Oliver, Interesting dataset! I am curious about why the Danish Wikipedia is so highly acccessed from Sweden. Could it be an error, e.g., with Telia IP-numbers? In Python: import pandas as pd df = pd.read_csv('http://files.figshare.com/1923822/language_pageviews_per_country.tsv',

[Wiki-research-l] [Release]

2015-02-25 Thread Oliver Keyes
Hey all! We've released a highly-aggregated dataset of readership data - specifically, data about where, geographically, traffic to each of our projects (and all of our projects) comes from. The data can be found at http://dx.doi.org/10.6084/m9.figshare.1317408 - additionally, I've put together

Re: [Wiki-research-l] [Release]

2015-02-25 Thread Andrew Lih
Great job. Who knew Esperanto was big in Japan and China at #2 and #3? On Wed, Feb 25, 2015 at 4:06 PM, Oliver Keyes oke...@wikimedia.org wrote: Hey all! We've released a highly-aggregated dataset of readership data - specifically, data about where, geographically, traffic to each of our

Re: [Wiki-research-l] [Release]

2015-02-25 Thread Oliver Keyes
The one major caveat, I think, is that the danger of proportionate data is that it makes small projects very vulnerable to artificial traffic spikes. I'd go out on a limb and say that some of the massive bumps in popularity we see in particular combinations are likely due to either undetected

Re: [Wiki-research-l] [Release]

2015-02-25 Thread Giovanni Luca Ciampaglia
This is really, really cool, great job guys! G Giovanni Luca Ciampaglia ✎ 919 E 10th ∙ Bloomington 47408 IN ∙ USA ☞ http://www.glciampaglia.com/ ✆ +1 812 855-7261 ✉ gciam...@indiana.edu 2015-02-25 16:06 GMT-05:00 Oliver Keyes oke...@wikimedia.org: Hey all! We've released a