Any reason why France tops the table? With Warm Regards
V.Kadal Amutham 919444360480 914422396480 On 25 January 2013 19:48, Rob Weir <[email protected]> wrote: > On Thu, Jan 24, 2013 at 10:05 PM, Samer Mansour <[email protected]> > wrote: > > I'm a little stumped how to visualize this in a meaningful way. > > > > Here it is again with the latest data. This is the best visual I think > out > > of the few I tried: > > > > > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/aoo-downloads-download-density > > > > > This is nice, thanks. The bubbles are easier to understand than color > shading, I think. > > Since we recently had all the security stories about Java in the > browser, I think for the blog post on this I'll use some static screen > shots, a global one as well as close us for some of the denser areas, > like Central America and Europe. Then we can link to Manyeyes for > anyone who wants to use this interactively. > > Is the data series published publicly as well, on ManyEyes, so others > can make their own visualizations? > > -Rob > > > > > On Thu, Jan 24, 2013 at 10:28 AM, Rob Weir <[email protected]> wrote: > >> > >> On Wed, Jan 23, 2013 at 7:16 PM, Kadal Amutham <[email protected]> > wrote: > >> > Dear Rob, > >> > > >> > For the internet population I used the data from the wiki since the > link > >> > was already in the google spreadsheet. For the population details, I > >> > typed > >> > population followed by the country name in the address bar of google > and > >> > in > >> > most of the searches the result was from www.google.co.in/publicdata. > I > >> > do > >> > not expect much difference between google data and wiki. You can > verify > >> > few > >> > data and the difference may be in the last 3 digits. > >> > > >> > >> I reviewed the data. Most of it looked fine. As you say, maybe in > >> the last few digits. I only found a few things to fix. For example > >> (and I didn't know this before) there are two Congo's now: Democratic > >> Republic of the Congo and the Republic of the Congo. It looks like the > >> data there was reversed. Little things like that. > >> > >> The top 30 countries, for downloading AOO, normalized by number of > >> internet users in the country, are: > >> > >> 1. Monaco > >> 2. Italy > >> 3. France > >> 4. San Marino > >> 5. Luxembourg > >> 6. Belgium > >> 7. Estonia > >> 8. Germany > >> 9. Switzerland > >> 10. Finland > >> 11. Spain > >> 12. Hong Kong > >> 13. Austria > >> 14. Netherlands > >> 15. Canada > >> 16. Belize > >> 17. Czech Republic > >> 18. Falkland Islands (Malvinas) > >> 19. Faroe Islands > >> 20. Malta > >> 21. Singapore > >> 22. Iceland > >> 23. Taiwan > >> 24. United Kingdom > >> 25. United States > >> 26. Uruguay > >> 27. Hungary > >> 28. New Zealand > >> 29. Ireland > >> 30. Liechtenstein > >> > >> I'm attaching the full spreadsheet. Hopefully Samer can put the > >> updated figures on the ManEyes website. > >> > >> The countries at the bottom of the list, are some where we have > >> opportunities for growth. For example, South Korea should improve > >> once the Korean translation is released next week. > >> > >> -Rob > >> > >> > >> > With Warm Regards > >> > > >> > V.Kadal Amutham > >> > 919444360480 > >> > 914422396480 > >> > > >> > > >> > On 23 January 2013 23:58, Samer Mansour <[email protected]> wrote: > >> > > >> >> Some countries didn't get counted (not matched in the tool/didn't > >> >> exist). > >> >> Some cities got their own row, but the Map doesn't have a spot for > >> >> them. ie > >> >> Hong Kong and Singapore: > >> >> Also I think some rows are gone, like "Europe (not specified)", it > was > >> >> there yesterday. > >> >> > >> >> Here are two more visualizations: > >> >> > >> >> > >> >> > >> >> > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/apache-openoffice-downloads-vs-pop > >> >> > >> >> > >> >> > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/apache-openoffice-downloads-vs-pop-2 > >> >> > >> >> And the original: > >> >> > >> >> > >> >> > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/apache-openoffice-downloads-by-pop > >> >> > >> >> I didn't have a design in mind in how we want to show this so I just > >> >> plotted and mashed around with it like playdough. > >> >> > >> >> On Wed, Jan 23, 2013 at 12:37 PM, Rob Weir <[email protected]> > wrote: > >> >> > >> >> > On Wed, Jan 23, 2013 at 10:59 AM, Samer Mansour < > [email protected]> > >> >> > wrote: > >> >> > > Opps here it is: > >> >> > > > >> >> > > > >> >> > > >> >> > >> >> > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/apache-openoffice-downloads-by-pop > >> >> > > > >> >> > > >> >> > This is very nice. We might want to take out the "population" and > >> >> > "internet users" and the two rank columns and just use the > "downloads > >> >> > per 1k population" and "downloads per 1k internet users" columns. > >> >> > > >> >> > I found a few cut & paste errors in the spreadsheet, so it would be > >> >> > good if we could have some reviewers double check the numbers. > >> >> > > >> >> > Compare columns C and E of this spreadsheet: > >> >> > > >> >> > > >> >> > > >> >> > >> >> > https://docs.google.com/spreadsheet/ccc?key=0Av4Lhq3W5zKodGZtNU1oRGFjWi1kYXkzVEtjOWY1ZlE#gid=0 > >> >> > > >> >> > With the data in these two Wikipedia articles. > >> >> > > >> >> > > >> >> > >> >> > http://en.wikipedia.org/wiki/List_of_countries_by_number_of_Internet_users > >> >> > > >> >> > http://en.wikipedia.org/wiki/List_of_countries_by_population > >> >> > > >> >> > Once reviewed, and we have the updated maps, then I have an idea. > We > >> >> > could do a blog post on these numbers, but frame it as a general > >> >> > story > >> >> > about end-user use of desktop open source software, etc. If we > make > >> >> > the story more general interest we'll get broader circulation and > >> >> > uptake in the press. Of course, it supports a positive story of > our > >> >> > project as well. But by telling the broader story, and using > >> >> > OpenOffice as an example, we'll go further. > >> >> > > >> >> > Does this make sense? > >> >> > > >> >> > -Rob > >> >> > > >> >> > > On Wed, Jan 23, 2013 at 10:59 AM, Samer Mansour > >> >> > > <[email protected]> > >> >> > wrote: > >> >> > > > >> >> > >> Here is an updated map. I moved to bubble chart because > coloring > >> >> > >> the > >> >> > >> country made places like Russia and Canada look odd. > >> >> > >> I will see if I can make other visualizations as well, maybe Map > >> >> > >> is > >> >> not > >> >> > >> the best visualization. > >> >> > >> > >> >> > >> > >> >> > >> On Wed, Jan 23, 2013 at 3:10 AM, Kadal Amutham < > [email protected]> > >> >> > wrote: > >> >> > >> > >> >> > >>> Dear Rob Weir, > >> >> > >>> > >> >> > >>> The internet user data has been filled > >> >> > >>> > >> >> > >>> With Warm Regards > >> >> > >>> > >> >> > >>> V.Kadal Amutham > >> >> > >>> 919444360480 > >> >> > >>> 914422396480 > >> >> > >>> > >> >> > >>> > >> >> > >>> On 23 January 2013 07:23, Kadal Amutham <[email protected]> > wrote: > >> >> > >>> > >> >> > >>> > I will start filling the internet user data > >> >> > >>> > > >> >> > >>> > With Warm Regards > >> >> > >>> > > >> >> > >>> > V.Kadal Amutham > >> >> > >>> > 919444360480 > >> >> > >>> > 914422396480 > >> >> > >>> > > >> >> > >>> > > >> >> > >>> > On 23 January 2013 01:38, Rob Weir <[email protected]> > wrote: > >> >> > >>> > > >> >> > >>> >> On Tue, Jan 22, 2013 at 12:11 PM, Kadal Amutham > >> >> > >>> >> <[email protected] > >> >> > > >> >> > >>> wrote: > >> >> > >>> >> > Check the google document at > >> >> > >>> >> > > >> >> > >>> >> > >> >> > >>> > >> >> > > >> >> > >> >> > https://docs.google.com/spreadsheet/ccc?key=0Av4Lhq3W5zKodGZtNU1oRGFjWi1kYXkzVEtjOWY1ZlE#gid=0 > >> >> > >>> >> > > >> >> > >>> >> > I have entered the population data alone > >> >> > >>> >> > > >> >> > >>> >> > >> >> > >>> >> Very good. I made some edits and also received a spreadsheet > >> >> > >>> >> with > >> >> > some > >> >> > >>> >> more data from Gianvittorio. > >> >> > >>> >> > >> >> > >>> >> The results are interesting. For example, what countries > have > >> >> > >>> >> the > >> >> > >>> >> highest percentage of downloads by population? The top 5 > are: > >> >> > >>> >> > >> >> > >>> >> 1) Gambia > >> >> > >>> >> 2) Trinidad and Tobago > >> >> > >>> >> 3) Zimbabwe > >> >> > >>> >> 4) Vatican City > >> >> > >>> >> 5) Saint Pierre and Miquelon > >> >> > >>> >> > >> >> > >>> >> It will be interesting to see how the data is when we look > at > >> >> usage > >> >> > >>> >> per internet users in a country. > >> >> > >>> >> > >> >> > >>> >> But however we slice the data we'll have this problem: For > >> >> > >>> >> very > >> >> > small > >> >> > >>> >> countries, just a few downloads can shift the ratio by a > large > >> >> > amount. > >> >> > >>> >> For example, the Vatican City had around 100 downloads. > So a > >> >> > >>> >> difference of 10 downloads is 10%. France had 4.5 million > >> >> > downloads. > >> >> > >>> >> A difference of 10 downloads is nothing. > >> >> > >>> >> > >> >> > >>> >> I've seen this handled in other contexts by applying > >> >> > >>> >> statistical > >> >> > >>> >> techniques to estimate error bounds on the ratio, and then > >> >> > >>> >> rank by > >> >> > the > >> >> > >>> >> lower confidence limit. You can see the technique described > >> >> > >>> >> (and > >> >> > >>> >> formulas given) here: > >> >> > >>> >> > http://evanmiller.org/how-not-to-sort-by-average-rating.html > >> >> > >>> >> > >> >> > >>> >> -Rob > >> >> > >>> >> > >> >> > >>> >> > With Warm Regards > >> >> > >>> >> > > >> >> > >>> >> > V.Kadal Amutham > >> >> > >>> >> > 919444360480 > >> >> > >>> >> > 914422396480 > >> >> > >>> >> > > >> >> > >>> >> > > >> >> > >>> >> > On 22 January 2013 22:26, Kadal Amutham <[email protected] > > > >> >> wrote: > >> >> > >>> >> > > >> >> > >>> >> >> Good show. I have almost entered the population data into > >> >> > >>> >> >> the > >> >> > spread > >> >> > >>> >> >> sheet. Then I have to populate the internet user data. > Once > >> >> > >>> >> >> it > >> >> is > >> >> > >>> >> over, I > >> >> > >>> >> >> will let you know. > >> >> > >>> >> >> > >> >> > >>> >> >> With Warm Regards > >> >> > >>> >> >> > >> >> > >>> >> >> V.Kadal Amutham > >> >> > >>> >> >> 919444360480 > >> >> > >>> >> >> 914422396480 > >> >> > >>> >> >> > >> >> > >>> >> >> > >> >> > >>> >> >> On 22 January 2013 21:37, Samer Mansour > >> >> > >>> >> >> <[email protected]> > >> >> > wrote: > >> >> > >>> >> >> > >> >> > >>> >> >>> We don't have internet users completely populated so > from > >> >> > >>> >> >>> the > >> >> > data > >> >> > >>> we > >> >> > >>> >> >>> currently have I whipped something up in 5 mins at work. > >> >> > >>> >> >>> This is just the top 40 Countries. Its manual data > entry, > >> >> once > >> >> > we > >> >> > >>> >> have > >> >> > >>> >> >>> internet users populated and I will resolve things like > >> >> > >>> >> >>> "Hong > >> >> > >>> Kong" -> > >> >> > >>> >> >>> "China" and Singapore -> South Korea. > >> >> > >>> >> >>> > >> >> > >>> >> >>> I can maybe finish some data entry tonight. > >> >> > >>> >> >>> > >> >> > >>> >> >>> > >> >> > >>> >> >>> > >> >> > >>> >> > >> >> > >>> > >> >> > > >> >> > >> >> > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/aoo-test-visualization > >> >> > >>> >> >>> > >> >> > >>> >> >>> This is a PNG output: > >> >> > >>> >> >>> > >> >> > >>> >> >>> > >> >> > >>> >> > >> >> > >>> > >> >> > > >> >> > >> >> > http://www-958.ibm.com/software/analytics/manyeyes/vis/FullScreen/fullscreenvisualization.html?id=files%2Fthumbnails%2Ffed90050-64ac-11e2-926b-000255111976.wm.png&visId=ff062cd864ac11e2926b000255111976 > >> >> > >>> >> >>> > >> >> > >>> >> >>> On Mon, Jan 21, 2013 at 7:44 AM, Rob Weir > >> >> > >>> >> >>> <[email protected] > >> >> > > >> >> > >>> wrote: > >> >> > >>> >> >>> > >> >> > >>> >> >>> > On Sun, Jan 20, 2013 at 10:34 PM, Kadal Amutham < > >> >> > >>> [email protected]> > >> >> > >>> >> >>> wrote: > >> >> > >>> >> >>> > > Dear Mr. Rob, I tried to edit the document, but it > is > >> >> > >>> >> >>> > > read > >> >> > >>> only. > >> >> > >>> >> Can > >> >> > >>> >> >>> you > >> >> > >>> >> >>> > > make it readable so that everybody can fill the > data? > >> >> > >>> >> >>> > > > >> >> > >>> >> >>> > > >> >> > >>> >> >>> > OK. I gave you and Samer write permissions. > >> >> > >>> >> >>> > > >> >> > >>> >> >>> > > >> >> > >>> >> >>> > -Rob > >> >> > >>> >> >>> > > >> >> > >>> >> >>> > > >> >> > >>> >> >>> > > With Warm Regards > >> >> > >>> >> >>> > > > >> >> > >>> >> >>> > > V.Kadal Amutham > >> >> > >>> >> >>> > > 919444360480 > >> >> > >>> >> >>> > > 914422396480 > >> >> > >>> >> >>> > > > >> >> > >>> >> >>> > > > >> >> > >>> >> >>> > > On 21 January 2013 09:01, Kadal Amutham > >> >> > >>> >> >>> > > <[email protected] > >> >> > > >> >> > >>> wrote: > >> >> > >>> >> >>> > > > >> >> > >>> >> >>> > >> Who ever finds time, can fill the remaining Data, > so > >> >> > >>> >> >>> > >> that > >> >> > the > >> >> > >>> >> >>> document > >> >> > >>> >> >>> > >> become complete > >> >> > >>> >> >>> > >> > >> >> > >>> >> >>> > >> With Warm Regards > >> >> > >>> >> >>> > >> > >> >> > >>> >> >>> > >> V.Kadal Amutham > >> >> > >>> >> >>> > >> 919444360480 > >> >> > >>> >> >>> > >> 914422396480 > >> >> > >>> >> >>> > >> > >> >> > >>> >> >>> > >> > >> >> > >>> >> >>> > >> On 21 January 2013 07:46, Rob Weir > >> >> > >>> >> >>> > >> <[email protected]> > >> >> > >>> wrote: > >> >> > >>> >> >>> > >> > >> >> > >>> >> >>> > >>> On Sat, Jan 19, 2013 at 12:34 PM, Roberto > Galoppini > >> >> > >>> >> >>> > >>> < > >> >> > >>> >> >>> > [email protected]> > >> >> > >>> >> >>> > >>> wrote: > >> >> > >>> >> >>> > >>> > On Fri, Jan 18, 2013 at 10:34 PM, Rob Weir < > >> >> > >>> >> [email protected]> > >> >> > >>> >> >>> > wrote: > >> >> > >>> >> >>> > >>> > > >> >> > >>> >> >>> > >>> >> I was thinking of putting together a world map > >> >> showing > >> >> > the > >> >> > >>> >> use of > >> >> > >>> >> >>> > >>> >> OpenOffice, the kind with each country shaded > or > >> >> color > >> >> > >>> coded > >> >> > >>> >> to > >> >> > >>> >> >>> show > >> >> > >>> >> >>> > >>> >> the density of use. > >> >> > >>> >> >>> > >>> >> > >> >> > >>> >> >>> > >>> >> I can easily get a data set showing the total > >> >> > >>> >> >>> > >>> >> number > >> >> of > >> >> > >>> >> >>> downloads of > >> >> > >>> >> >>> > >>> >> OpenOffice per country. But the raw numbers > >> >> > >>> >> >>> > >>> >> don't > >> >> > really > >> >> > >>> >> tell > >> >> > >>> >> >>> the > >> >> > >>> >> >>> > >>> >> story. It would show, probably, that the USA > has > >> >> > >>> >> >>> > >>> >> the > >> >> > most > >> >> > >>> >> >>> > downloads. > >> >> > >>> >> >>> > >>> >> But that is probably also because of its large > >> >> > population. > >> >> > >>> >> >>> > >>> >> > >> >> > >>> >> >>> > >>> >> So maybe we then show downloads per capita, or > >> >> > downloads > >> >> > >>> per > >> >> > >>> >> >>> 100,000 > >> >> > >>> >> >>> > >>> >> population. But that then becomes a proxy for > >> >> economic > >> >> > >>> >> >>> development, > >> >> > >>> >> >>> > >>> >> since there are highly populated countries with > >> >> > >>> >> >>> > >>> >> fewer > >> >> > >>> >> computers > >> >> > >>> >> >>> per > >> >> > >>> >> >>> > >>> >> capital, and low population countries with more > >> >> > computers, > >> >> > >>> >> etc. > >> >> > >>> >> >>> I > >> >> > >>> >> >>> > >>> >> don't think that is what we want to show. > >> >> > >>> >> >>> > >>> >> > >> >> > >>> >> >>> > >>> >> So, I'm wondering, has anyone seen data for > >> >> > >>> >> >>> > >>> >> something > >> >> > like > >> >> > >>> >> PCs > >> >> > >>> >> >>> per > >> >> > >>> >> >>> > >>> >> capita, or home computers, or internet users, > or > >> >> > >>> >> >>> > >>> >> some > >> >> > >>> other > >> >> > >>> >> proxy > >> >> > >>> >> >>> > for > >> >> > >>> >> >>> > >>> >> what our potential usership would be per > country? > >> >> > >>> >> >>> > >>> >> > >> >> > >>> >> >>> > >>> > > >> >> > >>> >> >>> > >>> > Maybe we can use internet users stats by > country, > >> >> > something > >> >> > >>> >> like: > >> >> > >>> >> >>> > >>> > > >> >> > >>> >> >>> > >>> > >> >> > >>> >> >>> > > >> >> > >>> >> >>> > >> >> > >>> >> > >> >> > >>> > >> >> > > >> >> > >> >> > http://en.wikipedia.org/wiki/List_of_countries_by_number_of_Internet_usersto > >> >> > >>> >> >>> > >>> > normalize our stats. > >> >> > >>> >> >>> > >>> > > >> >> > >>> >> >>> > >>> > >> >> > >>> >> >>> > >>> Thanks, that looks useful. I started entering the > >> >> > >>> >> >>> > >>> data > >> >> > into > >> >> > >>> a > >> >> > >>> >> >>> > >>> spreadsheet: > >> >> > >>> >> >>> > >>> > >> >> > >>> >> >>> > >>> > >> >> > >>> >> >>> > >>> > >> >> > >>> >> >>> > > >> >> > >>> >> >>> > >> >> > >>> >> > >> >> > >>> > >> >> > > >> >> > >> >> > https://docs.google.com/spreadsheet/ccc?key=0Av4Lhq3W5zKodGZtNU1oRGFjWi1kYXkzVEtjOWY1ZlE > >> >> > >>> >> >>> > >>> > >> >> > >>> >> >>> > >>> As you see, Italy is at the top, if you look at > >> >> downloads > >> >> > per > >> >> > >>> >> 1000 > >> >> > >>> >> >>> > >>> internet users, with 91. So nearly one in ten in > >> >> > >>> >> >>> > >>> Italy > >> >> > have > >> >> > >>> >> >>> > >>> downloaded AOO! > >> >> > >>> >> >>> > >>> > >> >> > >>> >> >>> > >>> -Rob > >> >> > >>> >> >>> > >>> > >> >> > >>> >> >>> > >>> > > >> >> > >>> >> >>> > >>> > > >> >> > >>> >> >>> > >>> >> Regards, > >> >> > >>> >> >>> > >>> >> > >> >> > >>> >> >>> > >>> >> -Rob > >> >> > >>> >> >>> > >>> >> > >> >> > >>> >> >>> > >>> > > >> >> > >>> >> >>> > >>> > -- > >> >> > >>> >> >>> > >>> > ==== > >> >> > >>> >> >>> > >>> > This e- mail message is intended only for the > >> >> > >>> >> >>> > >>> > named > >> >> > >>> >> recipient(s) > >> >> > >>> >> >>> > above. > >> >> > >>> >> >>> > >>> It > >> >> > >>> >> >>> > >>> > may contain confidential and privileged > >> >> > >>> >> >>> > >>> > information. > >> >> If > >> >> > you > >> >> > >>> >> are > >> >> > >>> >> >>> not > >> >> > >>> >> >>> > the > >> >> > >>> >> >>> > >>> > intended recipient you are hereby notified that > >> >> > >>> >> >>> > >>> > any > >> >> > >>> >> dissemination, > >> >> > >>> >> >>> > >>> > distribution or copying of this e-mail and any > >> >> > >>> attachment(s) > >> >> > >>> >> is > >> >> > >>> >> >>> > strictly > >> >> > >>> >> >>> > >>> > prohibited. If you have received this e-mail in > >> >> > >>> >> >>> > >>> > error, > >> >> > >>> please > >> >> > >>> >> >>> > >>> immediately > >> >> > >>> >> >>> > >>> > notify the sender by replying to this e-mail and > >> >> delete > >> >> > the > >> >> > >>> >> >>> message > >> >> > >>> >> >>> > and > >> >> > >>> >> >>> > >>> any > >> >> > >>> >> >>> > >>> > attachment(s) from your system. Thank you. > >> >> > >>> >> >>> > >>> > > >> >> > >>> >> >>> > >>> > >> >> > >>> >> >>> > >> > >> >> > >>> >> >>> > >> > >> >> > >>> >> >>> > > >> >> > >>> >> >>> > >> >> > >>> >> >> > >> >> > >>> >> >> > >> >> > >>> >> > >> >> > >>> > > >> >> > >>> > > >> >> > >>> > >> >> > >> > >> >> > >> > >> >> > > >> >> > > > > >
