Dear Rob, For the internet population I used the data from the wiki since the link was already in the google spreadsheet. For the population details, I typed population followed by the country name in the address bar of google and in most of the searches the result was from www.google.co.in/publicdata. I do not expect much difference between google data and wiki. You can verify few data and the difference may be in the last 3 digits.
With Warm Regards V.Kadal Amutham 919444360480 914422396480 On 23 January 2013 23:58, Samer Mansour <[email protected]> wrote: > Some countries didn't get counted (not matched in the tool/didn't exist). > Some cities got their own row, but the Map doesn't have a spot for them. ie > Hong Kong and Singapore: > Also I think some rows are gone, like "Europe (not specified)", it was > there yesterday. > > Here are two more visualizations: > > > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/apache-openoffice-downloads-vs-pop > > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/apache-openoffice-downloads-vs-pop-2 > > And the original: > > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/apache-openoffice-downloads-by-pop > > I didn't have a design in mind in how we want to show this so I just > plotted and mashed around with it like playdough. > > On Wed, Jan 23, 2013 at 12:37 PM, Rob Weir <[email protected]> wrote: > > > On Wed, Jan 23, 2013 at 10:59 AM, Samer Mansour <[email protected]> > > wrote: > > > Opps here it is: > > > > > > > > > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/apache-openoffice-downloads-by-pop > > > > > > > This is very nice. We might want to take out the "population" and > > "internet users" and the two rank columns and just use the "downloads > > per 1k population" and "downloads per 1k internet users" columns. > > > > I found a few cut & paste errors in the spreadsheet, so it would be > > good if we could have some reviewers double check the numbers. > > > > Compare columns C and E of this spreadsheet: > > > > > > > https://docs.google.com/spreadsheet/ccc?key=0Av4Lhq3W5zKodGZtNU1oRGFjWi1kYXkzVEtjOWY1ZlE#gid=0 > > > > With the data in these two Wikipedia articles. > > > > > http://en.wikipedia.org/wiki/List_of_countries_by_number_of_Internet_users > > > > http://en.wikipedia.org/wiki/List_of_countries_by_population > > > > Once reviewed, and we have the updated maps, then I have an idea. We > > could do a blog post on these numbers, but frame it as a general story > > about end-user use of desktop open source software, etc. If we make > > the story more general interest we'll get broader circulation and > > uptake in the press. Of course, it supports a positive story of our > > project as well. But by telling the broader story, and using > > OpenOffice as an example, we'll go further. > > > > Does this make sense? > > > > -Rob > > > > > On Wed, Jan 23, 2013 at 10:59 AM, Samer Mansour <[email protected]> > > wrote: > > > > > >> Here is an updated map. I moved to bubble chart because coloring the > > >> country made places like Russia and Canada look odd. > > >> I will see if I can make other visualizations as well, maybe Map is > not > > >> the best visualization. > > >> > > >> > > >> On Wed, Jan 23, 2013 at 3:10 AM, Kadal Amutham <[email protected]> > > wrote: > > >> > > >>> Dear Rob Weir, > > >>> > > >>> The internet user data has been filled > > >>> > > >>> With Warm Regards > > >>> > > >>> V.Kadal Amutham > > >>> 919444360480 > > >>> 914422396480 > > >>> > > >>> > > >>> On 23 January 2013 07:23, Kadal Amutham <[email protected]> wrote: > > >>> > > >>> > I will start filling the internet user data > > >>> > > > >>> > With Warm Regards > > >>> > > > >>> > V.Kadal Amutham > > >>> > 919444360480 > > >>> > 914422396480 > > >>> > > > >>> > > > >>> > On 23 January 2013 01:38, Rob Weir <[email protected]> wrote: > > >>> > > > >>> >> On Tue, Jan 22, 2013 at 12:11 PM, Kadal Amutham <[email protected] > > > > >>> wrote: > > >>> >> > Check the google document at > > >>> >> > > > >>> >> > > >>> > > > https://docs.google.com/spreadsheet/ccc?key=0Av4Lhq3W5zKodGZtNU1oRGFjWi1kYXkzVEtjOWY1ZlE#gid=0 > > >>> >> > > > >>> >> > I have entered the population data alone > > >>> >> > > > >>> >> > > >>> >> Very good. I made some edits and also received a spreadsheet with > > some > > >>> >> more data from Gianvittorio. > > >>> >> > > >>> >> The results are interesting. For example, what countries have the > > >>> >> highest percentage of downloads by population? The top 5 are: > > >>> >> > > >>> >> 1) Gambia > > >>> >> 2) Trinidad and Tobago > > >>> >> 3) Zimbabwe > > >>> >> 4) Vatican City > > >>> >> 5) Saint Pierre and Miquelon > > >>> >> > > >>> >> It will be interesting to see how the data is when we look at > usage > > >>> >> per internet users in a country. > > >>> >> > > >>> >> But however we slice the data we'll have this problem: For very > > small > > >>> >> countries, just a few downloads can shift the ratio by a large > > amount. > > >>> >> For example, the Vatican City had around 100 downloads. So a > > >>> >> difference of 10 downloads is 10%. France had 4.5 million > > downloads. > > >>> >> A difference of 10 downloads is nothing. > > >>> >> > > >>> >> I've seen this handled in other contexts by applying statistical > > >>> >> techniques to estimate error bounds on the ratio, and then rank by > > the > > >>> >> lower confidence limit. You can see the technique described (and > > >>> >> formulas given) here: > > >>> >> http://evanmiller.org/how-not-to-sort-by-average-rating.html > > >>> >> > > >>> >> -Rob > > >>> >> > > >>> >> > With Warm Regards > > >>> >> > > > >>> >> > V.Kadal Amutham > > >>> >> > 919444360480 > > >>> >> > 914422396480 > > >>> >> > > > >>> >> > > > >>> >> > On 22 January 2013 22:26, Kadal Amutham <[email protected]> > wrote: > > >>> >> > > > >>> >> >> Good show. I have almost entered the population data into the > > spread > > >>> >> >> sheet. Then I have to populate the internet user data. Once it > is > > >>> >> over, I > > >>> >> >> will let you know. > > >>> >> >> > > >>> >> >> With Warm Regards > > >>> >> >> > > >>> >> >> V.Kadal Amutham > > >>> >> >> 919444360480 > > >>> >> >> 914422396480 > > >>> >> >> > > >>> >> >> > > >>> >> >> On 22 January 2013 21:37, Samer Mansour <[email protected]> > > wrote: > > >>> >> >> > > >>> >> >>> We don't have internet users completely populated so from the > > data > > >>> we > > >>> >> >>> currently have I whipped something up in 5 mins at work. > > >>> >> >>> This is just the top 40 Countries. Its manual data entry, > once > > we > > >>> >> have > > >>> >> >>> internet users populated and I will resolve things like "Hong > > >>> Kong" -> > > >>> >> >>> "China" and Singapore -> South Korea. > > >>> >> >>> > > >>> >> >>> I can maybe finish some data entry tonight. > > >>> >> >>> > > >>> >> >>> > > >>> >> >>> > > >>> >> > > >>> > > > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/aoo-test-visualization > > >>> >> >>> > > >>> >> >>> This is a PNG output: > > >>> >> >>> > > >>> >> >>> > > >>> >> > > >>> > > > http://www-958.ibm.com/software/analytics/manyeyes/vis/FullScreen/fullscreenvisualization.html?id=files%2Fthumbnails%2Ffed90050-64ac-11e2-926b-000255111976.wm.png&visId=ff062cd864ac11e2926b000255111976 > > >>> >> >>> > > >>> >> >>> On Mon, Jan 21, 2013 at 7:44 AM, Rob Weir <[email protected] > > > > >>> wrote: > > >>> >> >>> > > >>> >> >>> > On Sun, Jan 20, 2013 at 10:34 PM, Kadal Amutham < > > >>> [email protected]> > > >>> >> >>> wrote: > > >>> >> >>> > > Dear Mr. Rob, I tried to edit the document, but it is read > > >>> only. > > >>> >> Can > > >>> >> >>> you > > >>> >> >>> > > make it readable so that everybody can fill the data? > > >>> >> >>> > > > > >>> >> >>> > > > >>> >> >>> > OK. I gave you and Samer write permissions. > > >>> >> >>> > > > >>> >> >>> > > > >>> >> >>> > -Rob > > >>> >> >>> > > > >>> >> >>> > > > >>> >> >>> > > With Warm Regards > > >>> >> >>> > > > > >>> >> >>> > > V.Kadal Amutham > > >>> >> >>> > > 919444360480 > > >>> >> >>> > > 914422396480 > > >>> >> >>> > > > > >>> >> >>> > > > > >>> >> >>> > > On 21 January 2013 09:01, Kadal Amutham <[email protected] > > > > >>> wrote: > > >>> >> >>> > > > > >>> >> >>> > >> Who ever finds time, can fill the remaining Data, so that > > the > > >>> >> >>> document > > >>> >> >>> > >> become complete > > >>> >> >>> > >> > > >>> >> >>> > >> With Warm Regards > > >>> >> >>> > >> > > >>> >> >>> > >> V.Kadal Amutham > > >>> >> >>> > >> 919444360480 > > >>> >> >>> > >> 914422396480 > > >>> >> >>> > >> > > >>> >> >>> > >> > > >>> >> >>> > >> On 21 January 2013 07:46, Rob Weir <[email protected]> > > >>> wrote: > > >>> >> >>> > >> > > >>> >> >>> > >>> On Sat, Jan 19, 2013 at 12:34 PM, Roberto Galoppini < > > >>> >> >>> > [email protected]> > > >>> >> >>> > >>> wrote: > > >>> >> >>> > >>> > On Fri, Jan 18, 2013 at 10:34 PM, Rob Weir < > > >>> >> [email protected]> > > >>> >> >>> > wrote: > > >>> >> >>> > >>> > > > >>> >> >>> > >>> >> I was thinking of putting together a world map > showing > > the > > >>> >> use of > > >>> >> >>> > >>> >> OpenOffice, the kind with each country shaded or > color > > >>> coded > > >>> >> to > > >>> >> >>> show > > >>> >> >>> > >>> >> the density of use. > > >>> >> >>> > >>> >> > > >>> >> >>> > >>> >> I can easily get a data set showing the total number > of > > >>> >> >>> downloads of > > >>> >> >>> > >>> >> OpenOffice per country. But the raw numbers don't > > really > > >>> >> tell > > >>> >> >>> the > > >>> >> >>> > >>> >> story. It would show, probably, that the USA has the > > most > > >>> >> >>> > downloads. > > >>> >> >>> > >>> >> But that is probably also because of its large > > population. > > >>> >> >>> > >>> >> > > >>> >> >>> > >>> >> So maybe we then show downloads per capita, or > > downloads > > >>> per > > >>> >> >>> 100,000 > > >>> >> >>> > >>> >> population. But that then becomes a proxy for > economic > > >>> >> >>> development, > > >>> >> >>> > >>> >> since there are highly populated countries with fewer > > >>> >> computers > > >>> >> >>> per > > >>> >> >>> > >>> >> capital, and low population countries with more > > computers, > > >>> >> etc. > > >>> >> >>> I > > >>> >> >>> > >>> >> don't think that is what we want to show. > > >>> >> >>> > >>> >> > > >>> >> >>> > >>> >> So, I'm wondering, has anyone seen data for something > > like > > >>> >> PCs > > >>> >> >>> per > > >>> >> >>> > >>> >> capita, or home computers, or internet users, or some > > >>> other > > >>> >> proxy > > >>> >> >>> > for > > >>> >> >>> > >>> >> what our potential usership would be per country? > > >>> >> >>> > >>> >> > > >>> >> >>> > >>> > > > >>> >> >>> > >>> > Maybe we can use internet users stats by country, > > something > > >>> >> like: > > >>> >> >>> > >>> > > > >>> >> >>> > >>> > > >>> >> >>> > > > >>> >> >>> > > >>> >> > > >>> > > > http://en.wikipedia.org/wiki/List_of_countries_by_number_of_Internet_usersto > > >>> >> >>> > >>> > normalize our stats. > > >>> >> >>> > >>> > > > >>> >> >>> > >>> > > >>> >> >>> > >>> Thanks, that looks useful. I started entering the data > > into > > >>> a > > >>> >> >>> > >>> spreadsheet: > > >>> >> >>> > >>> > > >>> >> >>> > >>> > > >>> >> >>> > >>> > > >>> >> >>> > > > >>> >> >>> > > >>> >> > > >>> > > > https://docs.google.com/spreadsheet/ccc?key=0Av4Lhq3W5zKodGZtNU1oRGFjWi1kYXkzVEtjOWY1ZlE > > >>> >> >>> > >>> > > >>> >> >>> > >>> As you see, Italy is at the top, if you look at > downloads > > per > > >>> >> 1000 > > >>> >> >>> > >>> internet users, with 91. So nearly one in ten in Italy > > have > > >>> >> >>> > >>> downloaded AOO! > > >>> >> >>> > >>> > > >>> >> >>> > >>> -Rob > > >>> >> >>> > >>> > > >>> >> >>> > >>> > > > >>> >> >>> > >>> > > > >>> >> >>> > >>> >> Regards, > > >>> >> >>> > >>> >> > > >>> >> >>> > >>> >> -Rob > > >>> >> >>> > >>> >> > > >>> >> >>> > >>> > > > >>> >> >>> > >>> > -- > > >>> >> >>> > >>> > ==== > > >>> >> >>> > >>> > This e- mail message is intended only for the named > > >>> >> recipient(s) > > >>> >> >>> > above. > > >>> >> >>> > >>> It > > >>> >> >>> > >>> > may contain confidential and privileged information. > If > > you > > >>> >> are > > >>> >> >>> not > > >>> >> >>> > the > > >>> >> >>> > >>> > intended recipient you are hereby notified that any > > >>> >> dissemination, > > >>> >> >>> > >>> > distribution or copying of this e-mail and any > > >>> attachment(s) > > >>> >> is > > >>> >> >>> > strictly > > >>> >> >>> > >>> > prohibited. If you have received this e-mail in error, > > >>> please > > >>> >> >>> > >>> immediately > > >>> >> >>> > >>> > notify the sender by replying to this e-mail and > delete > > the > > >>> >> >>> message > > >>> >> >>> > and > > >>> >> >>> > >>> any > > >>> >> >>> > >>> > attachment(s) from your system. Thank you. > > >>> >> >>> > >>> > > > >>> >> >>> > >>> > > >>> >> >>> > >> > > >>> >> >>> > >> > > >>> >> >>> > > > >>> >> >>> > > >>> >> >> > > >>> >> >> > > >>> >> > > >>> > > > >>> > > > >>> > > >> > > >> > > >
