On Wed, Jan 23, 2013 at 10:59 AM, Samer Mansour <[email protected]> wrote: > Opps here it is: > > http://www-958.ibm.com/software/analytics/manyeyes/visualizations/apache-openoffice-downloads-by-pop >
This is very nice. We might want to take out the "population" and "internet users" and the two rank columns and just use the "downloads per 1k population" and "downloads per 1k internet users" columns. I found a few cut & paste errors in the spreadsheet, so it would be good if we could have some reviewers double check the numbers. Compare columns C and E of this spreadsheet: https://docs.google.com/spreadsheet/ccc?key=0Av4Lhq3W5zKodGZtNU1oRGFjWi1kYXkzVEtjOWY1ZlE#gid=0 With the data in these two Wikipedia articles. http://en.wikipedia.org/wiki/List_of_countries_by_number_of_Internet_users http://en.wikipedia.org/wiki/List_of_countries_by_population Once reviewed, and we have the updated maps, then I have an idea. We could do a blog post on these numbers, but frame it as a general story about end-user use of desktop open source software, etc. If we make the story more general interest we'll get broader circulation and uptake in the press. Of course, it supports a positive story of our project as well. But by telling the broader story, and using OpenOffice as an example, we'll go further. Does this make sense? -Rob > On Wed, Jan 23, 2013 at 10:59 AM, Samer Mansour <[email protected]> wrote: > >> Here is an updated map. I moved to bubble chart because coloring the >> country made places like Russia and Canada look odd. >> I will see if I can make other visualizations as well, maybe Map is not >> the best visualization. >> >> >> On Wed, Jan 23, 2013 at 3:10 AM, Kadal Amutham <[email protected]> wrote: >> >>> Dear Rob Weir, >>> >>> The internet user data has been filled >>> >>> With Warm Regards >>> >>> V.Kadal Amutham >>> 919444360480 >>> 914422396480 >>> >>> >>> On 23 January 2013 07:23, Kadal Amutham <[email protected]> wrote: >>> >>> > I will start filling the internet user data >>> > >>> > With Warm Regards >>> > >>> > V.Kadal Amutham >>> > 919444360480 >>> > 914422396480 >>> > >>> > >>> > On 23 January 2013 01:38, Rob Weir <[email protected]> wrote: >>> > >>> >> On Tue, Jan 22, 2013 at 12:11 PM, Kadal Amutham <[email protected]> >>> wrote: >>> >> > Check the google document at >>> >> > >>> >> >>> https://docs.google.com/spreadsheet/ccc?key=0Av4Lhq3W5zKodGZtNU1oRGFjWi1kYXkzVEtjOWY1ZlE#gid=0 >>> >> > >>> >> > I have entered the population data alone >>> >> > >>> >> >>> >> Very good. I made some edits and also received a spreadsheet with some >>> >> more data from Gianvittorio. >>> >> >>> >> The results are interesting. For example, what countries have the >>> >> highest percentage of downloads by population? The top 5 are: >>> >> >>> >> 1) Gambia >>> >> 2) Trinidad and Tobago >>> >> 3) Zimbabwe >>> >> 4) Vatican City >>> >> 5) Saint Pierre and Miquelon >>> >> >>> >> It will be interesting to see how the data is when we look at usage >>> >> per internet users in a country. >>> >> >>> >> But however we slice the data we'll have this problem: For very small >>> >> countries, just a few downloads can shift the ratio by a large amount. >>> >> For example, the Vatican City had around 100 downloads. So a >>> >> difference of 10 downloads is 10%. France had 4.5 million downloads. >>> >> A difference of 10 downloads is nothing. >>> >> >>> >> I've seen this handled in other contexts by applying statistical >>> >> techniques to estimate error bounds on the ratio, and then rank by the >>> >> lower confidence limit. You can see the technique described (and >>> >> formulas given) here: >>> >> http://evanmiller.org/how-not-to-sort-by-average-rating.html >>> >> >>> >> -Rob >>> >> >>> >> > With Warm Regards >>> >> > >>> >> > V.Kadal Amutham >>> >> > 919444360480 >>> >> > 914422396480 >>> >> > >>> >> > >>> >> > On 22 January 2013 22:26, Kadal Amutham <[email protected]> wrote: >>> >> > >>> >> >> Good show. I have almost entered the population data into the spread >>> >> >> sheet. Then I have to populate the internet user data. Once it is >>> >> over, I >>> >> >> will let you know. >>> >> >> >>> >> >> With Warm Regards >>> >> >> >>> >> >> V.Kadal Amutham >>> >> >> 919444360480 >>> >> >> 914422396480 >>> >> >> >>> >> >> >>> >> >> On 22 January 2013 21:37, Samer Mansour <[email protected]> wrote: >>> >> >> >>> >> >>> We don't have internet users completely populated so from the data >>> we >>> >> >>> currently have I whipped something up in 5 mins at work. >>> >> >>> This is just the top 40 Countries. Its manual data entry, once we >>> >> have >>> >> >>> internet users populated and I will resolve things like "Hong >>> Kong" -> >>> >> >>> "China" and Singapore -> South Korea. >>> >> >>> >>> >> >>> I can maybe finish some data entry tonight. >>> >> >>> >>> >> >>> >>> >> >>> >>> >> >>> http://www-958.ibm.com/software/analytics/manyeyes/visualizations/aoo-test-visualization >>> >> >>> >>> >> >>> This is a PNG output: >>> >> >>> >>> >> >>> >>> >> >>> http://www-958.ibm.com/software/analytics/manyeyes/vis/FullScreen/fullscreenvisualization.html?id=files%2Fthumbnails%2Ffed90050-64ac-11e2-926b-000255111976.wm.png&visId=ff062cd864ac11e2926b000255111976 >>> >> >>> >>> >> >>> On Mon, Jan 21, 2013 at 7:44 AM, Rob Weir <[email protected]> >>> wrote: >>> >> >>> >>> >> >>> > On Sun, Jan 20, 2013 at 10:34 PM, Kadal Amutham < >>> [email protected]> >>> >> >>> wrote: >>> >> >>> > > Dear Mr. Rob, I tried to edit the document, but it is read >>> only. >>> >> Can >>> >> >>> you >>> >> >>> > > make it readable so that everybody can fill the data? >>> >> >>> > > >>> >> >>> > >>> >> >>> > OK. I gave you and Samer write permissions. >>> >> >>> > >>> >> >>> > >>> >> >>> > -Rob >>> >> >>> > >>> >> >>> > >>> >> >>> > > With Warm Regards >>> >> >>> > > >>> >> >>> > > V.Kadal Amutham >>> >> >>> > > 919444360480 >>> >> >>> > > 914422396480 >>> >> >>> > > >>> >> >>> > > >>> >> >>> > > On 21 January 2013 09:01, Kadal Amutham <[email protected]> >>> wrote: >>> >> >>> > > >>> >> >>> > >> Who ever finds time, can fill the remaining Data, so that the >>> >> >>> document >>> >> >>> > >> become complete >>> >> >>> > >> >>> >> >>> > >> With Warm Regards >>> >> >>> > >> >>> >> >>> > >> V.Kadal Amutham >>> >> >>> > >> 919444360480 >>> >> >>> > >> 914422396480 >>> >> >>> > >> >>> >> >>> > >> >>> >> >>> > >> On 21 January 2013 07:46, Rob Weir <[email protected]> >>> wrote: >>> >> >>> > >> >>> >> >>> > >>> On Sat, Jan 19, 2013 at 12:34 PM, Roberto Galoppini < >>> >> >>> > [email protected]> >>> >> >>> > >>> wrote: >>> >> >>> > >>> > On Fri, Jan 18, 2013 at 10:34 PM, Rob Weir < >>> >> [email protected]> >>> >> >>> > wrote: >>> >> >>> > >>> > >>> >> >>> > >>> >> I was thinking of putting together a world map showing the >>> >> use of >>> >> >>> > >>> >> OpenOffice, the kind with each country shaded or color >>> coded >>> >> to >>> >> >>> show >>> >> >>> > >>> >> the density of use. >>> >> >>> > >>> >> >>> >> >>> > >>> >> I can easily get a data set showing the total number of >>> >> >>> downloads of >>> >> >>> > >>> >> OpenOffice per country. But the raw numbers don't really >>> >> tell >>> >> >>> the >>> >> >>> > >>> >> story. It would show, probably, that the USA has the most >>> >> >>> > downloads. >>> >> >>> > >>> >> But that is probably also because of its large population. >>> >> >>> > >>> >> >>> >> >>> > >>> >> So maybe we then show downloads per capita, or downloads >>> per >>> >> >>> 100,000 >>> >> >>> > >>> >> population. But that then becomes a proxy for economic >>> >> >>> development, >>> >> >>> > >>> >> since there are highly populated countries with fewer >>> >> computers >>> >> >>> per >>> >> >>> > >>> >> capital, and low population countries with more computers, >>> >> etc. >>> >> >>> I >>> >> >>> > >>> >> don't think that is what we want to show. >>> >> >>> > >>> >> >>> >> >>> > >>> >> So, I'm wondering, has anyone seen data for something like >>> >> PCs >>> >> >>> per >>> >> >>> > >>> >> capita, or home computers, or internet users, or some >>> other >>> >> proxy >>> >> >>> > for >>> >> >>> > >>> >> what our potential usership would be per country? >>> >> >>> > >>> >> >>> >> >>> > >>> > >>> >> >>> > >>> > Maybe we can use internet users stats by country, something >>> >> like: >>> >> >>> > >>> > >>> >> >>> > >>> >>> >> >>> > >>> >> >>> >>> >> >>> http://en.wikipedia.org/wiki/List_of_countries_by_number_of_Internet_usersto >>> >> >>> > >>> > normalize our stats. >>> >> >>> > >>> > >>> >> >>> > >>> >>> >> >>> > >>> Thanks, that looks useful. I started entering the data into >>> a >>> >> >>> > >>> spreadsheet: >>> >> >>> > >>> >>> >> >>> > >>> >>> >> >>> > >>> >>> >> >>> > >>> >> >>> >>> >> >>> https://docs.google.com/spreadsheet/ccc?key=0Av4Lhq3W5zKodGZtNU1oRGFjWi1kYXkzVEtjOWY1ZlE >>> >> >>> > >>> >>> >> >>> > >>> As you see, Italy is at the top, if you look at downloads per >>> >> 1000 >>> >> >>> > >>> internet users, with 91. So nearly one in ten in Italy have >>> >> >>> > >>> downloaded AOO! >>> >> >>> > >>> >>> >> >>> > >>> -Rob >>> >> >>> > >>> >>> >> >>> > >>> > >>> >> >>> > >>> > >>> >> >>> > >>> >> Regards, >>> >> >>> > >>> >> >>> >> >>> > >>> >> -Rob >>> >> >>> > >>> >> >>> >> >>> > >>> > >>> >> >>> > >>> > -- >>> >> >>> > >>> > ==== >>> >> >>> > >>> > This e- mail message is intended only for the named >>> >> recipient(s) >>> >> >>> > above. >>> >> >>> > >>> It >>> >> >>> > >>> > may contain confidential and privileged information. If you >>> >> are >>> >> >>> not >>> >> >>> > the >>> >> >>> > >>> > intended recipient you are hereby notified that any >>> >> dissemination, >>> >> >>> > >>> > distribution or copying of this e-mail and any >>> attachment(s) >>> >> is >>> >> >>> > strictly >>> >> >>> > >>> > prohibited. If you have received this e-mail in error, >>> please >>> >> >>> > >>> immediately >>> >> >>> > >>> > notify the sender by replying to this e-mail and delete the >>> >> >>> message >>> >> >>> > and >>> >> >>> > >>> any >>> >> >>> > >>> > attachment(s) from your system. Thank you. >>> >> >>> > >>> > >>> >> >>> > >>> >>> >> >>> > >> >>> >> >>> > >> >>> >> >>> > >>> >> >>> >>> >> >> >>> >> >> >>> >> >>> > >>> > >>> >> >>
