Hi Lex, The data we use for the lessons (at least for R) are actually coming from the gapminder R package put together by Jenny Brian. The package contains the code used to tidy the data from the spreadsheets made available by the gapminder website. It might be worth putting a pull request together to update the data there, and then it will be easy to update the data in our lessons.
Cheers, -- François On Mon, Jan 23, 2017 at 7:20 AM, Lex Nederbragt <[email protected]> wrote: > Hi, > > The gapminderDataFiveYear data that I have on my harddisk, and the ones from > the python and R lessons using the gap minder data, runs from 1952 to 2007. > I thought it would be nice to add the 2012 data, it being 2017, after all. > > So I went to what I guessed to be the original source, > https://www.gapminder.org/data/ (that was easy), and checked a few of the > population size numbers from the years in the datasets we use. I choose > "Population, total” as dataset, which can be viewed as google sheet here. > The numbers are not the same, in some cases they are quite much lower or > higher, while in others they are more close. > > The other data sources are a bit harder to compare. There are a few > GDP/Capita datasets, I think "Income per person (GDP/capita, PPP$ > inflation-adjusted)” comes closest, but the numbers are quite a bit higher > than in ‘our’ dataset. "Life expectancy (years)” is close, but also off. > > Should we update our numbers and add 2012? This could be done with some > smart webscraping, I think? > > Best, > > Lex Nederbragt > > _______________________________________________ > Discuss mailing list > [email protected] > http://lists.software-carpentry.org/listinfo/discuss _______________________________________________ Discuss mailing list [email protected] http://lists.software-carpentry.org/listinfo/discuss
