Hi, You also also predict the ethnic group based on name using [1] and again looking at a name you might be able to have a good guess of what ethic group someone belongs to.
How useful its this? Probably not very, but perhaps changes over time may tell us something interesting? Or perhaps some guesstimates of how many committers don’t speak English as their first language? Or perhaps it could indicate we are in some respects we are somewhat diverse bunch of people? Obviously European is going to include European sounding names from the USA, so don’t feel left out USA. Having a Jewish last name may not mean you are Jewish, having an East European name doesn’t mean you are from there, just that one of your ancestors may of been etc etc etc But for interest, here's a simplified output from that script from all the names in Apache’s GitHub organisation: European 710 East Asian 308 Indian 258 Jewish 189 East Europe 183 French 103 Germanic 102 Hispanic 99 Nordic 89 Italian 75 African 70 Japanese 52 Other 1 Again if anyone want details on how I generated the above just ask. Although I just realised that whimsey data probably has the names of all committers and may be an easer source of that data. Thanks, Justin 1. https://github.com/appeler/ethnicolr
