Hi,

You also also predict the ethnic group based on name using [1] and again 
looking at a name you might be able to have a good guess of what ethic group 
someone belongs to.

How useful its this? Probably not very, but perhaps changes over time may tell 
us something interesting? Or perhaps some guesstimates of how many committers 
don’t speak English as their first language? Or perhaps it could indicate we 
are in some respects we are somewhat diverse bunch of people?

Obviously European is going to include European sounding names from the USA, so 
don’t feel left out USA. Having a Jewish last name may not mean you are Jewish, 
having an East European name doesn’t mean you are from there, just that one of 
your ancestors may of been etc etc etc

But for interest, here's a simplified output from that script from all the 
names in Apache’s GitHub organisation:
European 710
East Asian 308
Indian 258
Jewish 189
East Europe 183
French 103
Germanic 102
Hispanic 99
Nordic 89
Italian 75
African 70
Japanese 52
Other 1

Again if anyone want details on how I generated the above just ask.

Although I just realised that whimsey data probably has the names of all 
committers and may be an easer source of that data.

Thanks,
Justin

1. https://github.com/appeler/ethnicolr

Reply via email to