Hi, Something I thought might be of interest. I took all of the names of people in the Apache GitHub organisation [1] and ran them through a gender guesser.
Here’s the results: Total records: 2689 Male: 61.7% Female: 3.9% Unknown: 25.3% No name: 9.1% You might get more accurate results by also looking at GitHub icons, as about 1/2 of those have photos. So you can see just from someone’s first name you have a good chance of knowing what's their gender is, even if you may not realise it. and some people might be sharing that information when they don’t think they are. Now the above is not likely to be entirely accurate for a number ion reasons, for instance people may not put their real names down, it is however a large sample size of our committers. This represents 40% of our committers (current total 6946) and perhaps a higher % of currently active committers. It might be interesting to run this over time and see if it changes, or perhaps use as a rough baseline for any survey results to see if a group is under or over represented in the answers. The last committer survey [2] In which 765 people responded had 92% male / 5% female + 3% other. Taking the male/female results above (and assuming unknown/no name is at same ratio which may be a big assumption) we get 94% male / 6% female which is (very) close to those results. If people want the code or more details how exactly I got these values, I’m happy to share. Thanks, Justin 1. https://github.com/orgs/apache/people 2. https://cwiki.apache.org/confluence/display/COMDEV/ASF+Committer+Diversity+Survey+-+2016
