Hi, > Thanks for taking an initial look Justin. Can you share what name guesser > you're using?
I already did [1], it has a database of 40,000 names. Is it 100% correct? Probably not and some names can be either gender, but it probably gives a good indicator and is the same rough percentages as the committer survey taken a few years back. > Can I also ask if we are sure our community wants their "name" on GitHub > used in stats on "gender?” IMO I think as long as it’s aggregated there’s no harm, we not identifying anyone and groups have 100s or 1000s of people in them, but if anyone disagrees please speak up. BTW Whimsy has more accurate data and lists everyone so I’ve moved to using that data, rather than GitHub’s. > I tend toward not using data that wasn't intended for a purpose for a > purpose without letting people know, especially if we are planning to > publish figures. If this was published anywhere in detail, there would be careful consideration and discussion before doing so, currently it's just back of the envelope numbers based on some data we have, that may provide some insights. I think it’s confirms what we already know i.e. that the gender mix in our committer base is not the same as it is in others open source foundations, employment in ITC or the general population. Thanks, Justin 1. https://pypi.org/project/gender-guesser/
