Hi,

> Thanks for taking an initial look Justin. Can you share what name guesser
> you're using?

I already did [1], it has a database of 40,000 names. Is it 100% correct? 
Probably not and some names can be either gender, but it probably gives a good 
indicator and is the same rough percentages as the committer survey taken a few 
years back.

> Can I also ask if we are sure our community wants their "name" on GitHub
> used in stats on "gender?”

IMO I think as long as it’s aggregated there’s no harm, we not identifying 
anyone and groups have 100s or 1000s of people in them, but if anyone disagrees 
please speak up.

BTW Whimsy has more accurate data and lists everyone so I’ve moved to using 
that data, rather than GitHub’s.

> I tend toward not using data that wasn't intended for a purpose for a
> purpose without letting people know, especially if we are planning to
> publish figures.

If this was published anywhere in detail, there would be careful consideration 
and discussion before doing so, currently it's just back of the envelope 
numbers based on some data we have, that may provide some insights.

I think it’s confirms what we already know i.e. that the gender mix in our 
committer base is not the same as it is in others open source foundations, 
employment in ITC or the general population.

Thanks,
Justin

1.  https://pypi.org/project/gender-guesser/

Reply via email to