Thanks Carlo and Eric for the initiative.

I am all for it and I'll do my part to mind the code. This is a small yet
meaningful step we can take. Meanwhile, I'd like to take this opportunity
to open up conversation around the Diversity & Inclusion within the
community.

If you read this quarter's Hadoop board report, I am starting to collect
metrics about the composition of our community in order to understand if we
are building a diverse & inclusive community. Things that are obvious to me
that I thought I should report are the following: affiliation among
commiters, and demographics of committers. As of last quarter, 4 out of 7
newly minted committers are affiliated with Cloudera. 4 out of the 7 said
committers are located in Asia. Those facts suggest we have a good
international participation (I am being US-centric), which is good.
However, having half of the active committers affiliated with one company
is a potential problem.

I'd like to hear your thoughts on this. What other metrics should we
collect, and what actions can we take.



On Fri, Jul 10, 2020 at 11:29 AM Carlo Aldo Curino <carlo.cur...@gmail.com>
wrote:

> Eric,
>
> Thank you so much for the support and for stepping up offering to work on
> this. I am super +1 on this. Let's give folks a few more days to chime in,
> in case there is anything to discuss before we get cracking!
>
> (Really) Thanks,
> Carlo
>
> On Fri, Jul 10, 2020, 10:38 AM Eric Badger <ebad...@verizonmedia.com>
> wrote:
>
> > Thanks for writing this up, Carlo. I'm +1 (idk if I'm technically binding
> > on this or not) for the changes moving forward and I think we refactor
> away
> > any instances that are internal to the code (i.e. not APIs or other
> things
> > that would break compatibility) in all active branches and then also
> change
> > the APIs in trunk (an incompatible change).
> >
> > I just came across an internal issue related to the NM
> > whitelist/blacklist. I would be happy to go refactor the code and look
> for
> > instances of these and replace them with allowlist/blocklist. Doing a
> quick
> > "git grep" of trunk, I see 270 instances of "whitelist" and 1318
> instances
> > of "blacklist".
> >
> > If there are no objections, I'll create a JIRA to clean this specific
> > stuff up. It would be wonderful if others could pick up a different
> portion
> > (e.g. master/slave) so that we can spread the work out.
> >
> > Eric
> >
> > On Tue, Jul 7, 2020 at 6:27 PM Carlo Aldo Curino <carlo.cur...@gmail.com
> >
> > wrote:
> >
> >> Hello Folks,
> >>
> >> I hope you are all doing well...
> >>
> >> *The problem*
> >> The recent protests made me realize that we are not just a bystanders of
> >> the systematic racism that affect our society, but we are active
> >> participants of it. Being "non-racist" is not enough, I strongly feel we
> >> should be actively "anti-racist" in our day to day lives, and
> continuously
> >> check our biases. I assume most of you will agree with the general
> >> sentiment, but based on your exposure to the recent events and US
> >> culture/history might have more or less strong feelings about your role
> in
> >> the problem and potential solution.
> >>
> >> *What can we do about it?* I think a simple action we can take is to
> work
> >> on our code/comments/documentation/websites and remove racist
> terminology.
> >> Here is a IETF draft to fix up some of the most egregious examples
> >> (master/slave, whitelist/backlist) with proposed alternatives.
> >>
> >>
> https://tools.ietf.org/id/draft-knodel-terminology-00.html#rfc.section.1.1.1
> >> Also as we go about this effort, we should also consider other
> >> "non-inclusive" terminology issues around gender (e.g., binary gendered
> >> examples, "Alice" doing the wrong security thing systematically), and
> >> ableism (e.g., referring to misbehaving hardware as "lame" or "limping",
> >> etc.).
> >> The easiest action item is to avoid this going forward (ideally adding
> it
> >> to the checkstyles if possible), a more costly one is to start going
> back
> >> and refactor away existing instances.
> >>
> >> I know this requires a bunch of work as refactorings might break dev
> >> branches and non-committed patches, possibly scripts, etc. but I think
> >> this
> >> is something important and relatively simple we can do. The effect goes
> >> well beyond some text in github, it signals what we believe in, and
> forces
> >> hundreds of users and contributors to notice and think about it. Our
> >> force-multiplier is huge and it matches our responsibility.
> >>
> >> What do you folks think?
> >>
> >> Thanks,
> >> Carlo
> >>
> >
>

Reply via email to