Hi Vinoth,
About scaling community support, I agree with the suggestions above mentioned. 
My suggestion is that Hudi community requires contributors, committer and PMC 
member to partipate in community together, who could answer users' question of 
Hudi. Multiple modules should be divided in Hudi community, and each module 
needs one or more committers to follow the progress of the module, which helps 
attract more contributors to the community. About community user support, I 
thought that the community needs to vigorously expand Chinese users and train 
multiple Chinese submitters to answer local Chinese user questions, which way 
Apache Flink practices.


Thanks,
Nicholas







At 2019-12-14 01:28:47, "Vinoth Chandar" <vin...@apache.org> wrote:
>Thanks everyone for chiming in!   Any more thoughts, before I try and
>summarize?
>
>On Tue, Dec 10, 2019 at 9:33 PM Y Ethan Guo <ethan.guoyi...@gmail.com>
>wrote:
>
>> Here are my two cents in addition to the great suggestions in the thread:
>>
>> I agree with @Sivabalan that folks in Hudi community have different levels
>> of expertise and amount of effort to put in the community.  So in general,
>> it may be good to have PoCs or contributors for each area in Hudi, e.g.,
>> indexing, Timeline, performance, user facing APIs like deltastreamer, etc.
>> Then for issues, tickets and PRs, we may tag and label them accordingly so
>> that contributors can jump on them quickly based on the tags.  Committers
>> and PMC members with more expertise can help triage the tickets/PRs with
>> labels in one pass and trickle them down into sub-areas.  Then every
>> contributor can take up something easily without skimming through all
>> tickets/PRs each day.  Speaking from my experience, I'm able to help on
>> deltastreamer and docs related issues, but not yet comfortable addressing
>> areas like Presto, indexing and such.  I feel it's hard to keep track of
>> all tickets/PRs each day by a single person as the number of them are
>> growing.
>>
>> Specifically,
>>
>> for [1], we can have the support rotations by triaging the issues, tickets
>> and PRs with labels and tags, and looping in area PoCs and relevant
>> contributors if necessary.  Other contributors are still welcome to jump in
>> this process.
>>
>> for [2], similarly, contributors can simply tag the PRs if he or she is not
>> able to review the code in full.  The authors of PRs can also tag the PRs,
>> based on the areas, to make the review process easier.  Again, other
>> contributors are still welcome to review the code.
>>
>> for [3], as suggested by others, we can have 2-day ETA for the responses,
>> so people can expect a response within the ETA guarantee.  The first
>> response can just be an ack and triaging by looping in related area PoCs
>> and contributors.
>>
>> Best,
>> - Ethan
>>
>> On Tue, Dec 10, 2019 at 5:48 PM leesf <leesf0...@gmail.com> wrote:
>>
>> > Agree with @Sivabalan that any person who is interested in PRs would help
>> > to review, I think assign PR to someone just indicates that he/she is
>> > responsible for the progress as the lead person. Anyone is highly
>> > appreciated and encouraged to review PRs if they are interested.
>> >
>> > Best,
>> > Leesf
>> >
>> > Sivabalan <n.siv...@gmail.com> 于2019年12月11日周三 上午5:46写道:
>> >
>> > > IMO, open source community has folks with different needs and
>> interests.
>> > > For eg: “pro active” folks like you and Balaji who is very active in
>> all
>> > > aspects of the project. 2nd category is “selectively active” people,
>> who
>> > > will have good knowledge in few code blocks like Sudha in case of
>> presto
>> > > and so on. 3rd category is “active” folks who are interested in the
>> > > project, but they need to be assigned specific task items for them to
>> > > contribute. 4th is “newly active”, who are genuinely interested to
>> > > contribute, but got lost somewhere and don’t know where to get started
>> > and
>> > > will be dormant or slowly fade away. 5th is “user of the project” who
>> > might
>> > > be interested in consuming the project in most part and doesn’t have
>> real
>> > > intentions to contribute much. Last set of “passive” who just want to
>> > have
>> > > a tab at Hudi.
>> > >
>> > > I don’t think we can expect everyone to become an active contributor
>> with
>> > > our proposals. I mean, on a lighter note, the point I am trying to make
>> > is,
>> > > not sure if we can expect 20 folks to pitch in with our on-call
>> proposal.
>> > > Anyways, having said that, I really appreciate you thinking about
>> scaling
>> > > the community. It is really important to think these at this stage. We
>> > > should def employ something.
>> > >
>> > > Here are my thoughts.
>> > > 1: having 2 to 3 day on call would really help folks like me who is
>> > looking
>> > > to become an active contributor. Just that it may take some time to
>> > triage
>> > > an issue compared to someone like you or Balaji. But can’t do much, we
>> > got
>> > > to go through these to become more active.
>> > > On the contrary, I am not sure how many among the 15+ contributor you
>> > > quoted would be interested in fielding/answering ANY questions related
>> to
>> > > hudi. I feel some are more equipped to answer certain questions
>> relating
>> > to
>> > > their forte. To tackle these, I thought of an alternative suggestion.
>> We
>> > > can have leads for each area in general. Each of these leads should
>> have
>> > a
>> > > tab of people who has good knowledge in these areas and who is just
>> > getting
>> > > started. So, for new issues raised, the lead should feel free to assign
>> > to
>> > > specific person whom he/she thinks could answer. If the issue hasn't
>> been
>> > > responded within some SLA, then the lead could take charge and answer
>> > them.
>> > > I understand, that this is differing from the 2-3 day on-call strategy,
>> > but
>> > > just putting it out to hear others thoughts. Hope is that, slowly these
>> > > folks will become conversant w/ the respective area and might start to
>> > take
>> > > up issues on their own and not depend on lead to assign them.
>> > >
>> > > 2: Again following on the previous proposal, leads for each area should
>> > > assign reviewers for any PRs. Obviously, others can pitch in if
>> > interested,
>> > > but we need ways to nurture people to start contributing more. If the
>> > > assigned person is not interested or couldn’t review within some SLA,
>> > then
>> > > the respective leads or any other leads(if the resp lead is busy with
>> > other
>> > > stuffs) pitch in. Just that, we don't want Vinoth or Balaji to be
>> > reviewing
>> > > every PR out there.
>> > >
>> > > 3: Again, similar to 1. Not everyone can answer questions about
>> anything.
>> > > So each contributor should nurture a habit of answering and helping the
>> > > community in general with good faith. Don't have good suggestions here.
>> > >
>> > >
>> > > On Tue, Dec 10, 2019 at 9:13 AM vbal...@apache.org <vbal...@apache.org
>> >
>> > > wrote:
>> > >
>> > > >
>> > > > Regarding (1), I support the "on-call" model for answering to dev@
>> > > > emails, triaging GH and Jira. This would help reduce context-switch
>> for
>> > > the
>> > > > contributor community as a whole. Also, Trying to answer questions
>> > about
>> > > > Hudi is a good way to ramp up on the internals of it.  A 2 day
>> schedule
>> > > > would be good way to start and we can try this to see how this oncall
>> > > model
>> > > > works for us.
>> > > > Regarding (2), Code-reviews are critical to building our community.
>> > > Maybe,
>> > > > we can try a recognition model for most active code-reviewers every
>> > month
>> > > > would help here. We can send a recognition/appreciation  email
>> > > identifying
>> > > > new and most active code-reviewers.
>> > > > Regarding (3),  How about we assume that if a ticket owner is not
>> > > > responding for more than 1 or 2 weeks, then they are not working on
>> > this
>> > > > and we can re-assign if it is a critical feature that needs to go
>> into
>> > a
>> > > > release. The response from ticket owners need not be a complete
>> answer
>> > > but
>> > > > just enough communication that they are actively working on it. I
>> agree
>> > > > with @leesf that this depends on each person's situation but a clear
>> > > > communication regarding ETA expectations and sticking to it would
>> help
>> > in
>> > > > project planning.
>> > > > Thanks,Balaji.V
>> > > >
>> > > >
>> > > > Answering issues on this mailing list or GH issues or occasionally
>> > > > slack. We need a clear owner to triage the problem, reproduce it if
>> > > needed,
>> > > > either provide suggestions or file a JIRA - AND always look for ways
>> to
>> > > > update the FAQ. We need a clear hand off process also.
>> > > >     On Saturday, December 7, 2019, 12:01:28 PM PST, Vinoth Chandar <
>> > > > vin...@apache.org> wrote:
>> > > >
>> > > >  Hello all,
>> > > >
>> > > > As we grow, we need a scalable way for new users/contributors to
>> either
>> > > > easily use Hudi or ramp up on the project. Last month alone, we had
>> > close
>> > > > to 1600 notifications on commits@. and few hundred emails on this
>> > list.
>> > > In
>> > > > addition, to authoring RFCs and implementing JIRAs we need to share
>> the
>> > > > following responsibilities amongst us to be able to scale this
>> process.
>> > > >
>> > > > 1) Answering issues on this mailing list or GH issues or occasionally
>> > > > slack. We need a clear owner to triage the problem, reproduce it if
>> > > needed,
>> > > > either provide suggestions or file a JIRA - AND always look for ways
>> to
>> > > > update the FAQ. We need a clear hand off process also.
>> > > > 2) Code review process currently spreads the load amongst all the
>> > > > committers. But PRs vary dramatically in their complexity and we need
>> > > more
>> > > > committers who can review any part of the codebase.
>> > > > 3) Responding to pings/clarifications and unblocking . IMHO
>> committers
>> > > > should prioritize this higher than working on their own stuff (I
>> know I
>> > > > have been doing this at some cost to my productivity on the project).
>> > > This
>> > > > is the only way to scale and add new committers. committers need to
>> be
>> > > > nurturing in this process.
>> > > >
>> > > > I don't have a clear proposals for scaling 2 & 3, which fall heavily
>> on
>> > > > committers.. Love to hear suggestions.
>> > > >
>> > > > But for 1, I propose we have 2-3 day "Support Rotations" where any
>> > > > contributor can assume responsibility for support the community. This
>> > > > brings more focus to support and also fast tracks learning/ramping
>> for
>> > > the
>> > > > person on the rotation. It also minimizes interruptions for other
>> folks
>> > > and
>> > > > we gain more velocity. I am sure this is familiar to a lot of you at
>> > your
>> > > > own companies. We have at-least 10-15 active contributors at this
>> > point..
>> > > > So  the investment is minimal : doing this once a month.
>> > > >
>> > > >  A committer and a PMC member will always be designated
>> > secondary/backup
>> > > in
>> > > > case the primary cannot field a question. I am happy to additionally
>> > > > volunteer as "always on rotation" as a third level backup, to get
>> this
>> > > > process booted up.
>> > > >
>> > > > Please let me know what you all think. Please be specific in what
>> issue
>> > > > [1][2] or [3] you are talking about in your feedback
>> > > >
>> > > > thanks
>> > > > vinoth
>> > > >
>> > >
>> >
>>

Reply via email to