I'm really enjoying this feature so far! The "Pull Requests" page for Beam
is now way more readable. Thanks Alex :)

On Wed, Feb 12, 2020 at 9:18 PM Alex Van Boxel <[email protected]> wrote:

> What do you exactly mean with github grep... where is it an issue. I find
> it useful for searching here:
>
> [image: Screen Shot 2020-02-13 at 06.11.33.png]
>
> OK, you get some false positives, but then the color coding works. You
> can't search on a category so this looks like the only alternative. I was
> even thinking of adding more text in the description as it could help new
> contributors to identify something they could help with.
>
> It's also nice when you hover over the label.
>
> So, could you exactly pinpoint where you see a problem?
>
>  _/
> _/ Alex Van Boxel
>
>
> On Wed, Feb 12, 2020 at 10:22 PM Ismaël Mejía <[email protected]> wrote:
>
>> Alex would you consider removing the descriptions from the labels? It
>> seems that
>> github greps not only the text of the label but also the text of the
>> description
>> producing false positives, e.g. if I search the label `io` it resolves
>> not only
>> all the IOs but also results like `core` because it matches the
>> description
>> `core-constructIOn-java` and also `extensIOns` making the point of having
>> general categories not really useful.
>>
>> On Wed, Feb 12, 2020 at 3:01 PM Ismaël Mejía <[email protected]> wrote:
>>
>>> The prefix is just extra characters makes readibility worse, notice
>>> that the full category (e.g. ios/runners/etc) will match because we have
>>> an
>>> extra tag equivalent to the prefix, so filtering is still possible. you
>>> rarely
>>> need to filter for more than one criteria, that's why github does not
>>> allow it
>>> (and the reason to have the extra per category labels).
>>>
>>> The only issue i can see is a possible name collision in the future, but
>>> until that
>>> happens i think this is a reasonable tradeoff.
>>>
>>> Excellent idea to unifiy the colors for the categories +1 !
>>>
>>> On Wed, Feb 12, 2020 at 2:33 PM Alex Van Boxel <[email protected]> wrote:
>>>
>>>> Ismael, I saw that you removed the prefix. I still want to have some
>>>> grouping between the subtypes, so I color coded them.
>>>>
>>>> I also added all the labels in the file. We now have 62 labels.
>>>>
>>>>  _/
>>>> _/ Alex Van Boxel
>>>>
>>>>
>>>> On Wed, Feb 12, 2020 at 12:03 PM Ismaël Mejía <[email protected]>
>>>> wrote:
>>>>
>>>>> Forgot to mention, older PRs will look not classified, up to you guys
>>>>> if you
>>>>> want to do manually. All new PRs will be automatically labeled.
>>>>>
>>>>> On Wed, Feb 12, 2020 at 12:02 PM Ismaël Mejía <[email protected]>
>>>>> wrote:
>>>>>
>>>>>> For info Alex's PR to suport autolabeler was merged today and INFRA
>>>>>> enabled the plugin.
>>>>>> I created an artificial PR to check it was autolabeled correctly.
>>>>>> It is working perfectly now.
>>>>>> Thanks Alex !
>>>>>>
>>>>>> On Tue, Feb 11, 2020 at 5:23 PM Robert Bradshaw <[email protected]>
>>>>>> wrote:
>>>>>>
>>>>>>> +1 to finding the right balance.
>>>>>>>
>>>>>>> I do think per-runner makes sense, rather than a general "runners."
>>>>>>> IOs might make sense as well. Not sure about all the extensions-* I'd
>>>>>>> leave those out for now.
>>>>>>>
>>>>>>> On Tue, Feb 11, 2020 at 5:56 AM Ismaël Mejía <[email protected]>
>>>>>>> wrote:
>>>>>>> >
>>>>>>> > > So I propose going simple with a limited set of labels. Later on
>>>>>>> we can refine. Don't forget that does labels only are useful during the
>>>>>>> life-cycle of a PR.
>>>>>>> >
>>>>>>> > Labels are handy for quick filtering and finding PRs we care about
>>>>>>> for example
>>>>>>> > to review.
>>>>>>> >
>>>>>>> > I agree with the feeling that we should not go to the extremes,
>>>>>>> but what is
>>>>>>> > requested in the PR rarely would produce more than 5 labels per
>>>>>>> PR.  For example
>>>>>>> > if a PR changes KafkaIO and something in the CI it will produce
>>>>>>> "java io kafka
>>>>>>> > infra", a pure change on Flink runer will produce "runners flink"
>>>>>>> >
>>>>>>> > 100% d'accord with not to have many labels and keep them short,
>>>>>>> but the current
>>>>>>> > classification lacks detail, e.g. few people care about some
>>>>>>> general categories
>>>>>>> > "runners" or "io", but maintainers may care about their specific
>>>>>>> categories like
>>>>>>> > "spark" or "kafka" so I don't think that this extra level of
>>>>>>> detail is
>>>>>>> > inappropriate and in the end it will only add one extra label per
>>>>>>> matching path.
>>>>>>> >
>>>>>>> > Let's give it a try if it is too excesive we can took the opposite
>>>>>>> path and reduce it.
>>>>>>> >
>>>>>>> > Ismaël
>>>>>>> >
>>>>>>> >
>>>>>>> > On Tue, Feb 11, 2020 at 1:04 PM Alex Van Boxel <[email protected]>
>>>>>>> wrote:
>>>>>>> >>
>>>>>>> >> I'm wondering if we're not taking it too far with those detailed
>>>>>>> labels. It's like going from nothing to super details. The simples 
>>>>>>> use-case
>>>>>>> hasn't proven itself in practice yet.
>>>>>>> >>
>>>>>>> >> So I propose going simple with a limited set of labels. Later on
>>>>>>> we can refine. Don't forget that does labels only are useful during the
>>>>>>> life-cycle of a PR.
>>>>>>> >>
>>>>>>> >>  _/
>>>>>>> >> _/ Alex Van Boxel
>>>>>>> >>
>>>>>>> >>
>>>>>>> >> On Tue, Feb 11, 2020 at 9:46 AM Ismaël Mejía <[email protected]>
>>>>>>> wrote:
>>>>>>> >>>
>>>>>>> >>> Let some comments too, let's keep the discussion on refinements
>>>>>>> in the PR.
>>>>>>> >>>
>>>>>>> >>> On Tue, Feb 11, 2020 at 9:13 AM jincheng sun <
>>>>>>> [email protected]> wrote:
>>>>>>> >>>>
>>>>>>> >>>> I left comments on PR, the main suggestion is that we may need
>>>>>>> a discussion about what kind of labels should be add. I would like to 
>>>>>>> share
>>>>>>> my thoughts as follows:
>>>>>>> >>>>
>>>>>>> >>>> I think we need to add labels according to some rules. For
>>>>>>> example, the easiest way is to add labels by languages, java / python / 
>>>>>>> go
>>>>>>> etc. But this kind of help is very limited, so we need to subdivide some
>>>>>>> labels, such as by components. Currently we have more than 70 
>>>>>>> components,
>>>>>>> each component is configured with labels, and it seems cumbersome. So we
>>>>>>> should have some rules for dividing labels, which can play the role of
>>>>>>> labels without being too cumbersome. Such as:
>>>>>>> >>>>
>>>>>>> >>>> We can add `extensions` or `extensions-ideas and
>>>>>>> extensions-java` for the following components:
>>>>>>> >>>>
>>>>>>> >>>> - extensions-ideas
>>>>>>> >>>> - extensions-java-join-library
>>>>>>> >>>> - extensions-java-json
>>>>>>> >>>> - extensions-java-protobuf
>>>>>>> >>>> - extensions-java-sketching
>>>>>>> >>>> - extensions-java-sorter
>>>>>>> >>>>
>>>>>>> >>>> And it's better to add a label for each Runner as follows:
>>>>>>> >>>>
>>>>>>> >>>> - runner-apex
>>>>>>> >>>> - runner-core
>>>>>>> >>>> - runner-dataflow
>>>>>>> >>>> - runner-direct
>>>>>>> >>>> - runner-flink
>>>>>>> >>>> - runner-jstorm
>>>>>>> >>>> - runner-...
>>>>>>> >>>>
>>>>>>> >>>> So, I think would be great to collect feedbacks from the
>>>>>>> community on the set of labels needed.
>>>>>>> >>>>
>>>>>>> >>>> What do you think?
>>>>>>> >>>>
>>>>>>> >>>> Best,
>>>>>>> >>>> Jincheng
>>>>>>> >>>>
>>>>>>> >>>> Alex Van Boxel <[email protected]> 于2020年2月11日周二 下午3:11写道:
>>>>>>> >>>>>
>>>>>>> >>>>> I've opened a PR and a ticket with INFRA.
>>>>>>> >>>>>
>>>>>>> >>>>> PR: https://github.com/apache/beam/pull/10824
>>>>>>> >>>>>
>>>>>>> >>>>>  _/
>>>>>>> >>>>> _/ Alex Van Boxel
>>>>>>> >>>>>
>>>>>>> >>>>>
>>>>>>> >>>>> On Tue, Feb 11, 2020 at 6:57 AM jincheng sun <
>>>>>>> [email protected]> wrote:
>>>>>>> >>>>>>
>>>>>>> >>>>>> +1. Autolabeler seems really cool and it seems that it's
>>>>>>> simple to configure and set up.
>>>>>>> >>>>>>
>>>>>>> >>>>>> Best,
>>>>>>> >>>>>> Jincheng
>>>>>>> >>>>>>
>>>>>>> >>>>>>
>>>>>>> >>>>>>
>>>>>>> >>>>>> Udi Meiri <[email protected]> 于2020年2月11日周二 上午2:01写道:
>>>>>>> >>>>>>>
>>>>>>> >>>>>>> Cool!
>>>>>>> >>>>>>>
>>>>>>> >>>>>>> On Mon, Feb 10, 2020 at 9:27 AM Robert Burke <
>>>>>>> [email protected]> wrote:
>>>>>>> >>>>>>>>
>>>>>>> >>>>>>>> +1 to autolabeling
>>>>>>> >>>>>>>>
>>>>>>> >>>>>>>> On Mon, Feb 10, 2020, 9:21 AM Luke Cwik <[email protected]>
>>>>>>> wrote:
>>>>>>> >>>>>>>>>
>>>>>>> >>>>>>>>> Nice
>>>>>>> >>>>>>>>>
>>>>>>> >>>>>>>>> On Mon, Feb 10, 2020 at 2:52 AM Alex Van Boxel <
>>>>>>> [email protected]> wrote:
>>>>>>> >>>>>>>>>>
>>>>>>> >>>>>>>>>> Ha, cool. I'll have a look at the autolabeler. The infra
>>>>>>> stuff is not something I've looked at... I'll dive into that.
>>>>>>> >>>>>>>>>>
>>>>>>> >>>>>>>>>>  _/
>>>>>>> >>>>>>>>>> _/ Alex Van Boxel
>>>>>>> >>>>>>>>>>
>>>>>>> >>>>>>>>>>
>>>>>>> >>>>>>>>>> On Mon, Feb 10, 2020 at 11:49 AM Ismaël Mejía <
>>>>>>> [email protected]> wrote:
>>>>>>> >>>>>>>>>>>
>>>>>>> >>>>>>>>>>> +1
>>>>>>> >>>>>>>>>>>
>>>>>>> >>>>>>>>>>> You don't need to write your own action, there is
>>>>>>> already one autolabeler action [1].
>>>>>>> >>>>>>>>>>> INFRA can easily configure it for Beam (as they did for
>>>>>>> Avro [2]) if we request it.
>>>>>>> >>>>>>>>>>> The plugin is quite easy to configure and works like a
>>>>>>> charm [3].
>>>>>>> >>>>>>>>>>>
>>>>>>> >>>>>>>>>>> [1] https://github.com/probot/autolabeler
>>>>>>> >>>>>>>>>>> [1] https://issues.apache.org/jira/browse/INFRA-17367
>>>>>>> >>>>>>>>>>> [2]
>>>>>>> https://github.com/apache/avro/blob/master/.github/autolabeler.yml
>>>>>>> >>>>>>>>>>>
>>>>>>> >>>>>>>>>>>
>>>>>>> >>>>>>>>>>> On Mon, Feb 10, 2020 at 11:20 AM Alexey Romanenko <
>>>>>>> [email protected]> wrote:
>>>>>>> >>>>>>>>>>>>
>>>>>>> >>>>>>>>>>>> Great initiative, thanks Alex! I was thinking to add
>>>>>>> such labels into PR title but I believe that GitHub labels are better 
>>>>>>> since
>>>>>>> it can be used easily for filtering, for example.
>>>>>>> >>>>>>>>>>>>
>>>>>>> >>>>>>>>>>>> Maybe it could be useful to add more granulation for
>>>>>>> labels, like “release”, “runners”, “website”, etc but I’m afraid to make
>>>>>>> the titles too heavy because of this.
>>>>>>> >>>>>>>>>>>>
>>>>>>> >>>>>>>>>>>> > On 10 Feb 2020, at 08:35, Alex Van Boxel <
>>>>>>> [email protected]> wrote:
>>>>>>> >>>>>>>>>>>> >
>>>>>>> >>>>>>>>>>>> > I've started putting labels on PR's. I've done the
>>>>>>> first page for now (as I'm afraid putting them on older once could 
>>>>>>> affect
>>>>>>> the stale bot. I hope this is ok.
>>>>>>> >>>>>>>>>>>> >
>>>>>>> >>>>>>>>>>>> > For now I'm only focussing on language and I'm going
>>>>>>> to see if I can write a GitLab action for it. I hope this is useful. 
>>>>>>> Other
>>>>>>> kind of suggestions for labels, that can be automated, are welcome.
>>>>>>> >>>>>>>>>>>> >
>>>>>>> >>>>>>>>>>>> > <Screen Shot 2020-02-10 at 08.31.09.png>
>>>>>>> >>>>>>>>>>>> >  _/
>>>>>>> >>>>>>>>>>>> > _/ Alex Van Boxel
>>>>>>> >>>>>>>>>>>>
>>>>>>>
>>>>>>

Reply via email to