And one more thing is that, due to the limited resources(infra has not
migrated all the build nodes), I filtered out all the feature branches for
nightly and flaky jobs.

Will add them back after infra has done all the migration.

And I wonder whether we really need to test all the feature branches.

Thanks.

Nick Dimiduk <[email protected]> 于2020年8月12日周三 上午2:02写道:

> Thank you so much for taking on these migrations! I very much appreciate
> it!
>
> -n
>
> On Tue, Aug 11, 2020 at 8:08 AM 张铎(Duo Zhang) <[email protected]>
> wrote:
>
> > Some updates here, we have migrated most of the jobs to ci-hadoop.a.o.
> >
> > There is a known issue that our flaky dashboard is broken, due to this
> new
> > feature of jenkins
> >
> >
> https://wiki.jenkins.io/display/JENKINS/Configuring+Content+Security+Policy
> >
> > Josh is contacting the infra team to see if they can relax the policy
> but I
> > do not think it is easy as the policy is per site, not per job...
> >
> > Anyway, there is a chrome plugin to temporarily disable CSP, so you can
> > view the correct flaky dashboard.
> >
> >
> >
> https://chrome.google.com/webstore/detail/disable-content-security/ieelmcmcagommplceebfedjlakkhpden
> >
> > Thanks.
> >
> > Andor Molnar <[email protected]> 于2020年7月30日周四 下午3:12写道:
> >
> > > https://issues.apache.org/jira/browse/INFRA-20613
> > >
> > >
> > >
> > > > On 2020. Jul 30., at 1:47, 张铎(Duo Zhang) <[email protected]>
> > wrote:
> > > >
> > > > This never worked in the past...
> > > >
> > > > But it would be great if you can kick the infra team to get this done
> > :)
> > > >
> > > > File an infra issue?
> > > >
> > > > Andor Molnar <[email protected]>于2020年7月29日 周三18:36写道:
> > > >
> > > >> You’re having the same issue with HBase Robot btw. At the end of
> > console
> > > >> outputs:
> > > >>
> > > >> "Could not update commit status, please check if your scan
> credentials
> > > >> belong to a member of the organization or a collaborator of the
> > > repository
> > > >> and repo:status scope is selected”
> > > >>
> > > >> ...and shortly after that:
> > > >>
> > > >> "GitHub has been notified of this commit’s build result”
> > > >>
> > > >> Whatever does it mean.
> > > >>
> > > >> Andor
> > > >>
> > > >>
> > > >>
> > > >>> On 2020. Jul 29., at 11:57, Andor Molnar <[email protected]> wrote:
> > > >>>
> > > >>> Yep, we’ve finally received it. It’s done.
> > > >>>
> > > >>> Current issue is that Jenkins is unable to set Github build status.
> > > I’ve
> > > >> added repo:status permission, but it’s also asking to be member of
> the
> > > >> project/organization and not sure how to do that.
> > > >>>
> > > >>> Andor
> > > >>>
> > > >>>
> > > >>>
> > > >>>> On 2020. Jul 29., at 4:10, 张铎(Duo Zhang) <[email protected]>
> > > wrote:
> > > >>>>
> > > >>>> Seems you have already made it?
> > > >>>>
> > > >>>> Usually there are several moderators for the private list, you
> need
> > to
> > > >> ask
> > > >>>> them to let the GitHub registration go through.
> > > >>>>
> > > >>>> Andor Molnar <[email protected]> 于2020年7月29日周三 上午1:03写道:
> > > >>>>
> > > >>>>> Thanks Duo, that’s very helpful.
> > > >>>>> I cannot set private@zookeeper as a verified e-mail address,
> > because
> > > >> the
> > > >>>>> verification e-mail cannot be sent to the list. Isn’t that
> > restricted
> > > >> for
> > > >>>>> members only (by default)?
> > > >>>>>
> > > >>>>> Andor
> > > >>>>>
> > > >>>>>
> > > >>>>>
> > > >>>>>> On 2020. Jul 28., at 3:15, 张铎(Duo Zhang) <[email protected]
> >
> > > >> wrote:
> > > >>>>>>
> > > >>>>>> Hi Andor,
> > > >>>>>>
> > > >>>>>> The Apache-HBase account is registered by me, using the
> > > private@hbase
> > > >>>>>> mailing list, so all the PMC members can maintain the password.
> > > >>>>>>
> > > >>>>>> I generated an access token and added it to our jenkins, so we
> can
> > > >> use it
> > > >>>>>> to post comments back to GitHub.
> > > >>>>>>
> > > >>>>>> I think you could do the same to register an Apache-ZooKeeper
> > > >> account? Or
> > > >>>>>> if you want  to use the hadoop-yetus account, you'd better ask
> the
> > > >> hadoop
> > > >>>>>> PMC members or Gavin to add the token to jenkins so you can use
> > it.
> > > >>>>>>
> > > >>>>>> Thanks.
> > > >>>>>>
> > > >>>>>> Andor Molnar <[email protected]> 于2020年7月28日周二 上午3:56写道:
> > > >>>>>>
> > > >>>>>>> Hi Duo,
> > > >>>>>>>
> > > >>>>>>> I’m trying to create a similar job for Apache ZooKeeper, but
> > > >>>>> unfortunately
> > > >>>>>>> haven’t got too much help on the Apache builds@ list so far,
> so
> > > I’m
> > > >>>>>>> rather asking you if you don’t mind.
> > > >>>>>>>
> > > >>>>>>> First, how have you set up the Hbase Github account that you
> use
> > in
> > > >> this
> > > >>>>>>> job to access the repo?
> > > >>>>>>>
> > > >>>>>>> Thanks,
> > > >>>>>>> Andor
> > > >>>>>>>
> > > >>>>>>>
> > > >>>>>>>
> > > >>>>>>>> On 2020. Jul 27., at 2:22, 张铎(Duo Zhang) <
> [email protected]
> > >
> > > >>>>> wrote:
> > > >>>>>>>>
> > > >>>>>>>> The pre commit job has been migrated to c-hadoop.a.o.
> > > >>>>>>>>
> > > >>>>>>>> I have disabled periodical scan for the old job on builds.a.o,
> > as
> > > we
> > > >>>>>>> still
> > > >>>>>>>> need to view the pre commit result on it do not delete for
> now.
> > > Will
> > > >>>>>>> delete
> > > >>>>>>>> it later, maybe after several weeks.
> > > >>>>>>>>
> > > >>>>>>>> The new job is here
> > > >>>>>>>>
> > > >>>>>>>>
> > > >>
> https://ci-hadoop.apache.org/job/HBase/job/HBase-PreCommit-GitHub-PR/
> > > >>>>>>>>
> > > >>>>>>>> Thanks.
> > > >>>>>>>>
> > > >>>>>>>> 张铎(Duo Zhang) <[email protected]> 于2020年7月25日周六 下午9:44写道:
> > > >>>>>>>>
> > > >>>>>>>>>
> > > >>>>>>>>>
> > > >>>>>>>
> > > >>>>>
> > > >>
> > >
> >
> https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/5/console
> > > >>>>>>>>>
> > > >>>>>>>>>
> > > >>>>>>>>> We successfully finished a nightly build.
> > > >>>>>>>>>
> > > >>>>>>>>> But seems the jiraComment did not work. I haven't seen the
> > > comment
> > > >>>>>>>>> on HBASE-24757...
> > > >>>>>>>>>
> > > >>>>>>>>> 张铎(Duo Zhang) <[email protected]> 于2020年7月25日周六
> 下午4:51写道:
> > > >>>>>>>>>
> > > >>>>>>>>>> After installing two new jenkins plugins, the pre commit job
> > > seems
> > > >>>>> fine
> > > >>>>>>>>>> now.
> > > >>>>>>>>>>
> > > >>>>>>>>>> The last failure is because of a timeout, I assume the
> problem
> > > is
> > > >>>>> that
> > > >>>>>>> we
> > > >>>>>>>>>> do not have enough executors so all the jobs are executed
> > > >>>>> sequentially.
> > > >>>>>>>>>>
> > > >>>>>>>>>> Maybe we could move the pre commit job to the new env first?
> > The
> > > >>>>>>> nightly
> > > >>>>>>>>>> job and flaky job require more resources, and we need the
> > output
> > > >> of
> > > >>>>>>> these
> > > >>>>>>>>>> jenkins jobs(the flaky test list).
> > > >>>>>>>>>>
> > > >>>>>>>>>> Thanks.
> > > >>>>>>>>>>
> > > >>>>>>>>>>
> > > >>>>>>>>>>
> > > >>>>>>>>>> 张铎(Duo Zhang) <[email protected]> 于2020年7月24日周五
> 下午4:36写道:
> > > >>>>>>>>>>
> > > >>>>>>>>>>> The problem seems because of this:
> > > >>>>>>>>>>>
> > > >>>>>>>>>>> https://issues.jenkins-ci.org/browse/JENKINS-48556
> > > >>>>>>>>>>>
> > > >>>>>>>>>>> I triggered the job again, it passed the timestamps call,
> and
> > > >> will
> > > >>>>>>> keep
> > > >>>>>>>>>>> an eye on it.
> > > >>>>>>>>>>>
> > > >>>>>>>>>>> 张铎(Duo Zhang) <[email protected]> 于2020年7月21日周二
> > 上午11:18写道:
> > > >>>>>>>>>>>
> > > >>>>>>>>>>>> On the sponsors, we could have a try.
> > > >>>>>>>>>>>>
> > > >>>>>>>>>>>> The problem here is the process of the donation? IIRC
> there
> > > is a
> > > >>>>>>> thread
> > > >>>>>>>>>>>> on the infra mailing list about how to donate machines to
> a
> > > >>>>> specific
> > > >>>>>>>>>>>> project and the discussion did not go well...
> > > >>>>>>>>>>>>
> > > >>>>>>>>>>>> Sean Busbey <[email protected]> 于2020年7月21日周二 上午11:13写道:
> > > >>>>>>>>>>>>
> > > >>>>>>>>>>>>> We could check with ASF infra for the current state of
> > things
> > > >> wrt
> > > >>>>>>>>>>>>> GitHub
> > > >>>>>>>>>>>>> actions. I believe there is a queue set up across ASF
> > > projects.
> > > >>>>>>>>>>>>>
> > > >>>>>>>>>>>>> It has the same resource issue Travis had; things are
> fine
> > > >> until
> > > >>>>>>> some
> > > >>>>>>>>>>>>> critical mass of projects seeking better perf realize
> some
> > > new
> > > >>>>>>> option
> > > >>>>>>>>>>>>> is
> > > >>>>>>>>>>>>> available and then quickly all available resources are
> > > >> consumed.
> > > >>>>>>>>>>>>>
> > > >>>>>>>>>>>>> AFAICT the only option that gets us the same or better as
> > the
> > > >> H*
> > > >>>>>>> nodes
> > > >>>>>>>>>>>>> will
> > > >>>>>>>>>>>>> be finding sponsors and running our own.
> > > >>>>>>>>>>>>>
> > > >>>>>>>>>>>>> On Mon, Jul 20, 2020, 21:55 张铎(Duo Zhang) <
> > > >> [email protected]>
> > > >>>>>>>>>>>>> wrote:
> > > >>>>>>>>>>>>>
> > > >>>>>>>>>>>>>> I think our nightly, flakey, and pre commit jobs should
> be
> > > >>>>>>>>>>>>> transferred as a
> > > >>>>>>>>>>>>>> whole? They depend on each other.
> > > >>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>> I offer my help on the transition.
> > > >>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>> And on github CI, does ASF have a special deal with
> > github?
> > > If
> > > >>>>> not,
> > > >>>>>>>>>>>>> I do
> > > >>>>>>>>>>>>>> not think the default resource can fit our
> requirements...
> > > >>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>> Sean Busbey <[email protected]> 于2020年7月21日周二 上午1:49写道:
> > > >>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> Hi folks!
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> Back in April there was a brief discussion[1] about ASF
> > > >> Infra's
> > > >>>>>>>>>>>>>>> notification that builds.a.o is going away and we are
> > > >> currently
> > > >>>>>>>>>>>>> slated
> > > >>>>>>>>>>>>>>> to migrate to a set of CI servers for "Hadoop and
> related
> > > >>>>>>>>>>>>> projects".
> > > >>>>>>>>>>>>>>> This is the ci farm that will contain the bulk of the
> H*
> > > >> worker
> > > >>>>>>>>>>>>> nodes
> > > >>>>>>>>>>>>>>> that are donated by Yahoo!, which are the nodes we've
> > been
> > > >>>>> running
> > > >>>>>>>>>>>>> on
> > > >>>>>>>>>>>>>>> for ages[2].
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> Migration discussion still happens on the
> > > >>>>> [email protected]
> > > >>>>>>>>>>>>>>> list[3] and recently ASF Infra set a target date of
> > August
> > > >> 15th
> > > >>>>>>> for
> > > >>>>>>>>>>>>>>> turning off the existing builds.a.o server[4].
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> That gives us a little under 4 weeks to have things up
> > and
> > > >>>>> working
> > > >>>>>>>>>>>>> on
> > > >>>>>>>>>>>>>>> the new ci-hadoop.a.o jenkins coordinator[5]. it’s not
> > > clear
> > > >> to
> > > >>>>> me
> > > >>>>>>>>>>>>>>> that the level of effort we’ll need to spend is worth
> > what
> > > we
> > > >>>>> get
> > > >>>>>>>>>>>>> out
> > > >>>>>>>>>>>>>>> of a continuation of the status quo on builds.a.o. I
> did
> > a
> > > >> quick
> > > >>>>>>>>>>>>> test
> > > >>>>>>>>>>>>>>> by updating the nightly job on ci-hadoop.a.o to run
> just
> > > >>>>> branch-2,
> > > >>>>>>>>>>>>>>> since that has been stable on builds.a.o. It failed
> with
> > a
> > > >>>>> Jenkins
> > > >>>>>>>>>>>>>>> pipeline DSL syntax error[6] so I'm assuming migrating
> > will
> > > >> be a
> > > >>>>>>>>>>>>> slog.
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> As far as I can see our options are:
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> * Do nothing. Have no testing or automated website
> > > >> publication
> > > >>>>> in
> > > >>>>>>>>>>>>> mid
> > > >>>>>>>>>>>>>>> August.
> > > >>>>>>>>>>>>>>> * Transition website publication and nothing else
> > (probably
> > > >> can
> > > >>>>> be
> > > >>>>>>>>>>>>>>> done in a day)
> > > >>>>>>>>>>>>>>> * Transition just precommit testing for various repos
> > > >> (probably
> > > >>>>>>>>>>>>> can be
> > > >>>>>>>>>>>>>>> done in a few days)
> > > >>>>>>>>>>>>>>> * Transition everything (no idea how long it takes due
> to
> > > >>>>> nightly,
> > > >>>>>>>>>>>>>>> flaky stuff, etc)
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> The alternatives if we do not transition any given job
> to
> > > >>>>>>>>>>>>> ci-hadoop:
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> * Try to move to GitHub Actions
> > > >>>>>>>>>>>>>>> * Try to move to Travis CI
> > > >>>>>>>>>>>>>>> * Try to move to Jenkins infra we maintain ourselves
> > > >> (presumably
> > > >>>>>>> by
> > > >>>>>>>>>>>>>>> soliciting project specific donations for worker nodes
> on
> > > >> cloud
> > > >>>>>>>>>>>>>>> vendors)
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> It's important to remember that as a project we have a
> > > heavy
> > > >>>>>>>>>>>>> footprint
> > > >>>>>>>>>>>>>>> wherever our nightly tests run. For context, a given
> > > branch's
> > > >>>>>>>>>>>>> nightly
> > > >>>>>>>>>>>>>>> can keep 3-4 executors busy for 6+ hours on the current
> > > >>>>> builds.a.o
> > > >>>>>>>>>>>>>>> setup. There's been a bunch of great work lately on
> > > bringing
> > > >>>>> down
> > > >>>>>>>>>>>>> what
> > > >>>>>>>>>>>>>>> it takes to run the full test suite, but applying that
> > work
> > > >> to
> > > >>>>>>>>>>>>> nightly
> > > >>>>>>>>>>>>>>> is itself a significant undertaking.
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> What are folks thinking? Most importantly who is ready
> to
> > > >> work
> > > >>>>>>>>>>>>> towards
> > > >>>>>>>>>>>>>>> any given approach?
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> [1] [DISCUSS] Migrating HBase to new CI Master
> > > >>>>>>>>>>>>>>> https://s.apache.org/fux1o
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> [2] https://builds.apache.org/view/H-L/view/HBase/
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> [3]
> > > >>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>
> > > >>>>>>>
> > > >>
> https://lists.apache.org/[email protected]
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> [4] [IMPORTANT] - 2 more HADOOP nodes migrated over to
> > > >> ci-hadoop
> > > >>>>>>>>>>>>>>> https://s.apache.org/7e1nq
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> [5] https://ci-hadoop.apache.org/job/HBase/
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>> [6]
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>
> > > >>>>>>>
> > > >>>>>
> > > >>
> > >
> >
> https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/2/console
> > > >>>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>>
> > > >>>>>>>>>>>>>
> > > >>>>>>>>>>>>
> > > >>>>>>>
> > > >>>>>>>
> > > >>>>>
> > > >>>>>
> > > >>>
> > > >>
> > > >>
> > >
> > >
> >
>

Reply via email to