this is an excellent start to things. Thanks for doing this work Duo! On Sun, Jul 26, 2020 at 7:23 PM 张铎(Duo Zhang) <[email protected]> wrote: > > The pre commit job has been migrated to c-hadoop.a.o. > > I have disabled periodical scan for the old job on builds.a.o, as we still > need to view the pre commit result on it do not delete for now. Will delete > it later, maybe after several weeks. > > The new job is here > > https://ci-hadoop.apache.org/job/HBase/job/HBase-PreCommit-GitHub-PR/ > > Thanks. > > 张铎(Duo Zhang) <[email protected]> 于2020年7月25日周六 下午9:44写道: > > > > > https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/5/console > > > > > > We successfully finished a nightly build. > > > > But seems the jiraComment did not work. I haven't seen the comment > > on HBASE-24757... > > > > 张铎(Duo Zhang) <[email protected]> 于2020年7月25日周六 下午4:51写道: > > > >> After installing two new jenkins plugins, the pre commit job seems fine > >> now. > >> > >> The last failure is because of a timeout, I assume the problem is that we > >> do not have enough executors so all the jobs are executed sequentially. > >> > >> Maybe we could move the pre commit job to the new env first? The nightly > >> job and flaky job require more resources, and we need the output of these > >> jenkins jobs(the flaky test list). > >> > >> Thanks. > >> > >> > >> > >> 张铎(Duo Zhang) <[email protected]> 于2020年7月24日周五 下午4:36写道: > >> > >>> The problem seems because of this: > >>> > >>> https://issues.jenkins-ci.org/browse/JENKINS-48556 > >>> > >>> I triggered the job again, it passed the timestamps call, and will keep > >>> an eye on it. > >>> > >>> 张铎(Duo Zhang) <[email protected]> 于2020年7月21日周二 上午11:18写道: > >>> > >>>> On the sponsors, we could have a try. > >>>> > >>>> The problem here is the process of the donation? IIRC there is a thread > >>>> on the infra mailing list about how to donate machines to a specific > >>>> project and the discussion did not go well... > >>>> > >>>> Sean Busbey <[email protected]> 于2020年7月21日周二 上午11:13写道: > >>>> > >>>>> We could check with ASF infra for the current state of things wrt > >>>>> GitHub > >>>>> actions. I believe there is a queue set up across ASF projects. > >>>>> > >>>>> It has the same resource issue Travis had; things are fine until some > >>>>> critical mass of projects seeking better perf realize some new option > >>>>> is > >>>>> available and then quickly all available resources are consumed. > >>>>> > >>>>> AFAICT the only option that gets us the same or better as the H* nodes > >>>>> will > >>>>> be finding sponsors and running our own. > >>>>> > >>>>> On Mon, Jul 20, 2020, 21:55 张铎(Duo Zhang) <[email protected]> > >>>>> wrote: > >>>>> > >>>>> > I think our nightly, flakey, and pre commit jobs should be > >>>>> transferred as a > >>>>> > whole? They depend on each other. > >>>>> > > >>>>> > I offer my help on the transition. > >>>>> > > >>>>> > And on github CI, does ASF have a special deal with github? If not, > >>>>> I do > >>>>> > not think the default resource can fit our requirements... > >>>>> > > >>>>> > > >>>>> > > >>>>> > Sean Busbey <[email protected]> 于2020年7月21日周二 上午1:49写道: > >>>>> > > >>>>> > > Hi folks! > >>>>> > > > >>>>> > > Back in April there was a brief discussion[1] about ASF Infra's > >>>>> > > notification that builds.a.o is going away and we are currently > >>>>> slated > >>>>> > > to migrate to a set of CI servers for "Hadoop and related > >>>>> projects". > >>>>> > > This is the ci farm that will contain the bulk of the H* worker > >>>>> nodes > >>>>> > > that are donated by Yahoo!, which are the nodes we've been running > >>>>> on > >>>>> > > for ages[2]. > >>>>> > > > >>>>> > > Migration discussion still happens on the [email protected] > >>>>> > > list[3] and recently ASF Infra set a target date of August 15th for > >>>>> > > turning off the existing builds.a.o server[4]. > >>>>> > > > >>>>> > > That gives us a little under 4 weeks to have things up and working > >>>>> on > >>>>> > > the new ci-hadoop.a.o jenkins coordinator[5]. it’s not clear to me > >>>>> > > that the level of effort we’ll need to spend is worth what we get > >>>>> out > >>>>> > > of a continuation of the status quo on builds.a.o. I did a quick > >>>>> test > >>>>> > > by updating the nightly job on ci-hadoop.a.o to run just branch-2, > >>>>> > > since that has been stable on builds.a.o. It failed with a Jenkins > >>>>> > > pipeline DSL syntax error[6] so I'm assuming migrating will be a > >>>>> slog. > >>>>> > > > >>>>> > > As far as I can see our options are: > >>>>> > > > >>>>> > > * Do nothing. Have no testing or automated website publication in > >>>>> mid > >>>>> > > August. > >>>>> > > * Transition website publication and nothing else (probably can be > >>>>> > > done in a day) > >>>>> > > * Transition just precommit testing for various repos (probably > >>>>> can be > >>>>> > > done in a few days) > >>>>> > > * Transition everything (no idea how long it takes due to nightly, > >>>>> > > flaky stuff, etc) > >>>>> > > > >>>>> > > The alternatives if we do not transition any given job to > >>>>> ci-hadoop: > >>>>> > > > >>>>> > > * Try to move to GitHub Actions > >>>>> > > * Try to move to Travis CI > >>>>> > > * Try to move to Jenkins infra we maintain ourselves (presumably by > >>>>> > > soliciting project specific donations for worker nodes on cloud > >>>>> > > vendors) > >>>>> > > > >>>>> > > It's important to remember that as a project we have a heavy > >>>>> footprint > >>>>> > > wherever our nightly tests run. For context, a given branch's > >>>>> nightly > >>>>> > > can keep 3-4 executors busy for 6+ hours on the current builds.a.o > >>>>> > > setup. There's been a bunch of great work lately on bringing down > >>>>> what > >>>>> > > it takes to run the full test suite, but applying that work to > >>>>> nightly > >>>>> > > is itself a significant undertaking. > >>>>> > > > >>>>> > > What are folks thinking? Most importantly who is ready to work > >>>>> towards > >>>>> > > any given approach? > >>>>> > > > >>>>> > > [1] [DISCUSS] Migrating HBase to new CI Master > >>>>> > > https://s.apache.org/fux1o > >>>>> > > > >>>>> > > [2] https://builds.apache.org/view/H-L/view/HBase/ > >>>>> > > > >>>>> > > [3] > >>>>> > > >>>>> https://lists.apache.org/[email protected] > >>>>> > > > >>>>> > > [4] [IMPORTANT] - 2 more HADOOP nodes migrated over to ci-hadoop > >>>>> > > https://s.apache.org/7e1nq > >>>>> > > > >>>>> > > [5] https://ci-hadoop.apache.org/job/HBase/ > >>>>> > > > >>>>> > > [6] > >>>>> > > > >>>>> > > >>>>> https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/2/console > >>>>> > > > >>>>> > > >>>>> > >>>>
-- Sean
