Hi Duo, Infra finished setting up an official GitHub user for accessing and updating Pull Requests. Look for 'ASF Cloudbees Jenkins ci-hadoop’ credential
Regards, Andor > On 2020. Aug 12., at 3:49, 张铎(Duo Zhang) <[email protected]> wrote: > > And one more thing is that, due to the limited resources(infra has not > migrated all the build nodes), I filtered out all the feature branches for > nightly and flaky jobs. > > Will add them back after infra has done all the migration. > > And I wonder whether we really need to test all the feature branches. > > Thanks. > > Nick Dimiduk <[email protected]> 于2020年8月12日周三 上午2:02写道: > >> Thank you so much for taking on these migrations! I very much appreciate >> it! >> >> -n >> >> On Tue, Aug 11, 2020 at 8:08 AM 张铎(Duo Zhang) <[email protected]> >> wrote: >> >>> Some updates here, we have migrated most of the jobs to ci-hadoop.a.o. >>> >>> There is a known issue that our flaky dashboard is broken, due to this >> new >>> feature of jenkins >>> >>> >> https://wiki.jenkins.io/display/JENKINS/Configuring+Content+Security+Policy >>> >>> Josh is contacting the infra team to see if they can relax the policy >> but I >>> do not think it is easy as the policy is per site, not per job... >>> >>> Anyway, there is a chrome plugin to temporarily disable CSP, so you can >>> view the correct flaky dashboard. >>> >>> >>> >> https://chrome.google.com/webstore/detail/disable-content-security/ieelmcmcagommplceebfedjlakkhpden >>> >>> Thanks. >>> >>> Andor Molnar <[email protected]> 于2020年7月30日周四 下午3:12写道: >>> >>>> https://issues.apache.org/jira/browse/INFRA-20613 >>>> >>>> >>>> >>>>> On 2020. Jul 30., at 1:47, 张铎(Duo Zhang) <[email protected]> >>> wrote: >>>>> >>>>> This never worked in the past... >>>>> >>>>> But it would be great if you can kick the infra team to get this done >>> :) >>>>> >>>>> File an infra issue? >>>>> >>>>> Andor Molnar <[email protected]>于2020年7月29日 周三18:36写道: >>>>> >>>>>> You’re having the same issue with HBase Robot btw. At the end of >>> console >>>>>> outputs: >>>>>> >>>>>> "Could not update commit status, please check if your scan >> credentials >>>>>> belong to a member of the organization or a collaborator of the >>>> repository >>>>>> and repo:status scope is selected” >>>>>> >>>>>> ...and shortly after that: >>>>>> >>>>>> "GitHub has been notified of this commit’s build result” >>>>>> >>>>>> Whatever does it mean. >>>>>> >>>>>> Andor >>>>>> >>>>>> >>>>>> >>>>>>> On 2020. Jul 29., at 11:57, Andor Molnar <[email protected]> wrote: >>>>>>> >>>>>>> Yep, we’ve finally received it. It’s done. >>>>>>> >>>>>>> Current issue is that Jenkins is unable to set Github build status. >>>> I’ve >>>>>> added repo:status permission, but it’s also asking to be member of >> the >>>>>> project/organization and not sure how to do that. >>>>>>> >>>>>>> Andor >>>>>>> >>>>>>> >>>>>>> >>>>>>>> On 2020. Jul 29., at 4:10, 张铎(Duo Zhang) <[email protected]> >>>> wrote: >>>>>>>> >>>>>>>> Seems you have already made it? >>>>>>>> >>>>>>>> Usually there are several moderators for the private list, you >> need >>> to >>>>>> ask >>>>>>>> them to let the GitHub registration go through. >>>>>>>> >>>>>>>> Andor Molnar <[email protected]> 于2020年7月29日周三 上午1:03写道: >>>>>>>> >>>>>>>>> Thanks Duo, that’s very helpful. >>>>>>>>> I cannot set private@zookeeper as a verified e-mail address, >>> because >>>>>> the >>>>>>>>> verification e-mail cannot be sent to the list. Isn’t that >>> restricted >>>>>> for >>>>>>>>> members only (by default)? >>>>>>>>> >>>>>>>>> Andor >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>>> On 2020. Jul 28., at 3:15, 张铎(Duo Zhang) <[email protected] >>> >>>>>> wrote: >>>>>>>>>> >>>>>>>>>> Hi Andor, >>>>>>>>>> >>>>>>>>>> The Apache-HBase account is registered by me, using the >>>> private@hbase >>>>>>>>>> mailing list, so all the PMC members can maintain the password. >>>>>>>>>> >>>>>>>>>> I generated an access token and added it to our jenkins, so we >> can >>>>>> use it >>>>>>>>>> to post comments back to GitHub. >>>>>>>>>> >>>>>>>>>> I think you could do the same to register an Apache-ZooKeeper >>>>>> account? Or >>>>>>>>>> if you want to use the hadoop-yetus account, you'd better ask >> the >>>>>> hadoop >>>>>>>>>> PMC members or Gavin to add the token to jenkins so you can use >>> it. >>>>>>>>>> >>>>>>>>>> Thanks. >>>>>>>>>> >>>>>>>>>> Andor Molnar <[email protected]> 于2020年7月28日周二 上午3:56写道: >>>>>>>>>> >>>>>>>>>>> Hi Duo, >>>>>>>>>>> >>>>>>>>>>> I’m trying to create a similar job for Apache ZooKeeper, but >>>>>>>>> unfortunately >>>>>>>>>>> haven’t got too much help on the Apache builds@ list so far, >> so >>>> I’m >>>>>>>>>>> rather asking you if you don’t mind. >>>>>>>>>>> >>>>>>>>>>> First, how have you set up the Hbase Github account that you >> use >>> in >>>>>> this >>>>>>>>>>> job to access the repo? >>>>>>>>>>> >>>>>>>>>>> Thanks, >>>>>>>>>>> Andor >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> On 2020. Jul 27., at 2:22, 张铎(Duo Zhang) < >> [email protected] >>>> >>>>>>>>> wrote: >>>>>>>>>>>> >>>>>>>>>>>> The pre commit job has been migrated to c-hadoop.a.o. >>>>>>>>>>>> >>>>>>>>>>>> I have disabled periodical scan for the old job on builds.a.o, >>> as >>>> we >>>>>>>>>>> still >>>>>>>>>>>> need to view the pre commit result on it do not delete for >> now. >>>> Will >>>>>>>>>>> delete >>>>>>>>>>>> it later, maybe after several weeks. >>>>>>>>>>>> >>>>>>>>>>>> The new job is here >>>>>>>>>>>> >>>>>>>>>>>> >>>>>> >> https://ci-hadoop.apache.org/job/HBase/job/HBase-PreCommit-GitHub-PR/ >>>>>>>>>>>> >>>>>>>>>>>> Thanks. >>>>>>>>>>>> >>>>>>>>>>>> 张铎(Duo Zhang) <[email protected]> 于2020年7月25日周六 下午9:44写道: >>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>> >>>>>> >>>> >>> >> https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/5/console >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> We successfully finished a nightly build. >>>>>>>>>>>>> >>>>>>>>>>>>> But seems the jiraComment did not work. I haven't seen the >>>> comment >>>>>>>>>>>>> on HBASE-24757... >>>>>>>>>>>>> >>>>>>>>>>>>> 张铎(Duo Zhang) <[email protected]> 于2020年7月25日周六 >> 下午4:51写道: >>>>>>>>>>>>> >>>>>>>>>>>>>> After installing two new jenkins plugins, the pre commit job >>>> seems >>>>>>>>> fine >>>>>>>>>>>>>> now. >>>>>>>>>>>>>> >>>>>>>>>>>>>> The last failure is because of a timeout, I assume the >> problem >>>> is >>>>>>>>> that >>>>>>>>>>> we >>>>>>>>>>>>>> do not have enough executors so all the jobs are executed >>>>>>>>> sequentially. >>>>>>>>>>>>>> >>>>>>>>>>>>>> Maybe we could move the pre commit job to the new env first? >>> The >>>>>>>>>>> nightly >>>>>>>>>>>>>> job and flaky job require more resources, and we need the >>> output >>>>>> of >>>>>>>>>>> these >>>>>>>>>>>>>> jenkins jobs(the flaky test list). >>>>>>>>>>>>>> >>>>>>>>>>>>>> Thanks. >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> 张铎(Duo Zhang) <[email protected]> 于2020年7月24日周五 >> 下午4:36写道: >>>>>>>>>>>>>> >>>>>>>>>>>>>>> The problem seems because of this: >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> https://issues.jenkins-ci.org/browse/JENKINS-48556 >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> I triggered the job again, it passed the timestamps call, >> and >>>>>> will >>>>>>>>>>> keep >>>>>>>>>>>>>>> an eye on it. >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> 张铎(Duo Zhang) <[email protected]> 于2020年7月21日周二 >>> 上午11:18写道: >>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> On the sponsors, we could have a try. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> The problem here is the process of the donation? IIRC >> there >>>> is a >>>>>>>>>>> thread >>>>>>>>>>>>>>>> on the infra mailing list about how to donate machines to >> a >>>>>>>>> specific >>>>>>>>>>>>>>>> project and the discussion did not go well... >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Sean Busbey <[email protected]> 于2020年7月21日周二 上午11:13写道: >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> We could check with ASF infra for the current state of >>> things >>>>>> wrt >>>>>>>>>>>>>>>>> GitHub >>>>>>>>>>>>>>>>> actions. I believe there is a queue set up across ASF >>>> projects. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> It has the same resource issue Travis had; things are >> fine >>>>>> until >>>>>>>>>>> some >>>>>>>>>>>>>>>>> critical mass of projects seeking better perf realize >> some >>>> new >>>>>>>>>>> option >>>>>>>>>>>>>>>>> is >>>>>>>>>>>>>>>>> available and then quickly all available resources are >>>>>> consumed. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> AFAICT the only option that gets us the same or better as >>> the >>>>>> H* >>>>>>>>>>> nodes >>>>>>>>>>>>>>>>> will >>>>>>>>>>>>>>>>> be finding sponsors and running our own. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> On Mon, Jul 20, 2020, 21:55 张铎(Duo Zhang) < >>>>>> [email protected]> >>>>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> I think our nightly, flakey, and pre commit jobs should >> be >>>>>>>>>>>>>>>>> transferred as a >>>>>>>>>>>>>>>>>> whole? They depend on each other. >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> I offer my help on the transition. >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> And on github CI, does ASF have a special deal with >>> github? >>>> If >>>>>>>>> not, >>>>>>>>>>>>>>>>> I do >>>>>>>>>>>>>>>>>> not think the default resource can fit our >> requirements... >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> Sean Busbey <[email protected]> 于2020年7月21日周二 上午1:49写道: >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> Hi folks! >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> Back in April there was a brief discussion[1] about ASF >>>>>> Infra's >>>>>>>>>>>>>>>>>>> notification that builds.a.o is going away and we are >>>>>> currently >>>>>>>>>>>>>>>>> slated >>>>>>>>>>>>>>>>>>> to migrate to a set of CI servers for "Hadoop and >> related >>>>>>>>>>>>>>>>> projects". >>>>>>>>>>>>>>>>>>> This is the ci farm that will contain the bulk of the >> H* >>>>>> worker >>>>>>>>>>>>>>>>> nodes >>>>>>>>>>>>>>>>>>> that are donated by Yahoo!, which are the nodes we've >>> been >>>>>>>>> running >>>>>>>>>>>>>>>>> on >>>>>>>>>>>>>>>>>>> for ages[2]. >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> Migration discussion still happens on the >>>>>>>>> [email protected] >>>>>>>>>>>>>>>>>>> list[3] and recently ASF Infra set a target date of >>> August >>>>>> 15th >>>>>>>>>>> for >>>>>>>>>>>>>>>>>>> turning off the existing builds.a.o server[4]. >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> That gives us a little under 4 weeks to have things up >>> and >>>>>>>>> working >>>>>>>>>>>>>>>>> on >>>>>>>>>>>>>>>>>>> the new ci-hadoop.a.o jenkins coordinator[5]. it’s not >>>> clear >>>>>> to >>>>>>>>> me >>>>>>>>>>>>>>>>>>> that the level of effort we’ll need to spend is worth >>> what >>>> we >>>>>>>>> get >>>>>>>>>>>>>>>>> out >>>>>>>>>>>>>>>>>>> of a continuation of the status quo on builds.a.o. I >> did >>> a >>>>>> quick >>>>>>>>>>>>>>>>> test >>>>>>>>>>>>>>>>>>> by updating the nightly job on ci-hadoop.a.o to run >> just >>>>>>>>> branch-2, >>>>>>>>>>>>>>>>>>> since that has been stable on builds.a.o. It failed >> with >>> a >>>>>>>>> Jenkins >>>>>>>>>>>>>>>>>>> pipeline DSL syntax error[6] so I'm assuming migrating >>> will >>>>>> be a >>>>>>>>>>>>>>>>> slog. >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> As far as I can see our options are: >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> * Do nothing. Have no testing or automated website >>>>>> publication >>>>>>>>> in >>>>>>>>>>>>>>>>> mid >>>>>>>>>>>>>>>>>>> August. >>>>>>>>>>>>>>>>>>> * Transition website publication and nothing else >>> (probably >>>>>> can >>>>>>>>> be >>>>>>>>>>>>>>>>>>> done in a day) >>>>>>>>>>>>>>>>>>> * Transition just precommit testing for various repos >>>>>> (probably >>>>>>>>>>>>>>>>> can be >>>>>>>>>>>>>>>>>>> done in a few days) >>>>>>>>>>>>>>>>>>> * Transition everything (no idea how long it takes due >> to >>>>>>>>> nightly, >>>>>>>>>>>>>>>>>>> flaky stuff, etc) >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> The alternatives if we do not transition any given job >> to >>>>>>>>>>>>>>>>> ci-hadoop: >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> * Try to move to GitHub Actions >>>>>>>>>>>>>>>>>>> * Try to move to Travis CI >>>>>>>>>>>>>>>>>>> * Try to move to Jenkins infra we maintain ourselves >>>>>> (presumably >>>>>>>>>>> by >>>>>>>>>>>>>>>>>>> soliciting project specific donations for worker nodes >> on >>>>>> cloud >>>>>>>>>>>>>>>>>>> vendors) >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> It's important to remember that as a project we have a >>>> heavy >>>>>>>>>>>>>>>>> footprint >>>>>>>>>>>>>>>>>>> wherever our nightly tests run. For context, a given >>>> branch's >>>>>>>>>>>>>>>>> nightly >>>>>>>>>>>>>>>>>>> can keep 3-4 executors busy for 6+ hours on the current >>>>>>>>> builds.a.o >>>>>>>>>>>>>>>>>>> setup. There's been a bunch of great work lately on >>>> bringing >>>>>>>>> down >>>>>>>>>>>>>>>>> what >>>>>>>>>>>>>>>>>>> it takes to run the full test suite, but applying that >>> work >>>>>> to >>>>>>>>>>>>>>>>> nightly >>>>>>>>>>>>>>>>>>> is itself a significant undertaking. >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> What are folks thinking? Most importantly who is ready >> to >>>>>> work >>>>>>>>>>>>>>>>> towards >>>>>>>>>>>>>>>>>>> any given approach? >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> [1] [DISCUSS] Migrating HBase to new CI Master >>>>>>>>>>>>>>>>>>> https://s.apache.org/fux1o >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> [2] https://builds.apache.org/view/H-L/view/HBase/ >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> [3] >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>> >>>>>> >> https://lists.apache.org/[email protected] >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> [4] [IMPORTANT] - 2 more HADOOP nodes migrated over to >>>>>> ci-hadoop >>>>>>>>>>>>>>>>>>> https://s.apache.org/7e1nq >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> [5] https://ci-hadoop.apache.org/job/HBase/ >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> [6] >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>> >>>>>> >>>> >>> >> https://ci-hadoop.apache.org/job/HBase/job/HBase%20Nightly/job/branch-2/2/console >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>> >>>>>> >>>>>> >>>> >>>> >>> >>
