Re: Block a user from spark-website who repeatedly open the invalid same PR

2020-01-26 Thread Shane Knapp
k-website/pull/250 >>>> https://github.com/apache/spark-website/pull/249 >>>> >>>> If there is no objection, and this guy opens the PR again, I am going to >>>> open an infra ticket to block >>>> this guy from spark-webiste repo to prevent such behaviours. >>>> >>>> Please let me know if you guys have any concerns. >>>> >>> >>> >>> -- >>> --- >>> Takeshi Yamamuro -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu - To unsubscribe e-mail: dev-unsubscr...@spark.apache.org

Re: Apache Spark Docker image repository

2020-02-05 Thread shane knapp
it'd be nice to have those available as there as well. ah, an atomic build environment... one can dream. :) shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

new branch-3.0 jenkins job configs are ready to be deployed...

2020-01-31 Thread shane knapp
...whenever i get the word. :) FWIW they will all be identical to the current group of master builds/tests. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

[build system] enabled the ubuntu staging node to help w/build queue

2020-02-11 Thread shane knapp
-- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: PR lint-scala jobs failing with http error

2020-01-16 Thread Shane Knapp
as the https version though. > > Anyone know why its trying to go to http version? > > > Tom -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu ---

Re: PR lint-scala jobs failing with http error

2020-01-16 Thread Shane Knapp
che/spark/pull/27239/checks?check_run_id=393884643 > > Tom > On Thursday, January 16, 2020, 03:17:03 PM CST, Shane Knapp > wrote: > > > i'm seeing a lot of green builds currently... if you think this is > still happening, please include links to the failed jobs. thanks! > &g

jenkins down next tuesday (jan 14th) morning

2020-01-09 Thread Shane Knapp
our colo is performing some electrical work next week, and jenkins will be down between 630-930am PST. sorry for the interruption in service... i'll be following up on this thread w/updates as they come in. -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff

Re: Jenkins looks hosed

2019-12-23 Thread Shane Knapp
yep, it was most definitely wedged. restarted the service and it's back up! On Mon, Dec 23, 2019 at 12:23 PM Shane Knapp wrote: > > checking it now. > > On Mon, Dec 23, 2019 at 11:27 AM Marcelo Vanzin > wrote: > > > > Just in the off-chance that someone with

Re: Jenkins looks hosed

2019-12-23 Thread Shane Knapp
aiting for your PR tests to finish (or even start > running). > > -- > Marcelo > > - > To unsubscribe e-mail: dev-unsubscr...@spark.apache.org > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EE

Re: Jenkins looks hosed

2019-12-23 Thread Shane Knapp
i'll be out of the country and on vacation until early january, but i'll make a point to check in every couple of days to ensure that jenkins is happy. On Mon, Dec 23, 2019 at 12:25 PM Shane Knapp wrote: > > yep, it was most definitely wedged. restarted the service and it's back up! >

Re: Auto-linking from PRs to Jira tickets

2020-03-11 Thread shane knapp
; NC> -- > NC> With best wishes,Alex Ott > NC> http://alexott.net/ > NC> Twitter: alexott_en (English), alexott (Russian) > > > > -- > With best wishes,Alex Ott > http://alexott.net/ > Twitter: alexott_en (Engl

Re: ./dev/run-tests failing at master

2020-05-14 Thread shane knapp
led in both python2 and python3. > > > > The same error persists. > > How should I proceed? > > > > -- > Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ > > ----- > To unsubscribe e

Re: [build system] jenkins rebooting now

2020-05-14 Thread shane knapp
we're back. doesn't seem to have fixed the issue of the workers connecting to repository.apache.org but i'm still investigating. On Thu, May 14, 2020 at 9:11 AM shane knapp ☠ wrote: > that is all. > > -- > Shane Knapp > Computer Guy / Voice of Reason > UC Berkeley EECS Resear

[build system] jenkins rebooting now

2020-05-14 Thread shane knapp
that is all. -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [build system] jenkins wedged again

2020-10-14 Thread shane knapp
everything's up and jenkins is slowly chewing through the queue! :) On Wed, Oct 14, 2020 at 12:00 PM Xiao Li wrote: > Thank you, Shane! > > Xiao > > On Wed, Oct 14, 2020 at 12:00 PM shane knapp ☠ > wrote: > >> we're mostly back up, and just waiting for a couple o

[build system] jenkins wedged again

2020-10-14 Thread shane knapp
i'm going to reboot the primary and worker nodes, so it'll be a few minutes before everything is back up. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [build system] jenkins wedged again

2020-10-14 Thread shane knapp
we're mostly back up, and just waiting for a couple of ubuntu boxes to finish booting... prb seem to be building now! On Wed, Oct 14, 2020 at 11:48 AM shane knapp ☠ wrote: > i'm going to reboot the primary and worker nodes, so it'll be a few > minutes before everything is back up. >

Re: Running K8s integration tests for changes in core?

2020-08-18 Thread shane knapp
er.com/holdenkarau > Books (Learning Spark, High Performance Spark, etc.): > https://amzn.to/2MaRAG9 <https://amzn.to/2MaRAG9> > YouTube Live Streams: https://www.youtube.com/user/holdenkarau > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: Running K8s integration tests for changes in core?

2020-08-19 Thread shane knapp
> >> > > -- > Twitter: https://twitter.com/holdenkarau > Books (Learning Spark, High Performance Spark, etc.): > https://amzn.to/2MaRAG9 <https://amzn.to/2MaRAG9> > YouTube Live Streams: https://www.youtube.com/user/holdenkarau > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: Running K8s integration tests for changes in core?

2020-08-20 Thread shane knapp
> > A presubmit(which includes K8s integration tests) build will be run, once > the PR receives LGTM from "Approved reviewers". This is one criteria that > comes to my mind, others may have better suggestions. > > On Thu, Aug 20, 2020 at 12:25 AM shane knapp ☠ > wrote

[build system] shane out all next week (aug 22-29), support instructions

2020-08-20 Thread shane knapp
the number of tickets opened. :) if there are any other problems, file a JIRA and assign to me. i will look at it in early september. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

[build system] restarting jenkins now

2020-08-14 Thread shane knapp
there isn't much activity right now, and i'd like to restart jenkins quickly as it's consuming a lot of memory on the head node. shouldn't be more than a couple of minutes downtime... if something goes awry i'll send an email here. if you don't hear from me again, please carry on. :) -- Shane

[build system] downtime due to SSL cert errors

2020-09-23 Thread shane knapp
jenkins is up and building, but not reachable via https at the moment. i'm working on getting this sorted ASAP. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [build system] downtime due to SSL cert errors

2020-09-24 Thread shane knapp
certs delivered and installed... we're back! On Wed, Sep 23, 2020 at 6:07 PM shane knapp ☠ wrote: > jenkins is up and building, but not reachable via https at the moment. > i'm working on getting this sorted ASAP. > > shane > -- > Shane Knapp > Computer Guy / Voice of Reas

Re: Running K8s integration tests for changes in core?

2020-09-24 Thread shane knapp
ads up. I hope you get some time to relax :) > > On Thu, Aug 20, 2020 at 2:26 PM shane knapp ☠ wrote: > >> fyi, i won't be making this change until the 1st week of september. i'll >> be out, off the grid all next week! :) >> >> i will send an announcement out tom

Re: Build time limit in PR builder

2020-05-28 Thread shane knapp
On Thu, May 28, 2020 at 7:16 AM Sean Owen wrote: > What else can we do, I suppose? > > there,s not much else we can do. i'll add 30m to the timeout. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: Build time limit in PR builder

2020-05-28 Thread shane knapp
s.py#L201 > :-). > > 2020년 5월 29일 (금) 오전 12:14, shane knapp ☠ 님이 작성: > >> On Thu, May 28, 2020 at 7:16 AM Sean Owen wrote: >> >>> What else can we do, I suppose? >>> >>> there,s not much else we can do. i'll add 30m to the timeout. >>

Re: Build time limit in PR builder

2020-05-28 Thread shane knapp
https://github.com/apache/spark/pull/28666 On Thu, May 28, 2020 at 11:20 AM shane knapp ☠ wrote: > i'll get a PR put together now. > > On Thu, May 28, 2020 at 8:26 AM Hyukjin Kwon wrote: > >> I remember we were able to cut down pretty considerably in the past. For >>

Re: Build time limit in PR builder

2020-05-28 Thread shane knapp
the timer is set to 500m now in master, 3.0 and 2.4. On Thu, May 28, 2020 at 12:32 PM Kousuke Saruta wrote: > Thanks all. It's very helpful! > > - Kousuke > > On 2020/05/29 3:31, shane knapp ☠ wrote: > > https://github.com/apache/spark/pull/28666 > > On Thu, May 28, 2

R installation broken on ubuntu workers, impacts K8s PRB builds

2020-07-15 Thread shane knapp
of downtime. i'll file a JIRA, and figure out when i will be able to get to this... possibly this afternoon. -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-07 Thread shane knapp
i wasn't able to get to it today, so i'm hoping to squeeze in a quick trip to the colo tomorrow morning. if not, then first thing thursday. -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: Jenkins is down

2020-07-05 Thread shane knapp
t; Hi all and Shane, >> >> Is there something wrong with the Jenkins machines? Seems they are down. >> > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-09 Thread shane knapp
ok, we're back up and building (just waiting for one worker, -06 to finish cleaning itself up). On Thu, Jul 9, 2020 at 9:30 AM shane knapp ☠ wrote: > this is happening now. > > On Wed, Jul 8, 2020 at 9:07 AM shane knapp ☠ wrote: > >> this will be happening tomorrow... tod

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-09 Thread shane knapp
this is happening now. On Wed, Jul 8, 2020 at 9:07 AM shane knapp ☠ wrote: > this will be happening tomorrow... today is Meeting Hell Day[tm]. > > On Tue, Jul 7, 2020 at 1:59 PM shane knapp ☠ wrote: > >> i wasn't able to get to it today, so i'm hoping to squeeze in a quick &

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-08 Thread shane knapp
this will be happening tomorrow... today is Meeting Hell Day[tm]. On Tue, Jul 7, 2020 at 1:59 PM shane knapp ☠ wrote: > i wasn't able to get to it today, so i'm hoping to squeeze in a quick trip > to the colo tomorrow morning. if not, then first thing thursday. > > -- > Shane K

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-09 Thread shane knapp
and -06 is back! i'll keep an eye on things today, but suffice to say on each worker i: 1) rebooted 2) cleaned ~/.ivy2, ~/.m2, and other associated caches we should be g2g! please reply here if you continue to see weirdness. On Thu, Jul 9, 2020 at 10:08 AM shane knapp ☠ wrote: >

Re: m2 cache issues in Jenkins?

2020-07-06 Thread shane knapp
>>>>> Huh interesting that it’s the same worker. Have you filed a ticket to >>>>>> Shane? >>>>>> >>>>>> On Wed, Jul 1, 2020 at 8:50 PM Hyukjin Kwon >>>>>> wrote: >>>>>> >>>>&g

Re: m2 cache issues in Jenkins?

2020-07-06 Thread shane knapp
i killed and retriggered the PRB jobs on 04, and wiped that workers' m2 cache. On Mon, Jul 6, 2020 at 9:24 AM shane knapp ☠ wrote: > once the jobs running on that worker are finished, yes. > > On Sun, Jul 5, 2020 at 7:41 PM Hyukjin Kwon wrote: > >> Shane, can we remove .m2 i

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-10 Thread shane knapp
;> >> >> >> -- >> Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ >> >> ----- >> To unsubscribe e-mail: dev-unsubscr...@spark.apache.org >> >> --

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-13 Thread shane knapp
feel we're out of the woods right now. :) shane On Fri, Jul 10, 2020 at 3:43 PM Frank Yin wrote: > Great. Thanks. > > On Fri, Jul 10, 2020 at 3:39 PM shane knapp ☠ wrote: > >> no, 8 hours is plenty. things will speed up soon once the backlog of >> builds work

Re: [DISCUSS] Drop Python 2, 3.4 and 3.5

2020-07-14 Thread shane knapp
tages by dropping them: >>>>>>>>> 1. It removes a bunch of hacks we added around 700 lines in >>>>>>>>> PySpark. >>>>>>>>> 2. PyPy2 has a critical bug that causes a flaky test, >>>>>>>>> https://issues.apache.org

Re: Welcoming some new Apache Spark committers

2020-07-14 Thread shane knapp
al > > All three of them contributed to Spark 3.0 and we’re excited to have them > join the project. > > Matei and the Spark PMC > - > To unsubscribe e-mail: dev-unsubscr...@spark.apache.org > > -- Sha

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-10 Thread shane knapp
; infrastructure? > > On Fri, Jul 10, 2020 at 8:19 AM shane knapp ☠ wrote: > >> yeah, i can't do much for flaky tests... just flaky infrastructure. >> >> >> On Fri, Jul 10, 2020 at 12:41 AM Hyukjin Kwon >> wrote: >> >>> Couple of flaky

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-10 Thread shane knapp
arkPullRequestBuilder/125563/console > > https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125562/console > > https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125561/console > > On Fri, Jul 10, 2020 at 9:35 AM shane knapp ☠ wrote: > >>

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-10 Thread shane knapp
hanks. >> >> On Fri, Jul 10, 2020 at 12:43 PM shane knapp ☠ >> wrote: >> >>> only 125561, 125562 and 125564 were impacted by -9. >>> >>> 125565 exited w/a code of 15 (143 - 128), which means the process was >>> terminated for unknown rea

Re: m2 cache issues in Jenkins?

2020-07-06 Thread shane knapp
te: > >> Could this be a flaky or persistent issue? It failed with Scala gendoc >> but it didn't fail with the part the PR modified. It ran from worker-05. >> >> >> https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/125121/consoleFull >> &g

Re: restarting jenkins build system tomorrow (7/8) ~930am PDT

2020-07-09 Thread shane knapp
i'm seeing green PRB builds now, so i feel that we've gotten things building again! :) On Thu, Jul 9, 2020 at 5:33 PM Hyukjin Kwon wrote: > Thank you Shane. > > 2020년 7월 10일 (금) 오전 2:35, shane knapp ☠ 님이 작성: > >> and -06 is back! i'll keep an eye on things today, but

Re: m2 cache issues in Jenkins?

2020-06-24 Thread shane knapp
ks (Learning Spark, High Performance Spark, etc.): > https://amzn.to/2MaRAG9 <https://amzn.to/2MaRAG9> > YouTube Live Streams: https://www.youtube.com/user/holdenkarau > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: m2 cache issues in Jenkins?

2020-06-24 Thread shane knapp
: > The most recent one I noticed was > https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/124437/console > which > was run on amp-jenkins-worker-04. > > On Wed, Jun 24, 2020 at 10:44 AM shane knapp ☠ > wrote: > >> for those weird failures, it's super he

Re: m2 cache issues in Jenkins?

2020-06-24 Thread shane knapp
done: -bash-4.1$ cd .m2 -bash-4.1$ ls repository -bash-4.1$ time rm -rf * real17m4.607s user0m0.950s sys 0m18.816s -bash-4.1$ On Wed, Jun 24, 2020 at 10:50 AM shane knapp ☠ wrote: > ok, i've taken that worker offline and once the job running on it > finishes, i'll wipe the

Re: R installation broken on ubuntu workers, impacts K8s PRB builds

2020-07-17 Thread shane knapp
starting now... pausing jenkins so no new builds are launched. On Thu, Jul 16, 2020 at 3:09 PM Holden Karau wrote: > Sounds good, thanks. No rush :) > > On Thu, Jul 16, 2020 at 3:03 PM shane knapp ☠ wrote: > >> i'll get to this tomorrow afternoon, and there will be a short

Re: R installation broken on ubuntu workers, impacts K8s PRB builds

2020-07-17 Thread shane knapp
this is done, except for amp-jenkins-staging-worker-02 which is refusing to allow me to reinstall R... i marked that worker offline and will beat on it later today. On Fri, Jul 17, 2020 at 11:36 AM shane knapp ☠ wrote: > starting now... pausing jenkins so no new builds are launc

Re: R installation broken on ubuntu workers, impacts K8s PRB builds

2020-07-16 Thread shane knapp
-32326 > > On Wed, Jul 15, 2020 at 12:09 PM shane knapp ☠ > wrote: > >> i'm not entirely sure when the dep for R got bumped to 3.5+, but it's >> breaking the k8s builds. >> >> i'll need to purge these workers of all previous versions of R + >> packages, the

Re: jenkins downtime tomorrow evening/weekend

2020-11-21 Thread shane knapp
this is starting now On Thu, Nov 19, 2020 at 4:34 PM shane knapp ☠ wrote: > i'm going to be upgrading jenkins to something more reasonable, and there > will definitely be some downtime as i get things sorted. > > we should be back up and building by monday. > > shane

Re: jenkins downtime tomorrow evening/weekend

2020-11-21 Thread shane knapp
somehow that went pretty smoothly, tho i've got a bunch of plugins to deal with... we're back up and building w/a shiny new UI. :) On Sat, Nov 21, 2020 at 3:52 PM shane knapp ☠ wrote: > this is starting now > > On Thu, Nov 19, 2020 at 4:34 PM shane knapp ☠ wrote: >

Re: [build system] IMPORTANT UPDATE

2020-11-25 Thread shane knapp
at 6:08 PM shane knapp ☠ wrote: > all spark builds have been ported and triggered: > > https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/ > > not shown are the regular and k8s PRB, which are also running. > > i think i've nailed down most of the stup

Re: [build system] IMPORTANT UPDATE

2020-11-25 Thread shane knapp
On Wed, Nov 25, 2020 at 1:35 PM shane knapp ☠ wrote: > hey all, work is going quite well and smoothly for this project. > > today's update: > > we will experience significant downtime monday/tuesday as we spin up the > new primary jenkins node. until then, we'll be building

[build system] WE'RE LIVE!

2020-12-01 Thread shane knapp
for his work on the project! we couldn't have done it w/o him. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [build system] WE'RE LIVE!

2020-12-04 Thread shane knapp
c 2nd failed: > > https://amplab.cs.berkeley.edu/jenkins/view/Spark%20Packaging/job/spark-master-maven-snapshots/3186/ > > Not sure if this is result of upgrade? > > Thanks, > Tom > On Tuesday, December 1, 2020, 06:55:27 PM CST, shane knapp ☠ < > skn...@berkeley.edu> wrote:

Re: [build system] WE'RE LIVE!

2020-12-04 Thread shane knapp
ok, it's broken on the new nodes, so i tied the project to ubuntu16. i'll create a jira and investigate further at a later date. On Fri, Dec 4, 2020 at 8:58 AM shane knapp ☠ wrote: > no, it isn't but i'll try and take a look at this later today. > > On Fri, Dec 4, 2020 at 7:12 AM T

Re: jenkins downtime tomorrow evening/weekend

2020-11-24 Thread shane knapp
> > Please see https://issues.apache.org/jira/browse/SPARK-27177 for more > details. > > On Tue, Nov 24, 2020 at 8:23 AM shane knapp ☠ wrote: > >> it seems that the plugin upgrade went as smoothly as it could have... i >> still have a bunch of stack traces to filter th

Re: [build system] IMPORTANT UPDATE

2020-11-24 Thread shane knapp
! shane On Tue, Nov 24, 2020 at 11:24 AM shane knapp ☠ wrote: > this is a lengthy, but important read for everyone here. > > in the next few days, the remaining centos machines (PRB/SBT workers AND > primary) will have be reimaged from centos6.9 to ubuntu 20.04LTS. > > this mea

[build system] IMPORTANT UPDATE

2020-11-24 Thread shane knapp
like to have helped find the build system a new home, and sunset jenkins. over the past 11 years (i think), this system has built spark. it's getting a little tired and needs a well deserved break. :) shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff

Re: [build system] IMPORTANT UPDATE

2020-11-24 Thread shane knapp
: rack rearrangement, cleaning up networking, fixing hardware, reimaging and generally kicking ass! have a great holiday! shane On Tue, Nov 24, 2020 at 2:24 PM shane knapp ☠ wrote: > our very first ubuntu-based PRB is running: > https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestB

Re: [build system] IMPORTANT UPDATE

2020-11-24 Thread shane knapp
our very first ubuntu-based PRB is running: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/131701/ crossing my fingers! :) On Tue, Nov 24, 2020 at 1:30 PM shane knapp ☠ wrote: > due to scheduling, upcoming holiday and in-the-colo work requirements, all > of the

Re: jenkins downtime tomorrow evening/weekend

2020-11-23 Thread shane knapp
me here. also, my backlog of things i need to install will be addressed this week. the ansible is coming along nicely! On Mon, Nov 23, 2020 at 2:11 PM shane knapp ☠ wrote: > the third most terrifying event in the world, a massive jenkins plugin > update is happening in a couple of hours

Re: [build system] jenkins downtime today/tomorrow

2020-12-01 Thread shane knapp
and move on to fixing any lingering environment/system issues that pop up. shane On Mon, Nov 30, 2020 at 4:01 PM shane knapp ☠ wrote: > amplab jenkins is down. > > On Mon, Nov 30, 2020 at 3:25 PM shane knapp ☠ wrote: > >> old jenkins is getting shut down Real Soon Now[tm]! cros

[build system] jenkins downtime today/tomorrow

2020-11-30 Thread shane knapp
. shane/brian/jon -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: jenkins downtime tomorrow evening/weekend

2020-11-23 Thread shane knapp
. shane On Sat, Nov 21, 2020 at 4:23 PM shane knapp ☠ wrote: > somehow that went pretty smoothly, tho i've got a bunch of plugins to deal > with... we're back up and building w/a shiny new UI. :) > > On Sat, Nov 21, 2020 at 3:52 PM shane knapp ☠ wrote: > >> this is starting

Re: [build system] jenkins downtime today/tomorrow

2020-11-30 Thread shane knapp
amplab jenkins is down. On Mon, Nov 30, 2020 at 3:25 PM shane knapp ☠ wrote: > old jenkins is getting shut down Real Soon Now[tm]! crossing my fingers! > :) > > On Mon, Nov 30, 2020 at 10:05 AM shane knapp ☠ > wrote: > >> hey all! >> >> the Great Jen

Re: [build system] jenkins downtime today/tomorrow

2020-11-30 Thread shane knapp
old jenkins is getting shut down Real Soon Now[tm]! crossing my fingers! :) On Mon, Nov 30, 2020 at 10:05 AM shane knapp ☠ wrote: > hey all! > > the Great Jenkins Migration[tm] is well under way, and we will be > sunsetting the old amp-jenkins-master server and moving to a new o

[build system] jenkins downtime 01/02/2021 - 01/03/2020

2020-12-21 Thread shane knapp
spark jira. :) -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

jenkins downtime tomorrow evening/weekend

2020-11-19 Thread shane knapp
i'm going to be upgrading jenkins to something more reasonable, and there will definitely be some downtime as i get things sorted. we should be back up and building by monday. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https

[build system] IMPORTANT: builds will be impacted this month

2020-11-02 Thread shane knapp
things up to date while trying to remotely train up one of my sysadmins to take over some of my build system duties. -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [FYI] CI Infra issues (in both GitHub Action and Jenkins)

2021-01-08 Thread shane knapp
QA%20Test%20(Dashboard)/job/spark-master-test-sbt-hadoop-3.2/1836/console > > https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-sbt-hadoop-2.7/1887/console > > On Fri, Jan 8, 2021 at 2:13 PM shane knapp ☠ wrote: > >> 1. Jenkins machines

Re: [FYI] CI Infra issues (in both GitHub Action and Jenkins)

2021-01-08 Thread shane knapp
kins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-sbt-hadoop-2.7/1887/console >> >> On Fri, Jan 8, 2021 at 2:13 PM shane knapp ☠ wrote: >> >>> 1. Jenkins machines start to fail with the following recently. >>>> (master branch) >>

Re: [FYI] CI Infra issues (in both GitHub Action and Jenkins)

2021-01-08 Thread shane knapp
> > 1. Jenkins machines start to fail with the following recently. > (master branch) > > Python versions prior to 3.6 are not supported. > Build step 'Execute shell' marked build as failure > > examples please? -- Shane Knapp Computer Guy / Voice of Reason U

Re: How to think about SparkPullRequestBuilder-K8s?

2021-06-11 Thread shane knapp
the PR seems totally unrelated to K8S. I've kind of learned to > ignore them in that case but that seems wrong. Are they just kind of flaky? > am I imagining things? Just trying to figure out how much they're > 'accurate' in catching real vs false failures. > -- Shane Knapp Computer

Re: How to think about SparkPullRequestBuilder-K8s?

2021-06-11 Thread shane knapp
we're back. On Fri, Jun 11, 2021 at 2:30 PM shane knapp ☠ wrote: > btw i just noticed jenkins was down, and i restarted the primary node. > > On Fri, Jun 11, 2021 at 12:09 PM Sean Owen wrote: > >> I find that somewhat often, the K8S PR builders will fail

Re: quick jenkins restart

2021-07-09 Thread shane knapp
we're back up! On Fri, Jul 9, 2021 at 10:23 AM shane knapp ☠ wrote: > the primary is running out of memory pretty quickly, and i'm going to > reboot the server quickly so that it doesn't crash over the weekend. > > we'll investigate a bit more next week. > > shane > -- >

quick jenkins restart

2021-07-09 Thread shane knapp
the primary is running out of memory pretty quickly, and i'm going to reboot the server quickly so that it doesn't crash over the weekend. we'll investigate a bit more next week. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https

Re: [build system] jenkins down, working on it

2021-05-04 Thread shane knapp
we're back and building! On Tue, May 4, 2021 at 4:03 PM shane knapp ☠ wrote: > jenkins went down some time in the past few days, and i'm currently > investigating. > > if it's been down a while, i apologize as i've been dealing w/some health > issues. > > shane > -- >

[build system] jenkins down, working on it

2021-05-04 Thread shane knapp
jenkins went down some time in the past few days, and i'm currently investigating. if it's been down a while, i apologize as i've been dealing w/some health issues. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https

[build system] short downtime today, new workers coming soon

2021-03-23 Thread shane knapp
jenkins is acting up, and i'm going to take the opportunity to reboot the primary and all the workers. sorry for the short notice, but on the bright side we have a bunch of shiny new workers coming soon! shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab

Re: [build system] short downtime today, new workers coming soon

2021-03-23 Thread shane knapp
we're back! On Tue, Mar 23, 2021 at 12:31 PM shane knapp ☠ wrote: > jenkins is acting up, and i'm going to take the opportunity to reboot the > primary and all the workers. > > sorry for the short notice, but on the bright side we have a bunch of > shiny new workers coming

Re: [build system] github fetches timing out

2021-03-10 Thread shane knapp
...and just like that, overnight the builds started successfully git fetching! On Tue, Mar 9, 2021 at 12:31 PM shane knapp ☠ wrote: > it looks like over the past few days the master/branch builds have been > timing out... this hasn't happened in a few years, and honestly the last &

[build system] github fetches timing out

2021-03-09 Thread shane knapp
i had a more concrete answer or solution for what's going on... i'll continue to investigate as best i can today, and if this continues, i'll re-open my issue w/github and see if they can shed any light on the situation. shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research

Re: [build system] github fetches timing out

2021-03-17 Thread shane knapp
it's been happening a lot again recently... i'm investigating. On Wed, Mar 10, 2021 at 10:23 AM Liang-Chi Hsieh wrote: > Thanks Shane for looking at it! > > > shane knapp ☠ wrote > > ...and just like that, overnight the builds started successfully git > > fetching! &g

Re: minikube and kubernetes cluster versions for integration testing

2021-03-03 Thread shane knapp
n Mac but with a simple sed expression it can be tailored to >> linux too. >> >> >> >> *After all of this my questions:* >> *A) What about to change the required versions and suggest to use >> kubernetes v1.17.3 and Minikube v1.7.3 and greater for integration testing?* >> >> I would chose v1.17.3 for k8s cluster as that is the newest supported k8s >> version for that Minikube v1.7.3 (hoping it will be good for us for a long >> time). >> If you agree with this suggestion I go ahead and update the relevant >> documentation. >> >> >> >> *B) How about extending the integration test to check whether the >> Minikube version is sufficient? *By this we can provide a meaningful >> error when it is violated. >> >> Bests, >> Attila >> > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

[build system] jenkins wedged, going to restart after current builds finish

2021-02-23 Thread shane knapp
EOM -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: K8s integration test failure ("credentials Jenkins is using is probably wrong...")

2021-02-23 Thread shane knapp
>> probably wrong. Or the user account does not have write access to the repo. >> >> >> See >> https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder-K8s/39934/consoleFull >> >> Can anybody please advise? >> >> Thanks in advance. >> >> Phillip >> >> >> -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: K8s integration test failure ("credentials Jenkins is using is probably wrong...")

2021-02-23 Thread shane knapp
stupid bash variable assignment. i'm surprised this has lingered for as long as it had (3 years). it's fixed and shouldn't be an issue any more. On Tue, Feb 23, 2021 at 9:28 AM shane knapp ☠ wrote: > the AmplabJenks bot's github creds are out of date, which is causing that > non-fatal

Re: [build system] jenkins wedged, going to restart after current builds finish

2021-02-23 Thread shane knapp
this was done about an hour ago... rebooted several of the workers to clear out lingering builds, and one worker had an SSD fail on boot and is currently offline. shane On Tue, Feb 23, 2021 at 10:13 AM shane knapp ☠ wrote: > EOM > > -- > Shane Knapp > Computer Guy / Voice

Re: minikube and kubernetes cluster versions for integration testing

2021-03-04 Thread shane knapp
park-developers-list.1001551.n3.nabble.com/ > > - > To unsubscribe e-mail: dev-unsubscr...@spark.apache.org > > -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

please read: current state and the future of the apache spark build system

2021-04-07 Thread shane knapp
, but some things might be flaky. but the biggest question is what you all need w/regards to build infrastructure... and who's going to be responsible for it. thanks for reading! :) shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https

Re: please read: current state and the future of the apache spark build system

2021-04-14 Thread shane knapp
ng this out today: https://github.com/apache/spark/pull/32178 shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [SPARK-34738] issues w/k8s+minikube and PV tests

2021-04-15 Thread shane knapp
that one > test fails because it relies on some minikube specific functionality. That > test could be refactored because I think it’s just adding a minimal Ceph > cluster to the K8S cluster which can be done to any K8S cluster in principal > > > > > > > > Rob > >

Re: [SPARK-34738] issues w/k8s+minikube and PV tests

2021-04-14 Thread shane knapp
On Wed, Apr 14, 2021 at 10:32 AM Frank Luo wrote: > Is there any hard dependency on minkube? (i.e, GPU setting), kind ( > https://kind.sigs.k8s.io/) is a stabler and simpler k8s cluster env on a > single machine (only requires docker) , it been widely used by k8s projects > testing. > > there

[SPARK-34738] issues w/k8s+minikube and PV tests

2021-04-14 Thread shane knapp
virtualization layer. thanks in advance, shane -- Shane Knapp Computer Guy / Voice of Reason UC Berkeley EECS Research / RISELab Staff Technical Lead https://rise.cs.berkeley.edu

Re: [SPARK-34738] issues w/k8s+minikube and PV tests

2021-04-16 Thread shane knapp
next week. On Thu, Apr 15, 2021 at 3:05 PM shane knapp ☠ wrote: > i'm all for that... and once they're turned off, we can finish the > minikube/k8s/move-to-docker project in a couple of hours max. > > On Thu, Apr 15, 2021 at 3:00 PM Holden Karau wrote: > >> What about if we

<    2   3   4   5   6   7   8   >