Github user shaneknapp commented on the issue: https://github.com/apache/spark/pull/19884 @BryanCutler @HyukjinKwon pandas and pyarrow are most definitely installed on all of the jenkins workers. the 'missing' packages happened after we had a power outage at the colo, and the jenkins workers rebooted while the master (on UPS) didn't. this causes the PATH env var to be dropped, which means that instead of seeing the anaconda installation in PATH, jenkins defaults to system python (which has the absolute minimum of packages installed). regarding the pyarrow upgrade: let's schedule it for wednesday (tomorrow) morning, EST. i'm about to get on another plane and have a few more hours of traveling left today.
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org