Need Help Diagnosing/operating/tuning

2015-11-22 Thread Jeremy Davis
Hello, I’m at a loss trying to diagnose why my Spark job is failing (it works fine on small data). It fails during the repartition, or on the subsequent steps, which then seem to fail and fall back to repartitioning. I’ve tried adjusting every parameter I can find, but have had no success. I

Re: Need Help Diagnosing/operating/tuning

2015-11-22 Thread Jeremy Davis
It seems like the problem is related to --executor-cores. Is there possibly some sort of race condition when using multiple cores per executor? On Nov 22, 2015, at 12:38 PM, Jeremy Davis <jda...@marketshare.com> wrote: Hello, I’m at a loss trying to diagnose why my spark job is f

Re: OutOfMemory while calculating window functions

2016-09-21 Thread Jeremy Davis
0))) .withColumn("min"+(-win),min("value").over(baseWin.rowsBetween(-win,0))) .withColumn("max"+(-win),max("value").over(baseWin.rowsBetween(-win,0))) } resultFrame.show() } } > On Sep 20, 2016, at 10:26 PM, Jeremy Dav

Limit on the number of Jobs per Application

2018-05-30 Thread Jeremy Davis
I have an application that does many thousands of univariate GLM regressions and seems to break down after completing around 25K jobs. Plenty of resources are free (disk, network, memory, CPU), but eventually it is only scheduling on a few threads (out of 400+ possible on the cluster). No task failu