Re: pyspark on yarn hdp hortonworks

2014-09-05 Thread Greg Hill
: Wednesday, September 3, 2014 4:19 PM To: Oleg Ruchovets oruchov...@gmail.commailto:oruchov...@gmail.com Cc: user@spark.apache.orgmailto:user@spark.apache.org user@spark.apache.orgmailto:user@spark.apache.org Subject: Re: pyspark on yarn hdp hortonworks Hi Oleg, There isn't much you need to do

Re: pyspark on yarn hdp hortonworks

2014-09-03 Thread Andrew Or
Hi Oleg, There isn't much you need to do to setup a Yarn cluster to run PySpark. You need to make sure all machines have python installed, and... that's about it. Your assembly jar will be shipped to all containers along with all the pyspark and py4j files needed. One caveat, however, is that the