Hi Deepak,

Hortonworks has its own how-to post here: http://hortonworks.com/blog/introduction-to-data-science-with-apache-spark/
Hope it helps.

Thanks,
-Randy

From: "ÐΞ€ρ@Ҝ (๏̯͡๏)"
Reply-To: users@zeppelin.incubator.apache.org
Date: Monday, August 3, 2015 at 9:48 AM
To: users@zeppelin.incubator.apache.org
Subject: Re: Yarn + Spark + Zepplin ?

I used Ambari to set up the Hadoop cluster, and they claim that theirs is a pure open source version of Hadoop (2.7.x). I am not using CDH.

On Mon, 3 Aug 2015 at 6:01 AM Todd Nist <tsind...@gmail.com> wrote:

Not sure which Hadoop package (vendor) you are using, but there is a good write-up here on how to configure Zeppelin with YARN for Cloudera. I would imagine most of it will carry over to any Hadoop package.

http://blog.cloudera.com/blog/2015/07/how-to-install-apache-zeppelin-on-cdh/

HTH
-Todd

On Mon, Aug 3, 2015 at 12:24 AM, ÐΞ€ρ@Ҝ (๏̯͡๏) <deepuj...@gmail.com> wrote:

Hello,

I would like to try out Zeppelin, so I have a 7-node Hadoop cluster with the Spark history server set up. I am able to run sample Spark applications on my YARN cluster, but I have no clue how to get Zeppelin to connect to this YARN cluster.

Under https://zeppelin.incubator.apache.org/docs/install/install.html I see that MASTER should point to the Spark master. I do not have a Spark master running. How do I get Zeppelin to read data from the YARN cluster?

Please share documentation.

Regards,
Deepak
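
A note on the MASTER question above: with Spark 1.x-era releases, a YARN cluster has no standalone Spark master to point at, and MASTER is instead set to a YARN mode. The fragment below is a minimal sketch of what conf/zeppelin-env.sh might look like in that case. The HADOOP_CONF_DIR path is an assumption for illustration and is not taken from this thread; vendor distributions may need further settings, which the linked how-to posts cover.

```shell
# conf/zeppelin-env.sh (sketch only; adjust for your cluster)

# Run the Spark interpreter against YARN in client mode, so no standalone
# Spark master is required.
export MASTER=yarn-client

# Directory holding the cluster's client configs (yarn-site.xml, core-site.xml).
# /etc/hadoop/conf is an assumed location, not taken from the thread.
export HADOOP_CONF_DIR=/etc/hadoop/conf
```

Zeppelin would need a restart after editing this file for the settings to take effect.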