Hi,

Thanks for sharing this helpful document. Would you mind sharing a link if it is published?
Best,
moon

On Tue, Aug 4, 2015 at 8:13 AM ÐΞ€ρ@Ҝ (๏̯͡๏) <deepuj...@gmail.com> wrote:

> Introduction
>
> A web-based notebook that enables interactive data analytics. You can
> make beautiful data-driven, interactive and collaborative documents with
> SQL, Scala and more. More details: https://zeppelin.incubator.apache.org/.
> Zeppelin integrates with Spark, SQL and Hive, among others, to process
> data, visualize it and share reports. This wiki describes how to set up
> Zeppelin on an existing YARN cluster. The YARN cluster is created on
> dev c3 using Ambari.
>
> Setup
>
> We assume that a YARN cluster is available along with a Spark history
> server.
>
> c3 Instance
>
> Create a c3 xLarge instance with CentOS 6.4.x as the OS. A new instance is
> chosen to make sure there are enough resources available for Zeppelin.
>
> Prepare
>
> The Zeppelin node needs to have Hadoop clients installed, and hence it must
> be prepared before the clients can be installed from Ambari. Run the
> commands below to prepare the Zeppelin node.
>
> *Prepare*
> echo "ssh-rsa AAAAB3NzaC1yc2EAAAABIwAAAQEAzBihXIpeZey1G1tQecThBZnJarkX2GjzbE+aQ8dL8TsHchAnwWGVwEmiSNes1O/2L7NV1OpO97gbG3DxhZ8joSxkv0or8WWh17FHY0wdS8ypypffE0YKWxeEJqTbTz6y0pizpZuexi2Sq07On3Nln2me9atVvDE0s0U0vH7JMYgcKSDTog/pvNk6Le54RRkQz5yi8bVDZiOMfhJfn2phXmNB42Upij+kiClXXOEz2E70fQo0Bo5+iTNF/oxSk1vrtYDOHtxGcPZYe60TEp8dASB8NG732vgOs6eecR4LQcGKiBN6uDEuMd3vWMK8or59tCVrEh+/h+2XipZ3hnmu7w== root@ambariserver-3409" >> /root/.ssh/authorized_keys
> # So that the root user on the Ambari server can log in to the Zeppelin
> # server without a password.
>
> ntpd
> hostname -f
> setenforce 0
> chkconfig iptables off
> /etc/init.d/iptables stop
> ipaddr=$(ifconfig | grep 'inet addr:' | grep -v '127.0.0.1' | cut -d: -f2 | awk '{ print $1 }')
> fhost=$(hostname -f)
> echo "$ipaddr $fhost $(hostname)"
> echo "$ipaddr $fhost $(hostname)" >> /etc/hosts
> cat /etc/hosts
> # Ensure the full hostname is present in /etc/hosts and that hostname -f
> # shows the full hostname.
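The last comment in the Prepare block (make sure the full hostname is present in /etc/hosts) can also be checked mechanically instead of by eye. Below is a minimal sketch that is not part of the original wiki: hosts_has_fqdn is a hypothetical helper name, and the optional second argument is an assumption added only so the check can be pointed at a copy of the file. It matches the given name against any hostname column, not only the first one.

```shell
# Hypothetical helper (not from the wiki): succeeds if the hosts file
# has a non-comment entry that lists the given hostname.
#   $1 = hostname to look for, e.g. the output of `hostname -f`
#   $2 = hosts file to inspect (defaults to /etc/hosts)
hosts_has_fqdn() {
    awk -v h="$1" '
        # Skip lines that are entirely comments.
        /^[[:space:]]*#/ { next }
        {
            # Field 1 is the IP address; the remaining fields are names.
            for (i = 2; i <= NF; i++) {
                if ($i ~ /^#/) break      # stop at an inline comment
                if ($i == h) found = 1
            }
        }
        END { exit (found ? 0 : 1) }
    ' "${2:-/etc/hosts}"
}
```

After running the Prepare commands, something like `hosts_has_fqdn "$(hostname -f)" || echo "full hostname missing from /etc/hosts"` would flag a node that is not ready to be added from Ambari.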
>
> Once the Zeppelin server is prepared, go to the Ambari web interface and
> run an action to add a new host from the Hosts tab. Install Hadoop clients
> only.
>
> Building Zeppelin
>
> In order to build Zeppelin, Apache Maven, JDK 1.7 and Git need to be
> installed. You can run the commands below as the root user (root
> permissions are not mandatory).
>
> *Install Git/Java/Maven*
> # Install Git
> yum install curl-devel expat-devel gettext-devel openssl-devel zlib-devel
> yum install gcc perl-ExtUtils-MakeMaker
> yum remove git
> cd /usr/src
> wget https://www.kernel.org/pub/software/scm/git/git-2.0.4.tar.gz
> tar xzf git-2.0.4.tar.gz
> cd git-2.0.4
> make prefix=/usr/local/git all
> make prefix=/usr/local/git install
> # Single quotes keep $PATH unexpanded until /etc/bashrc is sourced.
> echo 'export PATH=$PATH:/usr/local/git/bin' >> /etc/bashrc
> source /etc/bashrc
> git --version
>
> # Install JDK 1.7
> cd /usr/src
> #wget http://download.oracle.com/otn-pub/java/jdk/7u79-b15/jdk-7u79-linux-x64.tar.gz?AuthParam=1438626626_b7fb864ed0343b3322bd003ced1e03f5
> #Download JDK 1.7
> mv jdk-7u79-linux-x64.tar.gz\?AuthParam\=1438626626_b7fb864ed0343b3322bd003ced1e03f5 jdk-7u79-linux-x64.tar.gz
> tar -xf jdk-7u79-linux-x64.tar.gz
> export JAVA_HOME=/usr/src/jdk1.7.0_79
>
> # Install Apache Maven
> wget ftp://mirror.reverse.net/pub/apache/maven/maven-3/3.3.3/binaries/apache-maven-3.3.3-bin.tar.gz
> tar -xf apache-maven-3.3.3-bin.tar.gz
> cd apache-maven-3.3.3
> export MAVEN_HOME=/usr/src/apache-maven-3.3.3
> echo 'export PATH=$PATH:/usr/src/apache-maven-3.3.3/bin' >> /etc/bashrc
> source /etc/bashrc
>
> git --version
> mvn -version
>
> Create a new user, zeppelin, and switch to it.
>
> *Zeppelin User*
> useradd zeppelin
> su - zeppelin
>
> Check out Zeppelin from GitHub.
>
> *Checkout Zeppelin*
>