Evans Ye created BIGTOP-1336:
--------------------------------

             Summary: Puppet recipes failed to deploy kerberos enabled hadoop 
cluster
                 Key: BIGTOP-1336
                 URL: https://issues.apache.org/jira/browse/BIGTOP-1336
             Project: Bigtop
          Issue Type: Bug
          Components: Deployment
    Affects Versions: 0.7.0
            Reporter: Evans Ye


Our puppet recipes are missing some dependency settings that are needed to get 
kerberos enabled on the hadoop cluster.

First, the kerberos principal for the hdfs user hasn't been created 
before the namenode is formatted, which causes namenode formatting to fail.
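
A minimal sketch of the ordering that appears to be missing, using Puppet's 
chaining arrow. {{Exec['namenode format']}} is the resource named in the 
namenode error in step 3-1; {{Kerberos::Host_keytab['hdfs']}} is only an assumed 
name for whatever resource creates the hdfs principal, and may not match the 
actual recipes.
{code}
# Assumption: Kerberos::Host_keytab['hdfs'] stands in for the resource that
# creates the hdfs principal; the real resource name may differ.
# The arrow makes Puppet apply the left-hand resource before the right-hand one.
Kerberos::Host_keytab['hdfs'] -> Exec['namenode format']
{code}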

Second, {{/etc/default/hadoop-hdfs-datanode}} isn't in place 
before the datanodes start, which causes the datanodes to fail to start up.
The datanode error log:
{noformat}
2014-06-16 15:10:10,711 FATAL org.apache.hadoop.hdfs.server.datanode.DataNode: 
Exception in secureMain
java.lang.RuntimeException: Cannot start secure cluster without privileged 
resources.
{noformat}
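A possible way to express this dependency, again as a sketch rather than the 
actual fix. It assumes {{/etc/default/hadoop-hdfs-datanode}} is managed as a 
File resource with that title; {{Service['hadoop-hdfs-datanode']}} is the 
resource named in the datanode error in step 3-2.
{code}
# Assumption: the defaults file is declared as a File resource with this
# title; adjust if the recipes declare it differently.
# The file must be in place before the datanode service is started.
File['/etc/default/hadoop-hdfs-datanode'] -> Service['hadoop-hdfs-datanode']
{code}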


Here are the steps to reproduce using 
[vagrant-puppet|https://github.com/apache/bigtop/tree/master/bigtop-deploy/vm/vagrant-puppet]:
1.) Enable kerberos on the hadoop cluster.
{noformat}
$ vim bigtop-deploy/vm/vagrant-puppet/provision.sh
{noformat}
Add kerberos definitions.
{noformat}
cat > /bigtop-puppet/config/site.csv << EOF
hadoop_head_node,$1
hadoop_storage_dirs,/data/1,/data/2
bigtop_yumrepo_uri,http://bigtop.s3.amazonaws.com/releases/0.7.0/redhat/6/x86_64
jdk_package_name,java-1.7.0-openjdk-devel.x86_64
components,hadoop,hbase,yarn,mapred-app
hadoop_security,kerberos
hadoop_kerberos_domain,vagrant
hadoop_kerberos_realm,BIGTOP.ORG
hadoop_kerberos_kdc_server,bigtop1.vagrant
EOF
{noformat}
2.) Spin up the cluster.
{code}
$ ./startup.sh --cluster
{code}
3-1.) Get an error while formatting the namenode.
{noformat}
err: /Stage[main]/Hadoop_head_node/Hadoop::Namenode[namenode]/Exec[namenode 
format]/returns: change from notrun to 0 failed: /bin/bash -c 'yes Y | hdfs 
namenode -format >> /var/lib/hadoop-hdfs/nn.format.log 2>&1' returned 1 instead 
of one of [0] at /tmp/vagrant-puppet-2/modules-0/hadoop/manifests/init.pp:361
{noformat}
3-2.) Get an error while starting up datanodes.
{noformat}
err: 
/Stage[main]/Hadoop_worker_node/Hadoop::Datanode[datanode]/Service[hadoop-hdfs-datanode]/ensure:
 change from stopped to running failed: Could not start 
Service[hadoop-hdfs-datanode]: Execution of '/sbin/service hadoop-hdfs-datanode 
start' returned 1:  at 
/tmp/vagrant-puppet-2/modules-0/hadoop/manifests/init.pp:158
{noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)
