[ 
https://issues.apache.org/jira/browse/SLIDER-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15595869#comment-15595869
 ] 

Manoj Samel edited comment on SLIDER-1169 at 10/21/16 6:04 PM:
---------------------------------------------------------------

Hi [~gsaha], I applied the patch to version 0.80 and it had addressed the 
quorum issue

One question - The patch change only one java file AbstractActionArgs.java. 
After the change, the build created new slider-core-0.80.0-incubating.jar and 
new slider-agent.tar.gz. The tar slider-agent.tar.gz seem to contain only 
python files so not sure why the patch would rebuild slider-agent.tar.gz. 

I have tested patch by deploying only the new slider-core-0.80.0-incubating.jar 
on existing .80 cluster; do the new slider-agent.tar.gz should also be deployed 
?


was (Author: manojsamel):
Hi [~gsaha], I applied the patch to version 0.80 and it had addressed the 
quorum issue

One question - The patch change only one java file AbstractActionArgs.java. 
After the change, the build created new slider-core-0.80.0-incubating.jar and 
new slider-agent.tar.gz. The tar slider-agent.tar.gz seem to contain only 
python files so not sure why the patch would rebuild slider-agent.tar.gz. 

I have tested but deploying only the new slider-core-0.80.0-incubating.jar on 
existing cluster; do the new slider-agent.tar.gz should also be deployed ?

> Slider not honoring zookeeper quorum values passed
> --------------------------------------------------
>
>                 Key: SLIDER-1169
>                 URL: https://issues.apache.org/jira/browse/SLIDER-1169
>             Project: Slider
>          Issue Type: Bug
>          Components: appmaster
>    Affects Versions: Slider 0.91
>         Environment: RHEL-6 (64 Bit)
>            Reporter: Sandeep Nemuri
>            Assignee: Daniel Murnane
>            Priority: Critical
>             Fix For: Slider 1.0.0
>
>         Attachments: SLIDER-1169.patch
>
>
> *PROBLEM* :
> Customer has created a Slider App by passing zookeeper quorum using below 
> command : 
> {code}
> slider create test --template appConfig.json --resources resources.json 
> --zkhosts 
> sandy234new1.hwxblr.com:2181,sandy234new3.hwxblr.com:2181,sandy234new2.hwxblr.com:2181
> {code}
> Below is the application log, which show us that it only picks the 1st 
> zookeeper.
> {code}
> 2016-09-15 15:44:29,052 [main] INFO  appmaster.SliderAppMaster - Loading 
> slider-server.xml at 
> file:/hadoop/yarn/local/usercache/root/appcache/application_1473930641993_0005/container_e04_1473930641993_0005_01_000001/confdir/slider-server.xml
> 2016-09-15 15:44:29,077 [main] INFO  appmaster.SliderAppMaster - AM 
> configuration:
> hadoop.registry.zk.quorum=sandy234new1.hwxblr.com:2181
> hadoop.registry.zk.root=/registry
> yarn.resourcemanager.scheduler.address=0.0.0.0:8030
> {code}
> *BUSINESS IMPACT* : Slider throws exceptions when 1st zookeeper goes down 
> (Since it only picks 1st zookeeper) and this is impacting the AM.
> *STEPS TO REPRODUCE*: 
> Launch a Hbase app using step 1 & 2.
> 1) slider create test --template appConfig.json --resources resources.json 
> --zkhosts 
> sandy234new1.hwxblr.com:2181,sandy234new3.hwxblr.com:2181,sandy234new2.hwxblr.com:2181
> This will launch a application in RM. 
> From the RM UI --> application -> logs 
> first line will be as below : 
> {code}
> 2016-09-15 15:44:29,052 [main] INFO  appmaster.SliderAppMaster - Loading 
> slider-server.xml at 
> file:/hadoop/yarn/local/usercache/root/appcache/application_1473930641993_0005/container_e04_1473930641993_0005_01_000001/confdir/slider-server.xml
> 2016-09-15 15:44:29,077 [main] INFO  appmaster.SliderAppMaster - AM 
> configuration:
> hadoop.registry.zk.quorum=sandy234new1.hwxblr.com:2181
> hadoop.registry.zk.root=/registry
> yarn.resourcemanager.scheduler.address=0.0.0.0:8030
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to