Ted Yu created SLIDER-399:
-----------------------------

             Summary: Agent command doesn't consider the case where LOG_DIRS 
contains multiple directories
                 Key: SLIDER-399
                 URL: https://issues.apache.org/jira/browse/SLIDER-399
             Project: Slider
          Issue Type: Bug
            Reporter: Ted Yu


On a cluster where LOG_DIRS contains multiple directories, I saw the following 
in AM log:
{code}
14/09/04 21:43:48 INFO appmaster.RoleLaunchService: Starting container with 
command: python ./infra/agent/slider-agent/agent/main.py --label 
container_1407274006606_0097_01_000006___HBASE_THRIFT --zk-quorum 
hor10n07.gq1.ygridcore.net:2181 --zk-reg-path 
/registry/org-apache-slider/hbcluster27 > $LOG_DIRS/agent.out 2>&1 ; 
{code}
The above command resulted in:
{code}
Stack trace: ExitCodeException exitCode=1: /bin/bash: 
/grid/0/hdp/yarn/log/application_1407274006606_0097/container_1407274006606_0097_01_000002,/grid/1/hdp/yarn/log/application_1407274006606_0097/container_1407274006606_0097_01_000002,/grid/2/hdp/yarn/log/application_1407274006606_0097/container_1407274006606_0097_01_000002,/grid/3/hdp/yarn/log/application_1407274006606_0097/container_1407274006606_0097_01_000002,/grid/4/hdp/yarn/log/application_1407274006606_0097/container_1407274006606_0097_01_000002,/grid/5/hdp/yarn/log/application_1407274006606_0097/container_1407274006606_0097_01_000002/agent.out:
 No such file or directory
{code}
The cause seems to be that $LOG_DIRS is the concatenation of 6 directories. 
$LOG_DIRS/agent.out would not be accessible, leading to container launch 
failure.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to