[jira] [Created] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-20 Thread Liu You (JIRA)
Liu You created MYRIAD-255:
--

 Summary: unable to run yarn application with hadoop2.7.3 and 
mesos1.2.0
 Key: MYRIAD-255
 URL: https://issues.apache.org/jira/browse/MYRIAD-255
 Project: Myriad
  Issue Type: Bug
  Components: Executor, Scheduler
Affects Versions: Myriad 0.2.0
 Environment: Ubuntu 14.04.3 LTS
master hostname: s18
slaves hostname: s19, s20, s21
hadoop 2.7.3
mesos 1.2.0
Reporter: Liu You
 Attachments: mapred-site.xml, 
mesos-master.s18.mesos.log.INFO.20170720-173553.21812, 
mesos-master.s18.mesos.log.WARNING.20170720-173553.21812, Mesos Web UI.png, 
myriad-config-default.yml, myriad ui-1.png, myriad ui-2.png, stucked job.png, 
yarn-mesos-resourcemanager-s18.log, yarn-mesos-resourcemanager-s18.out, 
yarn-site.xml

I can't run a YARN application: the job gets stuck every time I execute the 
command 'hadoop jar xxx.'. Please help me. Thank you very much.

!stucked job.png|thumbnail!

Myriad Web UI:
!myriad ui-1.png|thumbnail!
!myriad ui-2.png|thumbnail!

Mesos Web UI:
!Mesos Web UI.png|thumbnail!

yarn-mesos-resourcemanager-s18.log:

{code:java}
2017-07-20 17:36:46,941 INFO org.apache.hadoop.ipc.Server: IPC Server Responder: starting
2017-07-20 17:36:46,941 INFO org.apache.hadoop.ipc.Server: IPC Server listener on 8032: starting
2017-07-20 17:36:46,955 INFO org.apache.myriad.scheduler.event.handlers.ExecutorLostEventHandler: Executor value: "myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O08e5e7115-060b-42c6-b24f-64642048cfa1-S0"
 of slave value: "8e5e7115-060b-42c6-b24f-64642048cfa1-S0"
 lost with exit status: 9
2017-07-20 17:36:46,969 INFO org.apache.myriad.scheduler.event.handlers.StatusUpdateEventHandler: Status Update for task: nm.medium.ddd2cdab-db6b-4dbc-9d5d-8c339c056898 | state: TASK_FAILED
2017-07-20 17:36:46,970 INFO org.apache.hadoop.yarn.server.resourcemanager.ResourceManager: Transitioned to active state
2017-07-20 17:36:46,971 INFO org.apache.myriad.scheduler.event.handlers.StatusUpdateEventHandler: Marked as pending failed task with id value: "nm.medium.ddd2cdab-db6b-4dbc-9d5d-8c339c056898"
{code}
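For anyone triaging a longer copy of this log, the following is a minimal sketch of how the task state changes could be pulled out of it. It assumes the line format seen in the excerpt above (a `yyyy-mm-dd hh:mm:ss,SSS` timestamp at the start of each record and the `Status Update for task: ... | state: ...` message). The helper names `join_wrapped` and `task_states` are hypothetical and are not part of Myriad or Hadoop.

```python
import re

# Assumption: every log record starts with a timestamp like "2017-07-20 17:36:46,969 ".
TIMESTAMP = re.compile(r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3} ")
# Assumption: status updates are logged as "Status Update for task: <id> | state: <STATE>".
STATUS = re.compile(r"Status Update for task: (\S+) \| state: (\w+)")


def join_wrapped(lines):
    """Merge continuation lines (those without a leading timestamp) into one record."""
    records = []
    for line in lines:
        if TIMESTAMP.match(line) or not records:
            records.append(line.strip())
        else:
            records[-1] += " " + line.strip()
    return records


def task_states(lines):
    """Return (task_id, state) pairs for every status update found in the log lines."""
    found = []
    for record in join_wrapped(lines):
        match = STATUS.search(record)
        if match:
            found.append(match.groups())
    return found


if __name__ == "__main__":
    sample = [
        "2017-07-20 17:36:46,969 INFO ",
        "org.apache.myriad.scheduler.event.handlers.StatusUpdateEventHandler: Status ",
        "Update for task: nm.medium.ddd2cdab-db6b-4dbc-9d5d-8c339c056898 | state: ",
        "TASK_FAILED",
    ]
    print(task_states(sample))
```

Joining the wrapped lines first matters here because the mail archive splits a single record across several lines, so a per-line search would miss the state.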

myriad-config-default.yml:

{code:java}
mesosMaster: 192.168.130.18:5050
checkpoint: false
frameworkFailoverTimeout: 4320
frameworkName: MyriadAlpha
frameworkRole: "yarn"
frameworkUser: mesos # User the Node Manager runs as, required if nodeManagerURI set, otherwise defaults to the user
                     # running the resource manager.
frameworkSuperUser: root  # To be deprecated, currently permissions need set by a superuser due to Mesos-1790.  Must be
                          # root or have passwordless sudo. Required if nodeManagerURI set, ignored otherwise.
#nativeLibrary: /usr/local/lib/libmesos.so
nativeLibrary: /home/mesos/mesos/mesos-1.2.0/mesos_install/lib/libmesos.so
zkServers: 192.168.130.18:2181
zkTimeout: 2
restApiPort: 8192
#servedConfigPath: dist/config.tgz
servedConfigPath: /home/mesos/mesos/hdfs/hadoop-2.7.3/etc/config.tgz
#servedBinaryPath: dist/hadoop-2.6.0.tgz
servedBinaryPath: /home/mesos/mesos/hdfs/binary.tgz
profiles:
  zero:  # NMs launched with this profile dynamically obtain cpu/mem from Mesos
    cpu: 0
    mem: 0
  small:
    cpu: 2
    mem: 2048
  medium:
    cpu: 4
    mem: 4096
  large:
    cpu: 10
    mem: 12288
nmInstances: # NMs to start with. Requires at least 1 NM with a non-zero profile.
  medium: 1 #
rebalancer: false
haEnabled: false
nodemanager:
  jvmMaxMemoryMB: 1024
  cpus: 0.2
  cgroups: false
executor:
  jvmMaxMemoryMB: 256
  path: hdfs://s18:8020/dist/myriad-executor-0.2.0.jar
  #path: file:///home/mesos/mesos/hdfs/hadoop-2.7.3/share/hadoop/yarn/lib/myriad-executor-0.2.0.jar
  #The following should be used for a remotely distributed URI, hdfs assumed but other URI types valid.
  #nodeManagerUri: hdfs://s18:8020/user/MR/binary.tgz
  #configUri: http://s18:8192/api/artifacts/config.tgz
  #jvmUri: https://downloads.mycompany.com/java/jre-7u76-linux-x64.tar.gz
yarnEnvironment:
  YARN_HOME: /home/mesos/mesos/hdfs/hadoop-2.7.3
  #HADOOP_CONF_DIR=config
  #HADOOP_TMP_DIR=$MESOS_SANDBOX
  #YARN_HOME: hadoop-2.7.0 #this should be relative if nodeManagerUri is set
  #JAVA_HOME: /usr/lib/jvm/java-default #System dependent, but sometimes necessary
  #JAVA_HOME: /home/mesos/jdk/jdk1.8.0_131
  #JAVA_HOME: jre1.7.0_76 # Path to JRE distribution, relative to sandbox directory
  #JAVA_LIBRARY_PATH: /opt/mycompany/lib
#mesosAuthenticationPrincipal:
#mesosAuthenticationSecretFilename:
{code}
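The `nmInstances` comment in the config says at least one NodeManager must start with a non-zero profile. A quick sanity check of that rule could look like the sketch below. The dictionaries are copied by hand from the `profiles` and `nmInstances` sections above, and the helper name `nonzero_instances` is hypothetical. This is not how Myriad itself validates its configuration.

```python
# Values copied from the "profiles" and "nmInstances" sections of the config above.
PROFILES = {
    "zero": {"cpu": 0, "mem": 0},
    "small": {"cpu": 2, "mem": 2048},
    "medium": {"cpu": 4, "mem": 4096},
    "large": {"cpu": 10, "mem": 12288},
}
NM_INSTANCES = {"medium": 1}


def nonzero_instances(profiles, nm_instances):
    """List the started profiles (count > 0) whose cpu and mem are both non-zero."""
    usable = []
    for name, count in nm_instances.items():
        profile = profiles[name]  # raises KeyError if nmInstances names an undefined profile
        if count > 0 and profile["cpu"] > 0 and profile["mem"] > 0:
            usable.append(name)
    return usable


if __name__ == "__main__":
    # An empty list would mean the "at least 1 NM with a non-zero profile" rule is violated.
    print(nonzero_instances(PROFILES, NM_INSTANCES))
```

With the values in this report the rule appears to be satisfied (`medium` with one instance), so the profile settings are probably not the cause of the failure.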

yarn-site.xml:

{code:java}
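<property>
<name>yarn.resourcemanager.hostname</name>
<value>s18</value>
</property>

<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>

<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle,myriad_executor</value>
</property>
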
yarn.nodemanager.aux-services.mapreduce_shuffle.class
or

[jira] [Updated] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-20 Thread Liu You (JIRA)

 [ 
https://issues.apache.org/jira/browse/MYRIAD-255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu You updated MYRIAD-255:
---
Description: 
I can't run a YARN application: the job gets stuck every time I execute the 
command 'hadoop jar xxx.'. Please help me. Thank you very much.

!stucked job.png|thumbnail!

Myriad Web UI:

!myriad ui-1.png|thumbnail!

!myriad ui-2.png|thumbnail!

Mesos Web UI:
!Mesos Web UI.png|thumbnail!


yarn-site.xml:

{code:java}
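<property>
<name>yarn.resourcemanager.hostname</name>
<value>s18</value>
</property>

<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>

<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle,myriad_executor</value>
</property>

<property>
<name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.myriad_executor.class</name>
<value>org.apache.myriad.executor.MyriadExecutorAuxService</value>
</property>
<property>
<name>yarn.nm.liveness-monitor.expiry-interval-ms</name>
<value>2000</value>
</property>
<property>
<name>yarn.am.liveness-monitor.expiry-interval-ms</name>
<value>1</value>
</property>
<property>
<name>yarn.resourcemanager.nm.liveness-monitor.interval-ms</name>
<value>1000</value>
</property>

<property>
<name>yarn.nodemanager.resource.cpu-vcores</name>
<value>${nodemanager.resource.cpu-vcores}</value>
</property>
<property>
<name>yarn.nodemanager.resource.memory-mb</name>
<value>${nodemanager.resource.memory-mb}</value>
</property>

<property>
  <name>yarn.nodemanager.pmem-check-enabled</name>
  <value>false</value>
</property>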

[jira] [Updated] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-20 Thread Liu You (JIRA)

 [ 
https://issues.apache.org/jira/browse/MYRIAD-255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu You updated MYRIAD-255:
---
Description: 
I can't run a YARN application: the job gets stuck every time I execute the 
command 'hadoop jar xxx.'. 

Please help me. Thank you very much.

!stucked job.png|thumbnail!

Myriad Web UI:

!myriad ui-1.png|thumbnail!

!myriad ui-2.png|thumbnail!

Mesos Web UI:
!Mesos Web UI.png|thumbnail!


[jira] [Updated] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-20 Thread Liu You (JIRA)

 [ 
https://issues.apache.org/jira/browse/MYRIAD-255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu You updated MYRIAD-255:
---
Description: 
I can't run a YARN application: the job gets stuck every time I execute the 
command 'hadoop jar xxx.'. 

Please help me. Thank you very much.

The Myriad Web UI shows that the status of the task changes from staging to 
pending, then staging, then pending. 

There are some screenshots in the attachments below.


[jira] [Updated] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-20 Thread Liu You (JIRA)

 [ 
https://issues.apache.org/jira/browse/MYRIAD-255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu You updated MYRIAD-255:
---
Description: 
I can't run a YARN application: the job gets stuck every time I execute the 
command 'hadoop jar xxx.'. 

The Myriad Web UI shows that the status of the task changes from staging to 
pending, then staging, then pending. 

There are some screenshots in the attachments below.

Please help me. Thank you very much.


[jira] [Commented] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-20 Thread Liu You (JIRA)

[ 
https://issues.apache.org/jira/browse/MYRIAD-255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16095683#comment-16095683
 ] 

Liu You commented on MYRIAD-255:


Thanks for your reply.
Strangely, the stdout and stderr files in the task logs in the Mesos UI are 
empty (0 bytes, no content).
I read some of the Mesos log files:

File: mesos-agent.ERROR

E0720 17:36:52.323874 23854 fetcher.cpp:558] Failed to run mesos-fetcher: Failed to chown '/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/5b12e719-6990-4546-9370-b984fc35b759-/executors/myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/7506cfd4-5777-4a02-9e92-5e566ae30dec/stdout' to user 'root' : Operation not permitted
E0720 17:36:52.324268 23856 slave.cpp:4650] Container '7506cfd4-5777-4a02-9e92-5e566ae30dec' for executor 'myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0' of framework 5b12e719-6990-4546-9370-b984fc35b759- failed to start: Failed to chown '/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/5b12e719-6990-4546-9370-b984fc35b759-/executors/myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/7506cfd4-5777-4a02-9e92-5e566ae30dec/stdout' to user 'root' : Operation not permitted


File: mesos-agent.WARNING

Running on machine: s19
Log line format: [IWEF]mmdd hh:mm:ss.uu threadid file:line] msg
W0721 10:07:42.287765 25785 backend.cpp:76] Failed to create 'aufs' backend: AufsBackend requires root privileges
W0721 10:07:42.287967 25785 backend.cpp:76] Failed to create 'bind' backend: BindBackend requires root privileges
W0721 10:08:26.406080 25789 paths.cpp:555] Failed to chown executor directory '/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/14a83b03-a84d-4cd2-99a9-01c553b3dfb6-/executors/myriad_executor14a83b03-a84d-4cd2-99a9-01c553b3dfb6-14a83b03-a84d-4cd2-99a9-01c553b3dfb6-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/e8ed1280-4dd9-41b1-88ac-2244a49efab4'. This may be due to attempting to run the executor as a nonexistent user on the agent; see the description for the `--switch_user` flag for more information: Operation not permitted
E0721 10:08:26.431746 25793 fetcher.cpp:558] Failed to run mesos-fetcher: Failed to chown '/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/14a83b03-a84d-4cd2-99a9-01c553b3dfb6-/executors/myriad_executor14a83b03-a84d-4cd2-99a9-01c553b3dfb6-14a83b03-a84d-4cd2-99a9-01c553b3dfb6-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/e8ed1280-4dd9-41b1-88ac-2244a49efab4/stdout' to user 'root' : Operation not permitted
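
The messages above ("requires root privileges", "Failed to chown ... to user 'root' : Operation not permitted") suggest, though do not prove, that the agent process is not running as root while the sandbox is supposed to be handed over to user 'root'. A small diagnostic sketch that could be run on an agent host is shown below. The helper names `user_exists` and `diagnose` are hypothetical, and the check only models the ordinary Unix rule that a non-root process cannot chown files to a different user; it ignores finer-grained capabilities.

```python
import os
import pwd


def user_exists(name):
    """Return True if the account exists in the local user database."""
    try:
        pwd.getpwnam(name)
        return True
    except KeyError:
        return False


def diagnose(target_user, agent_euid):
    """Explain whether a process with agent_euid could chown a sandbox to target_user."""
    if not user_exists(target_user):
        return "user '%s' does not exist on this host" % target_user
    target_uid = pwd.getpwnam(target_user).pw_uid
    if agent_euid != 0 and target_uid != agent_euid:
        return "agent is not root, so it cannot chown the sandbox to another user"
    return "ok"


if __name__ == "__main__":
    # Run as the same account that starts mesos-agent; 'root' is the user named in the log.
    print(diagnose("root", os.geteuid()))
```

If this prints the "agent is not root" message for the account that starts the agent, the options worth checking are running the agent as root or reviewing the `--switch_user` flag mentioned in the warning.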



[jira] [Comment Edited] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-20 Thread Liu You (JIRA)

[ 
https://issues.apache.org/jira/browse/MYRIAD-255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16095683#comment-16095683
 ] 

Liu You edited comment on MYRIAD-255 at 7/21/17 2:29 AM:
-

[~darinj] Thanks for your reply.
Strangely, the stdout and stderr files in the task logs in the Mesos UI are 
empty (0 bytes, no content).
I read some of the Mesos log files:

File: mesos-agent.ERROR

E0720 17:36:52.323874 23854 fetcher.cpp:558] Failed to run mesos-fetcher: Failed to chown '/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/5b12e719-6990-4546-9370-b984fc35b759-/executors/myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/7506cfd4-5777-4a02-9e92-5e566ae30dec/stdout' to user 'root' : Operation not permitted
E0720 17:36:52.324268 23856 slave.cpp:4650] Container '7506cfd4-5777-4a02-9e92-5e566ae30dec' for executor 'myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0' of framework 5b12e719-6990-4546-9370-b984fc35b759- failed to start: Failed to chown '/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/5b12e719-6990-4546-9370-b984fc35b759-/executors/myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/7506cfd4-5777-4a02-9e92-5e566ae30dec/stdout' to user 'root' : Operation not permitted


File: mesos-agent.WARNING

Running on machine: s19
Log line format: [IWEF]mmdd hh:mm:ss.uu threadid file:line] msg
W0721 10:07:42.287765 25785 backend.cpp:76] Failed to create 'aufs' backend: AufsBackend requires root privileges
W0721 10:07:42.287967 25785 backend.cpp:76] Failed to create 'bind' backend: BindBackend requires root privileges
W0721 10:08:26.406080 25789 paths.cpp:555] Failed to chown executor directory '/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/14a83b03-a84d-4cd2-99a9-01c553b3dfb6-/executors/myriad_executor14a83b03-a84d-4cd2-99a9-01c553b3dfb6-14a83b03-a84d-4cd2-99a9-01c553b3dfb6-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/e8ed1280-4dd9-41b1-88ac-2244a49efab4'. This may be due to attempting to run the executor as a nonexistent user on the agent; see the description for the `--switch_user` flag for more information: Operation not permitted
E0721 10:08:26.431746 25793 fetcher.cpp:558] Failed to run mesos-fetcher: Failed to chown '/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/14a83b03-a84d-4cd2-99a9-01c553b3dfb6-/executors/myriad_executor14a83b03-a84d-4cd2-99a9-01c553b3dfb6-14a83b03-a84d-4cd2-99a9-01c553b3dfb6-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/e8ed1280-4dd9-41b1-88ac-2244a49efab4/stdout' to user 'root' : Operation not permitted




[jira] [Issue Comment Deleted] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-20 Thread Liu You (JIRA)

 [ 
https://issues.apache.org/jira/browse/MYRIAD-255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu You updated MYRIAD-255:
---
Comment: was deleted

(was: [~darinj] Thanks for your reply.
Strangely, the stdout and stderr from the task logs in the Mesos UI are empty: 
0 size and no content.
I read some log files related to mesos:

File: mesos-agent.ERROR

E0720 17:36:52.323874 23854 fetcher.cpp:558] Failed to run mesos-fetcher: 
Failed to chown 
'/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/5b12e719-6990-4546-9370-b984fc35b759-/executors/myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/7506cfd4-5777-4a02-9e92-5e566ae30dec/stdout'
 to user 'root' : Operation not permitted
E0720 17:36:52.324268 23856 slave.cpp:4650] Container 
'7506cfd4-5777-4a02-9e92-5e566ae30dec' for executor 
'myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0'
 of framework 5b12e719-6990-4546-9370-b984fc35b759- failed to start: Failed 
to chown 
'/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/5b12e719-6990-4546-9370-b984fc35b759-/executors/myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/7506cfd4-5777-4a02-9e92-5e566ae30dec/stdout'
 to user 'root' : Operation not permitted


File: mesos-agent.WARNING

Running on machine: s19
Log line format: [IWEF]mmdd hh:mm:ss.uu threadid file:line] msg
W0721 10:07:42.287765 25785 backend.cpp:76] Failed to create 'aufs' backend: 
AufsBackend requires root privileges
W0721 10:07:42.287967 25785 backend.cpp:76] Failed to create 'bind' backend: 
BindBackend requires root privileges
W0721 10:08:26.406080 25789 paths.cpp:555] Failed to chown executor directory 
'/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/14a83b03-a84d-4cd2-99a9-01c553b3dfb6-/executors/myriad_executor14a83b03-a84d-4cd2-99a9-01c553b3dfb6-14a83b03-a84d-4cd2-99a9-01c553b3dfb6-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/e8ed1280-4dd9-41b1-88ac-2244a49efab4'.
 This may be due to attempting to run the executor as a nonexistent user on the 
agent; see the description for the `--switch_user` flag for more information: 
Operation not permitted
E0721 10:08:26.431746 25793 fetcher.cpp:558] Failed to run mesos-fetcher: Failed to 
chown 
'/home/mesos/mesos/mesos-1.2.0/mesos_install/mesos_work/slaves/ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/frameworks/14a83b03-a84d-4cd2-99a9-01c553b3dfb6-/executors/myriad_executor14a83b03-a84d-4cd2-99a9-01c553b3dfb6-14a83b03-a84d-4cd2-99a9-01c553b3dfb6-O8ce5a20f6-7e1f-4af2-9061-f07859b40bbe-S0/runs/e8ed1280-4dd9-41b1-88ac-2244a49efab4/stdout'
 to user 'root' : Operation not permitted
)


[jira] [Commented] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-20 Thread Liu You (JIRA)

[ 
https://issues.apache.org/jira/browse/MYRIAD-255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16095707#comment-16095707
 ] 

Liu You commented on MYRIAD-255:


[~darinj] Thanks for your reply.

It seems that Myriad needs to run with root privileges. I added the current 
user to sudoers, and it works now.

Thanks again.
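
For reference, the "Failed to chown ... to user 'root' : Operation not permitted" errors in the agent logs are what you would expect when the process doing the chown is not running as root. Below is a rough Python sketch of a pre-flight check along those lines, assuming a Linux agent host. The function name can_chown_to_other_users is hypothetical, and the check is only an approximation: it looks at the effective UID and ignores Linux capabilities such as CAP_CHOWN.

```python
import os
from typing import Optional


def can_chown_to_other_users(euid: Optional[int] = None) -> bool:
    """Return True when the effective UID is 0 (root).

    Approximation only: a non-root process holding CAP_CHOWN could also
    change file ownership, and this sketch does not detect that case.
    """
    if euid is None:
        euid = os.geteuid()
    return euid == 0


if __name__ == "__main__":
    if can_chown_to_other_users():
        print("running as root: chown to another user should be permitted")
    else:
        print("not running as root: chown to another user will likely fail")
```

Running this as the same user that starts the agent gives a quick hint as to whether the sandbox chown step can succeed.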


[jira] [Closed] (MYRIAD-255) unable to run yarn application with hadoop2.7.3 and mesos1.2.0

2017-07-21 Thread Liu You (JIRA)

 [ 
https://issues.apache.org/jira/browse/MYRIAD-255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liu You closed MYRIAD-255.
--
Resolution: Not A Bug

> unable to run yarn application with hadoop2.7.3 and mesos1.2.0
> --
>
> Key: MYRIAD-255
> URL: https://issues.apache.org/jira/browse/MYRIAD-255
> Project: Myriad
>  Issue Type: Bug
>  Components: Executor, Scheduler
>Affects Versions: Myriad 0.2.0
> Environment: Ubuntu 14.04.3 LTS
> master hostname: s18
> slaves hostname: s19, s20, s21
> hadoop 2.7.3
> mesos 1.2.0
>Reporter: Liu You
> Attachments: mapred-site.xml, 
> mesos-master.s18.mesos.log.INFO.20170720-173553.21812, 
> mesos-master.s18.mesos.log.WARNING.20170720-173553.21812, Mesos Web UI.png, 
> myriad-config-default.yml, myriad ui-1.png, myriad ui-2.png, stucked job.png, 
> yarn-mesos-resourcemanager-s18.log, yarn-mesos-resourcemanager-s18.out, 
> yarn-site.xml
>
>
> I can't run a YARN application; the job gets stuck every time I execute 
> the command 'hadoop jar xxx.'. 
> The Myriad Web UI shows that the status of the task changes from staging to 
> pending, then staging, then pending. 
> There are some screenshots in the attachments below.
> Please help me. Thank you very much.
> yarn-mesos-resourcemanager-s18.log:
> {code:java}
> 2017-07-20 17:36:46,941 INFO org.apache.hadoop.ipc.Server: IPC Server 
> Responder: starting
> 2017-07-20 17:36:46,941 INFO org.apache.hadoop.ipc.Server: IPC Server 
> listener on 8032: starting
> 2017-07-20 17:36:46,955 INFO 
> org.apache.myriad.scheduler.event.handlers.ExecutorLostEventHandler: Executor 
> value: 
> "myriad_executor5b12e719-6990-4546-9370-b984fc35b759-5b12e719-6990-4546-9370-b984fc35b759-O08e5e7115-060b-42c6-b24f-64642048cfa1-S0"
>  of slave value: "8e5e7115-060b-42c6-b24f-64642048cfa1-S0"
>  lost with exit status: 9
> 2017-07-20 17:36:46,969 INFO 
> org.apache.myriad.scheduler.event.handlers.StatusUpdateEventHandler: Status 
> Update for task: nm.medium.ddd2cdab-db6b-4dbc-9d5d-8c339c056898 | state: 
> TASK_FAILED
> 2017-07-20 17:36:46,970 INFO 
> org.apache.hadoop.yarn.server.resourcemanager.ResourceManager: Transitioned 
> to active state
> 2017-07-20 17:36:46,971 INFO 
> org.apache.myriad.scheduler.event.handlers.StatusUpdateEventHandler: Marked 
> as pending failed task with id value: 
> "nm.medium.ddd2cdab-db6b-4dbc-9d5d-8c339c056898"
> {code}
> myriad-config-default.yml:
> {code:java}
> mesosMaster: 192.168.130.18:5050
> checkpoint: false
> frameworkFailoverTimeout: 4320
> frameworkName: MyriadAlpha
> frameworkRole: "yarn"
> frameworkUser: mesos # User the Node Manager runs as, required if nodeManagerURI set, otherwise defaults to the user
>                      # running the resource manager.
> frameworkSuperUser: root  # To be deprecated, currently permissions need set by a superuser due to Mesos-1790.  Must be
>                           # root or have passwordless sudo. Required if nodeManagerURI set, ignored otherwise.
> #nativeLibrary: /usr/local/lib/libmesos.so
> nativeLibrary: /home/mesos/mesos/mesos-1.2.0/mesos_install/lib/libmesos.so
> zkServers: 192.168.130.18:2181
> zkTimeout: 2
> restApiPort: 8192
> #servedConfigPath: dist/config.tgz
> servedConfigPath: /home/mesos/mesos/hdfs/hadoop-2.7.3/etc/config.tgz
> #servedBinaryPath: dist/hadoop-2.6.0.tgz
> servedBinaryPath: /home/mesos/mesos/hdfs/binary.tgz
> profiles:
>   zero:  # NMs launched with this profile dynamically obtain cpu/mem from Mesos
>     cpu: 0
>     mem: 0
>   small:
>     cpu: 2
>     mem: 2048
>   medium:
>     cpu: 4
>     mem: 4096
>   large:
>     cpu: 10
>     mem: 12288
> nmInstances: # NMs to start with. Requires at least 1 NM with a non-zero profile.
>   medium: 1 # 
> rebalancer: false
> haEnabled: false
> nodemanager:
>   jvmMaxMemoryMB: 1024
>   cpus: 0.2
>   cgroups: false
> executor:
>   jvmMaxMemoryMB: 256
>   path: hdfs://s18:8020/dist/myriad-executor-0.2.0.jar
>   #path: file:///home/mesos/mesos/hdfs/hadoop-2.7.3/share/hadoop/yarn/lib/myriad-executor-0.2.0.jar
>   #The following should be used for a remotely distributed URI, hdfs assumed but other URI types valid.
>   #nodeManagerUri: hdfs://s18:8020/user/MR/binary.tgz
>   #configUri: http://s18:8192/api/artifacts/config.tgz
>   #jvmUri: https://downloads.mycompany.com/java/jre-7u76-linux-x64.tar.gz
> yarnEnvironment:
>   YARN_HOME: /home/mesos/mesos/hdfs/hadoop-2.7.3
>   #HADOOP_CONF_DIR=config
>   #HADOOP_TMP_DIR=$MESOS_SANDBOX
>   #YARN_HOME: hadoop-2.7.0 #this should be relative if nodeManagerUri is set
>   #JAVA_HOME: /usr/lib/jvm/java-default #System dependent, but sometimes necessary
>   #JAVA_HOME: /home/mesos/jdk/jdk1.8.0_131
>   #JAVA_HOME: jre1.7.0_76 # Path to JRE distribution, relative to sandbox directory
>   #JAVA_LIBRARY_PATH