Re: spark mesos shuffle service failing under marathon

2015-11-07 Thread Klaus Ma
Can you share more logs? I used to start spark shuffle in Mesos + Marathon cluster; logs will be helpful to identify issues. Da (Klaus), Ma (马达) | PMP® | Advisory Software Engineer Platform Symphony/DCOS Development & Support, STG, IBM GCG +86-10-8245 4084 | klaus1982...@gmail.com |

Mesos and Zookeeper TCP keepalive

2015-11-07 Thread Jeremy Olexa
Hello all, We have been fighting some network/session disconnection issues between datacenters and I'm curious if there is anyway to enable tcp keepalive on the zookeeper/mesos sockets? If there was a way, then the sysctl tcp kernel settings would be used. I believe keepalive has to be

Re: spark mesos shuffle service failing under marathon

2015-11-07 Thread Timothy Chen
If you want to use Marathon start the mesos shuffle service, don't use the sbin script since it runs it as a daemon in background. Instead use spark-class script and run the MesosExternalShuffleService class directly so it runs in the foreground. Tim > On Nov 7, 2015, at 7:02 AM, Klaus Ma

Job Constraints from Marathon or Spark

2015-11-07 Thread Rodrick Brown
I have a few dozen marathon and spark jobs I would like to use constraints with across my slaves but I can never get this to work at all using the latest release 0.25.1 I have the following set on a few of my slaves. slaves[1-5] $ cat /etc/mesos-slave/attributes