Hi Tim, We are running Spark 1.1.0 with Hadoop 2.4. Mesos is in Version 0.20.1 all in binary releases.
The Spark console is running in default mode, which is fine grained. The Spark process is started from a physical Machine running Ubuntu, the Mesos nodes are running in VMs also in Ubuntu. This is the output of the Spark Shell: -------------------------------------------------------------------------------------------------------------------------------- Spark assembly has been built with Hive, including Datanucleus jars on classpath Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties 14/10/15 15:18:24 INFO SecurityManager: Changing view acls to: USERNAME, 14/10/15 15:18:24 INFO SecurityManager: Changing modify acls to: USERNAME, 14/10/15 15:18:24 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(USERNAME, ); users with modify permissions: Set(USERNAME, ) 14/10/15 15:18:24 INFO HttpServer: Starting HTTP Server 14/10/15 15:18:24 INFO Utils: Successfully started service 'HTTP class server' on port 42469. Welcome to ____ __ / __/__ ___ _____/ /__ _\ \/ _ \/ _ `/ __/ '_/ /___/ .__/\_,_/_/ /_/\_\ version 1.1.0 /_/ Using Scala version 2.10.4 (OpenJDK 64-Bit Server VM, Java 1.7.0_65) Type in expressions to have them evaluated. Type :help for more information. 14/10/15 15:18:26 WARN Utils: Your hostname, karwjohannes01 resolves to a loopback address: 127.0.1.1; using CLIENT_IP instead (on interface eth0) 14/10/15 15:18:26 WARN Utils: Set SPARK_LOCAL_IP if you need to bind to another address 14/10/15 15:18:27 INFO SecurityManager: Changing view acls to: USERNAME, 14/10/15 15:18:27 INFO SecurityManager: Changing modify acls to: USERNAME, 14/10/15 15:18:27 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(USERNAME, ); users with modify permissions: Set(USERNAME, ) 14/10/15 15:18:27 INFO Slf4jLogger: Slf4jLogger started 14/10/15 15:18:27 INFO Remoting: Starting remoting 14/10/15 15:18:27 INFO Remoting: Remoting started; listening on addresses :[akka.tcp://sparkDriver@CLIENT_IP:51879] 14/10/15 15:18:27 INFO Remoting: Remoting now listens on addresses: [akka.tcp://sparkDriver@CLIENT_IP:51879] 14/10/15 15:18:27 INFO Utils: Successfully started service 'sparkDriver' on port 51879. 14/10/15 15:18:27 INFO SparkEnv: Registering MapOutputTracker 14/10/15 15:18:27 INFO SparkEnv: Registering BlockManagerMaster 14/10/15 15:18:27 INFO DiskBlockManager: Created local directory at /tmp/spark-local-20141015151827-1a2e 14/10/15 15:18:27 INFO Utils: Successfully started service 'Connection manager for block manager' on port 60963. 14/10/15 15:18:27 INFO ConnectionManager: Bound socket to port 60963 with id = ConnectionManagerId(CLIENT_IP,60963) 14/10/15 15:18:27 INFO MemoryStore: MemoryStore started with capacity 265.4 MB 14/10/15 15:18:27 INFO BlockManagerMaster: Trying to register BlockManager 14/10/15 15:18:27 INFO BlockManagerMasterActor: Registering block manager CLIENT_IP:60963 with 265.4 MB RAM 14/10/15 15:18:27 INFO BlockManagerMaster: Registered BlockManager 14/10/15 15:18:27 INFO HttpFileServer: HTTP File server directory is /tmp/spark-b032c76c-93e1-473e-802c-c55e12e85d41 14/10/15 15:18:27 INFO HttpServer: Starting HTTP Server 14/10/15 15:18:27 INFO Utils: Successfully started service 'HTTP file server' on port 47989. 14/10/15 15:18:27 INFO Utils: Successfully started service 'SparkUI' on port 4040. 14/10/15 15:18:27 INFO SparkUI: Started SparkUI at http://CLIENT_IP:4040 14/10/15 15:18:27 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable I1015 15:18:28.524736 4748 sched.cpp:139] Version: 0.20.1 I1015 15:18:28.527180 4750 sched.cpp:235] New master detected at master@MESOS_MASTER_IP:5050 I1015 15:18:28.527300 4750 sched.cpp:243] No credentials provided. Attempting to register without authentication -------------------------------------------------------------------------------------------------------------------------------- Mesos master WARNING log: W1015 14:13:00.235213 1118 master.cpp:3452] Master returning resources offered to framework 20141007-102213-343139338-5050-1037-3490 because the framework has terminated or is inactive W1015 14:13:35.244055 1121 master.cpp:3452] Master returning resources offered to framework 20141007-102213-343139338-5050-1037-3525 because the framework has terminated or is inactive W1015 14:13:50.252436 1121 master.cpp:3452] Master returning resources offered to framework 20141007-102213-343139338-5050-1037-3540 because the framework has terminated or is inactive W1015 14:14:05.252708 1117 master.cpp:3452] Master returning resources offered to framework 20141007-102213-343139338-5050-1037-3555 because the framework has terminated or is inactive Mesos slave WARNING log : W1015 13:58:19.103196 1211 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3116 W1015 13:58:20.104650 1210 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3117 W1015 13:58:21.119839 1211 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3118 W1015 13:58:22.115965 1210 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3119 W1015 13:58:23.104925 1211 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3120 W1015 13:58:24.104652 1210 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3121 W1015 13:58:59.853744 1212 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3122 W1015 13:59:00.853086 1214 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3123 W1015 13:59:01.853137 1212 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3124 W1015 13:59:03.318259 1214 slave.cpp:1421] Cannot shut down unknown framework 20141007-102213-343139338-5050-1037-3029 I hope this information helps, please ask if you have any more questions and thank you for your help! Johannes From: Tim St Clair [mailto:tstcl...@redhat.com] Sent: Mittwoch, 15. Oktober 2014 15:11 To: user@mesos.apache.org Subject: Re: Connecting spark from a different Machine to mesos cluster Details? 1. What versions are you running? 2. Fine grained mode or Course Gained? 3. Are you running in VM's? Logs always help too. Cheers, Tim ________________________________ From: "Johannes Schillinger (Intern)" <johannes.schillin...@citrix.com<mailto:johannes.schillin...@citrix.com>> To: user@mesos.apache.org<mailto:user@mesos.apache.org> Sent: Wednesday, October 15, 2014 7:42:36 AM Subject: Connecting spark from a different Machine to mesos cluster Hi, we are currently trying to get a mesos cluster running as a base for Spark. The mesos cluster itself runs and connecting a spark shell from the machine the maser runs on works perfectly. We can see the Framework being started and the slaves working. If we try to connect the exact same shell from a different machine to the exact same cluster the screen stops at … 4013 sched.cpp:243] No credentials provided. Attempting to register without authentication The cluster spins up a framework every two seconds with a new ID and stops it immediately. This continues (we stopped it after a few dozen starts). We can see the frameworks being started in the master- and slave-logs as well as the command of the master to terminate it. Has anyone ever encountered a similar problem or has any advice on solving this problem? Thanks! Johannes -- Cheers, Timothy St. Clair Red Hat Inc.