What is the outpur of 'host s1.machine.org <http://s1.machine.org:7077/>' if you execute from your worker machine.
ping will work but if this does not work it implies DNS entry is present for this machine (s1.machine.org <http://s1.machine.org:7077/>) 2 alternatives could be: - add dns entry - start master with SPARK_MASTER_IP=<master ip addess> env variable set Thanks, Sourav On Fri, Feb 7, 2014 at 12:39 PM, Pillis W <[email protected]> wrote: > I have a "Connection Refused" error on the first worker (standalone > cluster - no YARN, Mesos). No firewalls, and can ping master-worker nodes > from the other. > > Master process started manually. It is running and can see Web UI at 8080. > > Using "spark-0.9.0-incubating-bin-hadoop2.tgz" > > =============================================== > spark-0.9.0-incubating-bin-hadoop2]$ ./bin/spark-class > org.apache.spark.deploy.worker.Worker spark://s1.machine.org:7077 > 14/02/07 07:00:58 INFO Utils: Using Spark's default log4j profile: > org/apache/spark/log4j-defaults.properties > 14/02/07 07:00:58 WARN Utils: Your hostname, s2.machine.org resolves to a > loopback address: 127.0.0.1; using 192.168.64.122 instead (on interface > eth1) > 14/02/07 07:00:58 WARN Utils: Set SPARK_LOCAL_IP if you need to bind to > another address > 14/02/07 07:00:59 INFO Slf4jLogger: Slf4jLogger started > 14/02/07 07:00:59 INFO Remoting: Starting remoting > 14/02/07 07:00:59 INFO Remoting: Remoting started; listening on addresses > :[akka.tcp://sparkWorker@s2:49614] > 14/02/07 07:00:59 INFO Worker: Starting Spark worker s2:49614 with 1 > cores, 853.0 MB RAM > 14/02/07 07:00:59 INFO Worker: Spark home: > /home/vagrant/spark-0.9.0-incubating-bin-hadoop2 > 14/02/07 07:00:59 INFO WorkerWebUI: Started Worker web UI at > http://s2:8081 > 14/02/07 07:00:59 INFO Worker: Connecting to master > spark://s1.machine.org:7077... > 14/02/07 07:00:59 ERROR EndpointWriter: AssociationError > [akka.tcp://sparkWorker@s2:49614] -> [akka.tcp:// > [email protected]:7077]: Error [Association failed with > [akka.tcp://[email protected]:7077]] [ > akka.remote.EndpointAssociationException: Association failed with > [akka.tcp://[email protected]:7077] > Caused by: > akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2: > Connection refused: s1.machine.org/192.168.64.121:7077 > ] > 14/02/07 07:00:59 ERROR EndpointWriter: AssociationError > [akka.tcp://sparkWorker@s2:49614] -> [akka.tcp:// > [email protected]:7077]: Error [Association failed with > [akka.tcp://[email protected]:7077]] [ > akka.remote.EndpointAssociationException: Association failed with > [akka.tcp://[email protected]:7077] > Caused by: > akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2: > Connection refused: s1.machine.org/192.168.64.121:7077 > ] > 14/02/07 07:00:59 ERROR EndpointWriter: AssociationError > [akka.tcp://sparkWorker@s2:49614] -> [akka.tcp:// > [email protected]:7077]: Error [Association failed with > [akka.tcp://[email protected]:7077]] [ > akka.remote.EndpointAssociationException: Association failed with > [akka.tcp://[email protected]:7077] > Caused by: > akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2: > Connection refused: s1.machine.org/192.168.64.121:7077 > ] > 14/02/07 07:00:59 ERROR EndpointWriter: AssociationError > [akka.tcp://sparkWorker@s2:49614] -> [akka.tcp:// > [email protected]:7077]: Error [Association failed with > [akka.tcp://[email protected]:7077]] [ > akka.remote.EndpointAssociationException: Association failed with > [akka.tcp://[email protected]:7077] > Caused by: > akka.remote.transport.netty.NettyTransport$$anonfun$associate$1$$anon$2: > Connection refused: s1.machine.org/192.168.64.121:7077 > ] > 14/02/07 07:00:59 INFO RemoteActorRefProvider$RemoteDeadLetterActorRef: > Message [org.apache.spark.deploy.DeployMessages$RegisterWorker] from > Actor[akka://sparkWorker/user/Worker#607746123] to > Actor[akka://sparkWorker/deadLetters] was not delivered. [1] dead letters > encountered. This logging can be turned off or adjusted with configuration > settings 'akka.log-dead-letters' and > 'akka.log-dead-letters-during-shutdown'. > > ... > > 14/02/07 07:01:59 ERROR Worker: All masters are unresponsive! Giving up. > =============================================== > -- Sourav Chandra Senior Software Engineer · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · · [email protected] o: +91 80 4121 8723 m: +91 988 699 3746 skype: sourav.chandra Livestream "Ajmera Summit", First Floor, #3/D, 68 Ward, 3rd Cross, 7th C Main, 3rd Block, Koramangala Industrial Area, Bangalore 560034 www.livestream.com
