Sasi commented on SPARK-17607:

It's different, because I was able to start my master with ip and I 
saw that each worker regsitered on
I also was able to see that each worker as its own private ip, e.g. worker1 -, worker3 - and etc.
My spark-env.sh contained bother master-ip and worker-ip has it should be.

The problem on my bug is that once I requested data from each worker then it 
set the driverUrl, in my case, to

Do you want guide me on which logs/info you need for this issue?

> --driver-url doesn't point to my master_ip.
> -------------------------------------------
>                 Key: SPARK-17607
>                 URL: https://issues.apache.org/jira/browse/SPARK-17607
>             Project: Spark
>          Issue Type: Bug
>    Affects Versions: 1.5.2
>            Reporter: Sasi
>            Priority: Critical
> Hi,
> I have master machine and slave machine.
> My master machine contains 2 interfaces.
> First interface has the following ip, and the other interface has 
> the following ip
> I configured the MASTER_IP to be, so once the master goes up and its 
> worker I see the following INFO lines:
> {code}
> 16/09/20 12:32:32 INFO Worker: Successfully registered with master 
> spark://
> 16/09/20 12:39:15 INFO Worker: Asked to launch executor 
> app-20160920123915-0000/0 for Spark-DataAccessor-JBoss
> {code}
> I set the SPARK_LOCAL_IP on each worker to be its own ip, e.g
> Both constants were configured on spark-env.sh.
> The problem started when I tried to get data from my workers.
> I got the following INFO line in each worker log.
> {code} 
> "--driver-url" 
> "akka.tcp://sparkDriver@" "
> {code}
> As you can see the masterIp is different then the driver-url ip.
> Master ip is but driver-url is, therefore i'm getting 
> the following errors:
> {code}
> 16/09/20 12:17:57 INFO Slf4jLogger: Slf4jLogger started
> 16/09/20 12:17:57 INFO Remoting: Starting remoting
> 16/09/20 12:17:57 INFO Remoting: Remoting started; listening on addresses 
> :[akka.tcp://driverPropsFetcher@]
> 16/09/20 12:17:57 INFO Utils: Successfully started service 
> 'driverPropsFetcher' on port 34961.
> 16/09/20 12:19:00 WARN ReliableDeliverySupervisor: Association with remote 
> system [akka.tcp://sparkDriver@] has failed, address is now 
> gated for [5000] ms. Reason: [Association failed with 
> [akka.tcp://sparkDriver@]] Caused by: [Connection timed out: 
> /]
> Exception in thread "main" akka.actor.ActorNotFound: Actor not found for: 
> ActorSelection[Anchor(akka.tcp://sparkDriver@, 
> Path(/user/CoarseGrainedScheduler)]
>         at
> {code}
> {code}
>  "org.apache.spark.executor.CoarseGrainedExecutorBackend" "--driver-url" 
> "akka.tcp://sparkDriver@"
> {code}
> The master is listen and open for communicate via and not 
> Looks like the driver-url ignore the real MASTER_IP.
> Thanks,
> Sasi

This message was sent by Atlassian JIRA

To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to