Hm, I've seen the issue with Bridge networking where the IP advertised by
libprocess in the container is not reachable by the master, i.e.
https://issues.apache.org/jira/browse/MESOS-2587. I've see this ticket is
resolved, however. It may require a patch in Spark itself. If possible, you
could try with host networking and see if the issue still exists.

-Elizabeth

On Thu, Oct 22, 2015 at 12:08 PM, Stavros Kontopoulos <
[email protected]> wrote:

> Bridge... with the latest mesos library vesion 0.25...
>
> On Thu, Oct 22, 2015 at 9:07 PM, Elizabeth Lingg <[email protected]>
> wrote:
>
>> Are you using Bridge or Host Networking?
>>
>> -Elizabeth
>>
>>
>>
>> On Thu, Oct 22, 2015 at 12:02 PM, Stavros Kontopoulos <
>> [email protected]> wrote:
>>
>>> Hi,
>>>
>>> Im using spark on mesos on docker. I have linked my slaves to the master
>>> and a
>>> spark repl works fine inside the master container.
>>>
>>> If i try to crate the same spark repl form the host i get stuck at the
>>> point when the framework tries to register to the mesos master (here the
>>> framework is the spark repl itself).
>>> I can ping the container from my host and vice versa. So networking its
>>> not the problem.
>>> What i noticed form the logs is that mesos does not resolve the correct
>>> ip:
>>>
>>> Framework failover timeout, removing framework
>>> b3605c33-f573-4d40-806f-b9b0abee2e32-0012 (Spark shell) at
>>> [email protected]:40186
>>>
>>> docker0 interface is on 172.17.x.x and my host is one such ip so i didnt
>>> expect there to see
>>> 127.0.1.1. I have tried several things like spark.driver.host,
>>> SPARK_LOCAL_IP to be set correctly but with no result...
>>> I suspect this is a mesos problem on docker...
>>>
>>> Thnx,
>>>
>>> S.
>>>
>>
>>
>

Reply via email to