So even if I make the Spark executors run on the same nodes as the Cassandra
nodes, I am not sure each worker will connect to the C* node on the same Mesos
agent?

2017-01-12 21:13 GMT+01:00 Michael Gummelt <mgumm...@mesosphere.io>:

> The code in there w/ docs that reference CNI doesn't actually run when CNI
> is in effect, and doesn't have anything to do with locality.  It's just
> making Spark work in a no-DNS environment.
>
> On Thu, Jan 12, 2017 at 12:04 PM, vincent gromakowski <
> vincent.gromakow...@gmail.com> wrote:
>
>> I have found this but I am not sure how it can help...
>> https://github.com/mesosphere/spark-build/blob/a9efef8850976f787956660262f3b77cd636f3f5/conf/spark-env.sh
>>
>>
>> 2017-01-12 20:16 GMT+01:00 Michael Gummelt <mgumm...@mesosphere.io>:
>>
>>> That's a good point. I hadn't considered the locality implications of
>>> CNI yet.  I think tasks are placed based on the hostname reported by the
>>> executor, which in a CNI container will be different than the
>>> HDFS/Cassandra hostname.  I'm not aware of anyone running Spark+CNI in prod
>>> yet, either.
>>>
>>> However, locality in Mesos isn't great right now anyway.  Executors are
>>> placed w/o regard to locality.  Locality is only taken into account when
>>> tasks are assigned to executors.  So if you get a locality-poor executor
>>> placement, you'll also have locality-poor task placement.  It could be
>>> better.
>>>
>>> On Thu, Jan 12, 2017 at 7:55 AM, vincent gromakowski <
>>> vincent.gromakow...@gmail.com> wrote:
>>>
>>>> Hi all,
>>>> Does anyone have experience running Spark on Mesos with CNI (IP per
>>>> container)?
>>>> How would Spark use the IP or hostname for data locality with a backend
>>>> framework like HDFS or Cassandra?
>>>>
>>>> V
>>>>
>>>
>>>
>>>
>>> --
>>> Michael Gummelt
>>> Software Engineer
>>> Mesosphere
>>>
>>
>>
>
>
> --
> Michael Gummelt
> Software Engineer
> Mesosphere
>
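
The hostname mismatch described earlier in this thread can be illustrated with a toy model. This is only a sketch of the idea (tasks are matched to executors by comparing the host a data replica lives on with the host the executor reports); it is not Spark's scheduler code. The function `pick_executor`, the hostnames, and the IP addresses are all hypothetical and chosen for illustration.

```python
from typing import Dict, List, Optional


def pick_executor(preferred_hosts: List[str],
                  executors: Dict[str, str]) -> Optional[str]:
    """Return the id of the first executor whose reported host is one of
    the preferred hosts, or None if no executor matches."""
    preferred = set(preferred_hosts)
    for executor_id, reported_host in executors.items():
        if reported_host in preferred:
            return executor_id
    return None


# Hypothetical: a Cassandra/HDFS replica that lives on this Mesos agent.
replica_hosts = ["agent-1.example"]

# Host networking (assumed): executors report the agent hostname.
host_net_executors = {"exec-0": "agent-1.example", "exec-1": "agent-2.example"}

# CNI, IP per container (assumed): executors report a container address,
# even though exec-0 may physically run on agent-1.example.
cni_executors = {"exec-0": "9.0.1.17", "exec-1": "9.0.2.23"}

print(pick_executor(replica_hosts, host_net_executors))  # exec-0
print(pick_executor(replica_hosts, cni_executors))       # None
```

In the second call no executor is considered local, even if one of them shares a machine with the replica, because the comparison is purely on the reported name. The sketch also leaves out the other point made in the thread: executors are placed without regard to locality, so a match can only happen if an executor was launched on the right agent to begin with.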
