Hi Erik,

I would guess that if you set spark.locality.wait to an absurdly large value, then you would have essentially that effect.
Maybe you aren't setting the system property before creating your Spark context?

http://spark.incubator.apache.org/docs/latest/configuration.html

Andrew

On Tue, Nov 26, 2013 at 2:40 PM, Erik Freed <[email protected]> wrote:

> Hi All,
>
> After switching to 0.8, and reducing the number of partitions/tasks for a
> large-scale computation, I have been unable to force Spark to use only
> executors on nodes where the HBase data is local. I have not been able to
> find a setting for spark.locality.wait that makes any difference. It is
> not an option for us to let Spark choose non-data-local nodes. Is there
> some example code showing how to get this to work the way we want? We have
> our own input RDD that mimics NewHadoopRDD, and it seems to be doing the
> correct thing in all regards with respect to preferred locations.
>
> Do I have to write my own compute tasks and schedule them myself?
>
> Anyone have any suggestions? I am stumped.
>
> cheers,
> Erik
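A minimal sketch of the 0.8-era pattern Andrew describes, assuming a Scala driver (the master URL and application name are placeholders): in Spark 0.8, spark.locality.wait is read from a Java system property when the SparkContext is constructed, so setting it afterwards has no effect.

    // The property must be set BEFORE the SparkContext is created;
    // a property set afterwards is ignored by the scheduler.
    // 3600000 ms is an "absurdly large" wait, per Andrew's suggestion,
    // so tasks effectively never fall back to non-data-local executors.
    System.setProperty("spark.locality.wait", "3600000")

    import org.apache.spark.SparkContext

    // Placeholder master URL and application name.
    val sc = new SparkContext("spark://master:7077", "HBaseLocalityJob")

For the custom input RDD Erik mentions, the hook the scheduler consults is getPreferredLocations. A sketch of the relevant override, assuming Spark 0.8's RDD API (regionHostFor is a hypothetical helper; getPartitions and compute are elided):

    import org.apache.spark.{Partition, SparkContext, TaskContext}
    import org.apache.spark.rdd.RDD

    class HBaseScanRDD(sc: SparkContext)
      extends RDD[(Array[Byte], Array[Byte])](sc, Nil) {

      override protected def getPartitions: Array[Partition] =
        sys.error("elided: one partition per HBase region")

      override def compute(split: Partition, context: TaskContext): Iterator[(Array[Byte], Array[Byte])] =
        sys.error("elided: scan the HBase region backing this partition")

      // The hostnames returned here are matched against executor hosts
      // when the scheduler decides task locality.
      override protected def getPreferredLocations(split: Partition): Seq[String] =
        Seq(regionHostFor(split))

      // Hypothetical helper: look up the region server hosting this split.
      private def regionHostFor(split: Partition): String =
        sys.error("elided: HBase region location lookup")
    }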
