Re: Can we allow executor to exit when tasks fail too many time?

2015-07-07 Thread Tao Li
Any Response? 2015-07-06 12:28 GMT+08:00 Tao Li : > > ​ > Node cloud10141049104.wd.nm.nop.sogou-op.org and > cloud101417770.wd.nm.ss.nop.sogou-op.org failed too many times, I want to > know if it can be auto offline when failed too many times? > > 2015-07-06 12:25 GMT+08:00 Tao Li : > >> I have a

Re: Can we allow executor to exit when tasks fail too many time?

2015-07-05 Thread Tao Li
​ Node cloud10141049104.wd.nm.nop.sogou-op.org and cloud101417770.wd.nm.ss.nop.sogou-op.org failed too many times, I want to know if it can be auto offline when failed too many times? 2015-07-06 12:25 GMT+08:00 Tao Li : > I have a long live spark application running on YARN. > > In some nodes, it

Can we allow executor to exit when tasks fail too many time?

2015-07-05 Thread Tao Li
I have a long live spark application running on YARN. In some nodes, it try to write to the shuffle path in the shuffle map task. But the root path /search/hadoop10/yarn_local/usercache/spark/ was deleted, so the task is failed. So every time when running shuffle map task on this node, it was alwa