Hello all,

I run into the "too many fetch failures" error when I submit a big job (e.g. more than 10000 tasks). I know this error occurs when several reducers are unable to fetch a given map output. However, I am sure the slaves can contact each other.

I am puzzled and have no idea how to deal with it. Maybe the network transfer is bad, but how can I solve that? Would increasing mapred.reduce.parallel.copies and mapred.reduce.copy.backoff make a difference?

Thank you!
Inifok
- How to deal with "too many fetch failures"? yang song