You should probably look at the RMProxy code and the configs it uses. I am hoping that all clients including the MR AM now use that proxy and so older configs are no longer valid.
Bikas -----Original Message----- From: Karthik Kambatla [mailto:[email protected]] Sent: Sunday, August 11, 2013 8:45 PM To: [email protected] Subject: AM timeout on RM failure? Hi YARN devs, I am working on the ZKRMStateStore, and had a very basic question - on RM failure, how long does the AM fail before crashing, or more importantly what controls it. Looking into the code, I see the following two parameters: 1. yarn.app.mapreduce.am.scheduler.connection.wait.interval-ms - set to 1 min 2. Fix configs yarn.resourcemanager.resourcemanager.connect.{max.wait.secs|retry_interval .secs} - set by default to 15 mins and 30 seconds respectively The AM crashes only after 20 minutes. Are there any other configs that influence this? Thanks Karthik
