That feature does not exist but would be great to have. IIRC there is already a jira open for this. If not, please open one and if possible submit a patch :)
Bikas -----Original Message----- From: Abhishek Das [mailto:[email protected]] Sent: Wednesday, June 29, 2016 11:13 AM To: [email protected]; [email protected] Subject: Job failed because of disk space Hi, I have run into a job failure because of disk space. I noticed that in case of multi stage job (e.g M-R-R-R) the intermediate output data from all the stages are not deleted until the whole job is complete. Is there any configuration that will help deletion of the intermediate data if we see some preconfigured number of child level is already complete. I know we keep that for failure recovery but in case of M-R-R-R dag, when we are processing the last level we don't need the output of M stage. I am using tez 0.7 Regards, Abhishek Das
