Re: Job restart hook

2018-04-04 Thread Kostas Kloudas
Hi Navneeth, I am sending the answer to the user mailing list so that we keep the discussion public. There may also be other users interested in the question. So the answer to the question is that you cannot restart from an externalized checkpoint with a different parallelism. To be able to

Re: Job restart hook

2018-04-03 Thread Kostas Kloudas
Hi Navneeth, If I understand correctly, you have a job with parallelism p=20, a TM goes down (eg. with 4 slots), and you want until the TM comes up, to run the job with p=16 and then re-running it with 20 again, when the TM comes up. If this is the case, one important thing to keep in mind is

Job restart hook

2018-03-29 Thread Navneeth Krishnan
Hi, Is there a way for a script to be called whenever a job gets restarted? My scenario is lets say there are 20 slots and the job runs on all 20 slots. After a while a task manager goes down and now there are only 14 slots and I need to readjust the parallelism of my job to ensure the job runs