zhengruifeng commented on PR #51149:
URL: https://github.com/apache/spark/pull/51149#issuecomment-3095748274

   > Thank you for checking. I have two additional questions.
   > 
   > 1. How do you check the failure is OOM or not?
   > 2. For non-OOM failure like `Syntax Error`, can we fail at the first 
failure?
   
   @dongjoon-hyun Good point, I think we should analyze the failure and only 
retry when OOM
   
   
   > hmm the compliation would take 30 mins. I worry that it uses too much 
resources when the failure is actually legitimate and real.
   
   @HyukjinKwon this OOM normally occurs at 10~12 mins, and in most cases 
re-run can pass. So I guess it won't takes too much resources if re-try is only 
for OOM. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to