mingmwang commented on code in PR #261: URL: https://github.com/apache/arrow-ballista/pull/261#discussion_r985174106
########## ballista/rust/scheduler/src/state/execution_graph.rs: ########## @@ -110,6 +114,20 @@ pub struct ExecutionGraph { output_partitions: usize, /// Locations of this `ExecutionGraph` final output locations output_locations: Vec<PartitionLocation>, + /// Task ID generator, generate unique TID in the execution graph + task_id_gen: usize, + /// Failed stage attempts, record the failed stage attempts to limit the retry times. + /// Map from Stage ID -> Set<Stage_ATTPMPT_NUM> + failed_stage_attempts: HashMap<usize, HashSet<usize>>, Review Comment: > I don't get why we are saving a `HashSet` of attempts. Shouldn't it just be `usize`? The purpose of using HashSet is to record what are the exact distinct failed attempts. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org