dlmarion commented on PR #5726: URL: https://github.com/apache/accumulo/pull/5726#issuecomment-3063944122
7c7ecef pushes the class of the exception that occurred on the Compactor back to the Coordinator. This is used for logging and for incrementing failure counters for the queue, compactor, and table. Subsequent compaction successes will decrement the counters for the queue, compactor, and table. Using these counters we can return an empty job back to the Compactor when the current error rate is over some threshold. We could also emit metrics from the Coordinator based on these failure counts. The logic and accounting in 7c7ecef is not 100% correct. I pushed it up as-is to get feedback on if and how we should move forward with this idea. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: notifications-unsubscr...@accumulo.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org