agam-99 opened a new pull request, #4195:
URL: https://github.com/apache/gobblin/pull/4195

   ## Summary
   - Add explicit Gobblin Service scheduler liveness metrics so operators can 
alert when Quartz stops firing scheduled-flow triggers.
   - Track the last regular scheduled-flow trigger fire time and age separately 
from reminder triggers, since reminders should not mask scheduler liveness 
failures.
   - Preserve the existing broad `ScheduledFlows` meter while adding more 
targeted fired/failure metrics for alerting.
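
To illustrate the second bullet, here is a minimal sketch of how a regular-trigger fire time could be tracked separately from reminder fires. The class name `SchedulerLivenessTracker`, its method names, and the `-1` "never fired" sentinel are assumptions made for illustration; they are not taken from this PR's diff.

```java
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical tracker for illustration only; not the class added by this PR. */
class SchedulerLivenessTracker {
  // Assumed sentinel meaning "no regular trigger has fired yet".
  private static final long NEVER_FIRED = -1L;

  private final AtomicLong lastRegularFireMillis = new AtomicLong(NEVER_FIRED);

  /** Records a trigger fire; reminder fires are ignored so they cannot mask a stalled scheduler. */
  void onTriggerFired(boolean isReminder, long nowMillis) {
    if (!isReminder) {
      lastRegularFireMillis.set(nowMillis);
    }
  }

  /** Returns the age of the last regular fire, or NEVER_FIRED if none has been recorded. */
  long lastRegularFireAgeMillis(long nowMillis) {
    long last = lastRegularFireMillis.get();
    return last == NEVER_FIRED ? NEVER_FIRED : Math.max(0L, nowMillis - last);
  }
}
```

In this sketch a reminder fire leaves the recorded timestamp untouched, so the reported age keeps growing until a regular scheduled-flow trigger fires again.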
   
   ## Test plan
   - [x] `JAVA_HOME="$(/usr/libexec/java_home -v 1.8)" ./gradlew 
:gobblin-service:test --tests 
"org.apache.gobblin.service.modules.scheduler.GobblinServiceJobSchedulerTest"`
   - [x] `git diff --check`
   - [ ] `mint build` not applicable: this checkout is not a LinkedIn 
multiproduct directory
   
   ## Notes
   - The new `flowScheduler.lastScheduledFlowTriggerFireAgeMillis` gauge is 
intended for a scheduler-liveness SLO alert: page if the age exceeds the 
expected schedule interval while scheduled flow specs are present.
   - The first validation attempt with a newer Java version failed before 
compilation due to the repository's Gradle/Groovy compatibility; JDK 8 
validation passed.
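
The alert condition described in the first note can be sketched as a simple predicate. The class name `SchedulerLivenessAlert`, the method `shouldPage`, and its parameters are hypothetical; in practice the rule would likely live in the alerting system rather than in Java code, and the expected interval is a value the operator would supply.

```java
/** Hypothetical alert rule for illustration only; not part of this PR. */
final class SchedulerLivenessAlert {
  private SchedulerLivenessAlert() {}

  /**
   * Pages only when scheduled flow specs exist and the last regular trigger
   * fire is older than the expected schedule interval.
   */
  static boolean shouldPage(long fireAgeMillis, long expectedIntervalMillis, int scheduledFlowSpecCount) {
    if (scheduledFlowSpecCount <= 0) {
      // Nothing is scheduled, so an old (or missing) fire time is not a liveness failure.
      return false;
    }
    return fireAgeMillis > expectedIntervalMillis;
  }
}
```

Gating on the spec count avoids paging on an idle service that has no scheduled flows to fire.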
   
   Made with [Cursor](https://cursor.com)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
