[
https://issues.apache.org/jira/browse/FLINK-31268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17751499#comment-17751499
]
Yun Tang commented on FLINK-31268:
----------------------------------
[~ruanhang1993] I think this is an actual blocker for releasing flink-1.18.
Once restored and using metrics to report, this bug would lead to no more
checkpoints could be triggered if implemented operator coordinator metrics.
> OperatorCoordinator.Context#metricGroup will return null when restore from a
> savepoint
> --------------------------------------------------------------------------------------
>
> Key: FLINK-31268
> URL: https://issues.apache.org/jira/browse/FLINK-31268
> Project: Flink
> Issue Type: Improvement
> Components: Runtime / Metrics
> Reporter: Hang Ruan
> Assignee: Hang Ruan
> Priority: Major
> Labels: pull-request-available
>
> The `metricGroup` is initialized lazily in the method
> `OperatorCoordinatorHandler#initializeOperatorCoordinators`.
> This will cause the NullPointerException when we use it in the method like
> `Source#restoreEnumerator`, which will be invoked through
> `SchedulerBase#createAndRestoreExecutionGraph` before
> `OperatorCoordinatorHandler#initializeOperatorCoordinators` in class
> `SchedulerBase#<init>`.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)