[
https://issues.apache.org/jira/browse/FLINK-29708?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gyula Fora closed FLINK-29708.
------------------------------
Resolution: Fixed
merged to main 3c63653820742993926ff8901cfaa618a2f855cb
> Enrich Flink Kubernetes Operator CRD error field
> ------------------------------------------------
>
> Key: FLINK-29708
> URL: https://issues.apache.org/jira/browse/FLINK-29708
> Project: Flink
> Issue Type: Improvement
> Components: Kubernetes Operator
> Affects Versions: kubernetes-operator-1.3.0
> Reporter: Daren Wong
> Assignee: Daren Wong
> Priority: Major
> Labels: pull-request-available
> Fix For: kubernetes-operator-1.3.0
>
>
> h1. Problem Statement:
> FlinkDeployment and FlinkSessionJob CRD has a CommonStatus error field of
> String type. Currently, this field stores various errors such as:
> * CR validation error
> * Missing SessionJob error/ Missing JobManager deployment error
> * Unknown Job error
> * DeploymentFailedException
> * ReconciliationError such as RestClientException from Flink Internal such
> as FlinkRest and FlinkRuntime
> It is insufficient to store each error simply as string only. We need to
> include some exception metadata to help operator handle this error
> accordingly. For example, it is very useful to know the HttpResponseStatus
> code from RestClientException.
> h1. Proposed Solution:
> * The error field should store a JSON with exception metadata. For example:
> {code:java}
> {
> "type": "JobManagerNotFoundException",
> "message": "JobManager with leadership ID: 1234 was not found",
> "stackTrace": "JobManager lost connection at ....",
> "additionalMetadata": {
> "httpResponseCode": "400"
> },
> "throwableList": [
> {
> "type": "FlinkRuntimeException",
> "message": "other exception"
> },
> ....
> ]
> } {code}
> * The stackTrace field can be enabled or disabled via spec change.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)