[ https://issues.apache.org/jira/browse/FLINK-20695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Chesnay Schepler updated FLINK-20695: ------------------------------------- Description: I used flink 1.11 in standalone cluster mode for batch job. The enviornment was configured as zookeeper HA mode. After job was commited, flink runtime created nodes under /flink/default/leader and /flink/default/leaderlatch with job id. Though jobs were finished, these nodes were remaining in zookeeper path forever. After a period of running, more and more jobs had been executed and there were a greate number of nodes under /flink/default/leader and slowed down the performance of zookeeper. Why not delete the nodes after job finished? Flink runtime could get job status by listeners and delete the leader nodes for job immidiately. was: I used flink 1.11 in standalone cluster mode for batch job. The enviornment was configed as zookeeper HA mode. After job was commited, flink runtime created nodes under /flink/default/leader and /flink/default/leaderlatch with job id. Though jobs were finished, these nodes were remaining in zookeeper path forever. After a period of running, more and more jobs had been executed and there were a greate number of nodes under /flink/default/leader and slowed down the performance of zookeeper. Why not delete the nodes after job finished? Flink runtime could get job status by listeners and delete the leader nodes for job immidiately. > Zookeeper node under leader and leaderlatch is not deleted after job finished > ----------------------------------------------------------------------------- > > Key: FLINK-20695 > URL: https://issues.apache.org/jira/browse/FLINK-20695 > Project: Flink > Issue Type: Improvement > Components: Runtime / Task > Reporter: lidesheng > Priority: Critical > > I used flink 1.11 in standalone cluster mode for batch job. The enviornment > was configured as zookeeper HA mode. > After job was commited, flink runtime created nodes under > /flink/default/leader and /flink/default/leaderlatch with job id. Though > jobs were finished, these nodes were remaining in zookeeper path forever. > After a period of running, more and more jobs had been executed and there > were a greate number of nodes under /flink/default/leader and slowed down the > performance of zookeeper. Why not delete the nodes after job finished? Flink > runtime could get job status by listeners and delete the leader nodes for job > immidiately. -- This message was sent by Atlassian Jira (v8.3.4#803005)