陈磊 created FLINK-36661: -------------------------- Summary: the managed memory setting is relatively small, resulting in duplicate processing results for batch tasks Key: FLINK-36661 URL: https://issues.apache.org/jira/browse/FLINK-36661 Project: Flink Issue Type: Bug Reporter: 陈磊 Attachments: image-2024-11-05-15-45-44-349.png
1. Operating environment: Flinksql 1.16+Batch Task+TM Specification (4c 8g) 2. Core topology diagram & SQL frame: !image-2024-11-05-15-45-44-349.png! {code:sql} create view tmp1 select .... from source group by xx, xx; create view tmp select ... from tmp1 a left join tmp1 b on a.xx = b.xx group by x, y, ....; insert into t select * from tmp; {code} 3. Problem triggering conditions The managed fraction is set to 0.1, and the calculated managed memory is approximately 600M 4. Result performance The correct output count for this task is 3828w When the managed fraction is 0.1, the output of the written result is unstable, which may be 120 million data points, 150 million data points, or 210 million data points. When the managed fraction is set to 0.4, the task output is stable and meets expectations 5. Personal investigation & think 1)Through monitoring verification, it was found that when managed is set to 0.1, the output parameters of sort significantly increase. 2)If the managed fraction is not adjusted, expanding TM memory can also ensure stable output of technical results 3)Although managed memory is important for batch tasks, in situations where managed memory is insufficient, it should be due to slow task execution or OOM, rather than causing duplicate output data. -- This message was sent by Atlassian Jira (v8.20.10#820010)