[jira] [Commented] (FLINK-36661) the managed memory setting is relatively small, resulting in duplicate processing results for batch tasks

xuyang (Jira) Mon, 18 Nov 2024 18:23:11 -0800


    [ 
https://issues.apache.org/jira/browse/FLINK-36661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17899327#comment-17899327
 ]


xuyang commented on FLINK-36661:
--------------------------------

Hi, [~empcl] 

Could you check if this bug can be reproduced in the latest Flink version 1.20?

> the managed memory setting is relatively small, resulting in duplicate 
> processing results for batch tasks
> ---------------------------------------------------------------------------------------------------------
>
>                 Key: FLINK-36661
>                 URL: https://issues.apache.org/jira/browse/FLINK-36661
>             Project: Flink
>          Issue Type: Bug
>            Reporter: 陈磊
>            Priority: Major
>         Attachments: image-2024-11-05-15-45-44-349.png
>
>
> 1. Operating environment:
> Flinksql 1.16+Batch Task+TM Specification (4c 8g)
> 2. Core topology diagram & SQL frame:
> !image-2024-11-05-15-45-44-349.png!
> {code:sql}
> create view tmp1
> select ....
> from source
> group by xx, xx;
> create view tmp
> select ...
> from tmp1 a
> left join
> tmp1 b
> on a.xx = b.xx
> group by x, y, ....;
> insert into t
> select * from tmp;
> {code}
> 3. Problem triggering conditions
> The managed fraction is set to 0.1, and the calculated managed memory is 
> approximately 600M，there are a total of 3000w data sources, each with around 
> 160 fields
> 4. Result performance
> The correct output count for this task is 3828w
> When the managed fraction is 0.1, the output of the written result is 
> unstable, which may be 120 million data points, 150 million data points, or 
> 210 million data points.
> When the managed fraction is set to 0.4, the task output is stable and meets 
> expectations
> 5. Personal investigation & think
> * Through monitoring verification, it was found that when managed is set to 
> 0.1, the output parameters of sort significantly increase.
> * If the managed fraction is not adjusted, expanding TM memory can also 
> ensure stable output of technical results
> * Although managed memory is important for batch tasks, in situations where 
> managed memory is insufficient, it should be due to slow task execution or 
> OOM, rather than causing duplicate output data.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (FLINK-36661) the managed memory setting is relatively small, resulting in duplicate processing results for batch tasks

Reply via email to