[
https://issues.apache.org/jira/browse/FLINK-31212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lyn Zhang updated FLINK-31212:
------------------------------
Description:
I have a case in [^test.sql] where records from table_1 that fail to match in
the left join are discarded by the group window.
I checked the interval join operator implementation. If a record in the left
table fails to join the right table, it is not emitted in real time; it is
emitted only after waiting an additional half of the join bound. In test.sql,
table_1 left joins table_2 with a 5-minute bound, so the output is delayed by
2.5 minutes, which causes the window to discard the records.
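The timing can be sketched as a small model. This is only an illustration of the arithmetic described above, not Flink's operator code: the function names, the 1-minute tumbling window, and the assumption that the downstream watermark has moved past the row's timestamp by exactly the extra delay when the unmatched row is emitted are all hypothetical (the real window definition lives in test.sql).

```python
from datetime import datetime, timedelta

EPOCH = datetime(1970, 1, 1)


def extra_delay(join_bound: timedelta) -> timedelta:
    """Extra wait before an unmatched left row is emitted: half the join bound."""
    return join_bound / 2


def tumbling_window_end(ts: datetime, size: timedelta) -> datetime:
    """End of the epoch-aligned tumbling window that contains ts."""
    offset = (ts - EPOCH) % size
    return ts - offset + size


def is_late(row_ts: datetime, window_size: timedelta, watermark: datetime) -> bool:
    """Simplified lateness check: the row is dropped if its window has already closed."""
    return watermark >= tumbling_window_end(row_ts, window_size)


# Values from the report: 5-minute join bound, unmatched row n2 at 14:00:00.
bound = timedelta(minutes=5)
row_ts = datetime(2023, 2, 24, 14, 0, 0)

# Assumption: when the padded row is finally emitted, the downstream
# watermark is already ahead of the row timestamp by the extra delay.
watermark_at_emit = row_ts + extra_delay(bound)

# Assumption: a hypothetical 1-minute tumbling window downstream.
print(extra_delay(bound))                                         # 0:02:30
print(is_late(row_ts, timedelta(minutes=1), watermark_at_emit))   # True
```

Under these assumptions, any downstream window shorter than the 2.5-minute extra delay has already closed by the time the unmatched row arrives, which is consistent with the data loss reported here.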
h2. testing
h4. input:
!image-2023-02-24-17-58-44-461.png!
{"n":"n1","ts":"2023-02-24 14:00:00"}
{"n":"n2","ts":"2023-02-24 14:00:00"}
{"n":"n1","ts":"2023-02-24 14:06:01"}
!image-2023-02-24-17-58-57-238.png!
{"n":"n1","ts":"2023-02-24 14:00:00","v":111}
{"n":"n1","ts":"2023-02-24 14:06:01","v":111}
h4. output:
expected:
!image-2023-02-24-18-00-52-891.png!
actual:
!image-2023-02-24-17-59-25-179.png!
I removed this logic in [https://github.com/apache/flink/pull/22014]. Please
help review this PR.
> Data lost on interval left join with window group
> -------------------------------------------------
>
> Key: FLINK-31212
> URL: https://issues.apache.org/jira/browse/FLINK-31212
> Project: Flink
> Issue Type: Bug
> Components: Table SQL / Runtime
> Affects Versions: 1.16.1
> Reporter: Lyn Zhang
> Priority: Major
> Labels: pull-request-available
> Attachments: image-2023-02-24-17-58-44-461.png,
> image-2023-02-24-17-58-57-238.png, image-2023-02-24-17-59-25-179.png,
> image-2023-02-24-18-00-52-891.png, test.sql
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)