[jira] [Updated] (FLINK-36165) Support capturing snapshot data with conditions

ASF GitHub Bot (Jira) Tue, 03 Dec 2024 19:57:11 -0800


     [ 
https://issues.apache.org/jira/browse/FLINK-36165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


ASF GitHub Bot updated FLINK-36165:
-----------------------------------
    Labels: pull-request-available  (was: )

> Support capturing snapshot data with conditions
> -----------------------------------------------
>
>                 Key: FLINK-36165
>                 URL: https://issues.apache.org/jira/browse/FLINK-36165
>             Project: Flink
>          Issue Type: Improvement
>          Components: Flink CDC
>            Reporter: Thorne
>            Priority: Minor
>              Labels: pull-request-available
>         Attachments: image-2024-08-28-14-06-40-501.png
>
>
> 1. Sometimes I need to backfill some data, for example after downstream 
> business consumption of the data has failed.
> 2. The job cannot be restarted from a checkpoint, because the binlog files 
> have been purged.
> 3. At this point, the only option is to synchronize the full data set from 
> scratch. This is acceptable if the data volume is small, but if the data 
> volume is large, backfilling the data requires significant resources.
> Current behavior when snapshotting all data:
> {code:java}
> select * from table{code}
> Current behavior when snapshotting one chunk:
> {code:java}
> select * from table where chunkStart > ? and chunkEnd < ?{code}
> So I think we can implement this feature. 
> As in DataX, users could read data from the table without taking a snapshot 
> of all of it. First, analyze the table, then query only the required range:
> {code:java}
> select * from table where col > x1 and col < x2
> {code}
> [https://github.com/alibaba/DataX/blob/master/mysqlreader/doc/mysqlreader.md|http://example.com/]
>  
> !image-2024-08-28-14-06-40-501.png|width=987,height=244!
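
The request above can be illustrated with a minimal sketch of how a user-supplied condition might be combined with the chunk range predicate during the snapshot phase. This is not Flink CDC's actual implementation: the class `ConditionalSnapshotQuery`, the method `build`, and its parameters are hypothetical names chosen for illustration, and the half-open chunk range (`>=` lower bound, `<` upper bound) is an assumption. The filter string is concatenated as-is, so the sketch assumes it comes from trusted job configuration.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical helper: builds a per-chunk snapshot query with an optional user filter. */
final class ConditionalSnapshotQuery {

    private ConditionalSnapshotQuery() {}

    /**
     * Builds the SELECT statement for one snapshot chunk.
     *
     * @param table       table name (assumed to be already quoted/validated)
     * @param splitColumn column used to split the table into chunks
     * @param hasStart    false for the first chunk, which has no lower bound
     * @param hasEnd      false for the last chunk, which has no upper bound
     * @param userFilter  optional extra condition from trusted configuration, may be null
     */
    static String build(String table, String splitColumn,
                        boolean hasStart, boolean hasEnd, String userFilter) {
        List<String> predicates = new ArrayList<>();
        if (hasStart) {
            predicates.add(splitColumn + " >= ?");
        }
        if (hasEnd) {
            predicates.add(splitColumn + " < ?");
        }
        if (userFilter != null && !userFilter.trim().isEmpty()) {
            // Parenthesize so an OR inside the filter cannot escape the chunk range.
            predicates.add("(" + userFilter.trim() + ")");
        }
        StringBuilder sql = new StringBuilder("SELECT * FROM ").append(table);
        if (!predicates.isEmpty()) {
            sql.append(" WHERE ").append(String.join(" AND ", predicates));
        }
        return sql.toString();
    }

    public static void main(String[] args) {
        // Example: snapshot only rows created since a given date, chunked by id.
        System.out.println(build("orders", "id", true, true, "created_at >= '2024-08-01'"));
    }
}
```

With no filter the generated statement is the plain chunk query, so existing behavior would be unchanged. With a filter, each chunk scans only the rows the user asked to backfill.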



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
