[jira] [Updated] (FLINK-27626) Introduce pre-aggregated merge to table store

Flink Jira Bot (Jira) Sat, 13 Aug 2022 15:38:10 -0700


     [ 
https://issues.apache.org/jira/browse/FLINK-27626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Flink Jira Bot updated FLINK-27626:
-----------------------------------
    Labels: pull-request-available stale-major  (was: pull-request-available)

I am the [Flink Jira Bot|https://github.com/apache/flink-jira-bot/] and I help 
the community manage its development. I see this issues has been marked as 
Major but is unassigned and neither itself nor its Sub-Tasks have been updated 
for 60 days. I have gone ahead and added a "stale-major" to the issue". If this 
ticket is a Major, please either assign yourself or give an update. Afterwards, 
please remove the label or in 7 days the issue will be deprioritized.


> Introduce pre-aggregated merge to table store
> ---------------------------------------------
>
>                 Key: FLINK-27626
>                 URL: https://issues.apache.org/jira/browse/FLINK-27626
>             Project: Flink
>          Issue Type: New Feature
>          Components: Table Store
>            Reporter: Jingsong Lee
>            Priority: Major
>              Labels: pull-request-available, stale-major
>
> We can introduce richer merge strategies, one of which is already introduced 
> is PartialUpdateMergeFunction, which completes non-NULL fields when merging. 
> We can introduce more powerful merge strategies, such as support for 
> pre-aggregated merges.
> Usage 1:
> CREATE TABLE T (
>     pk STRING PRIMARY KEY NOT ENFOCED,
>     sum_field1 BIGINT,
>     sum_field1 BIGINT
> ) WITH (
>      'merge-engine' = 'aggregation',
>      'sum_field1.aggregate-function' = 'sum',
>      'sum_field2.aggregate-function' = 'sum'
> );
> INSERT INTO T VALUES ('pk1', 1, 1);
> INSERT INTO T VALUES ('pk1', 1, 1);
> SELECT * FROM T;
> => output 'pk1', 2, 2
> Usage 2:
> CREATE MATERIALIZED VIEW T
> with (
>     'merge-engine' = 'aggregation'
> ) AS SELECT
>     pk,
>     SUM(field1) AS sum_field1,
>     SUM(field2) AS sum_field1
> FROM source_t
> GROUP BY pk ;
> This will start a stream job to synchronize data, consume source data, and 
> write incrementally to T. This data synchronization job has no state.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (FLINK-27626) Introduce pre-aggregated merge to table store

Reply via email to