[DISCUSS] FLIP-376: Add DISTRIBUTED BY clause

Timo Walther Thu, 26 Oct 2023 02:00:44 -0700

Hi everyone,

I would like to start a discussion on FLIP-376: Add DISTRIBUTED BYclause [1].

Many SQL vendors expose the concepts of Partitioning, Bucketing, andClustering. This FLIP continues the work of previous FLIPs and wouldlike to introduce the concept of "Bucketing" to Flink.

This is a pure connector characteristic and helps both Apache Kafka andApache Paimon connectors in avoiding a complex WITH clause by providingimproved syntax.


Here is an example:

CREATE TABLE MyTable
  (
    uid BIGINT,
    name STRING
  )
  DISTRIBUTED BY (uid) INTO 6 BUCKETS
  WITH (
    'connector' = 'kafka'
  )

The full syntax specification can be found in the document. The clauseshould be optional and fully backwards compatible.


Regards,
Timo

[1]https://cwiki.apache.org/confluence/display/FLINK/FLIP-376%3A+Add+DISTRIBUTED+BY+clause

[DISCUSS] FLIP-376: Add DISTRIBUTED BY clause

Reply via email to