[
https://issues.apache.org/jira/browse/FLINK-19644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17219071#comment-17219071
]
Jark Wu commented on FLINK-19644:
---------------------------------
First of all, I think these are two separate features: partition pruning, and
temporal join on the *latest* partition.
IIRC, almost all current use cases load the latest partition for the
temporal join. We haven't seen any use case where there is a secondary
partition key in addition to the time partition keys. Therefore, I would
suggest supporting only the most common case for now, and leaving the secondary
partition key feature for the future.
We can provide "read.partition=max_partition()" and
"read.reload-interval=1hour" for this case. Note that these options should
remain compatible when we migrate the implementation to scan mode in the future.
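To make the intended semantics concrete, here is a minimal Python sketch (not Flink code) of "pick the max partition and reload it periodically". It assumes partition values are zero-padded strings so that lexical order matches time order; the names max_partition and needs_reload are hypothetical and exist only for illustration.

```python
from datetime import datetime, timedelta
from typing import Dict, List


def max_partition(partitions: List[Dict[str, str]], keys: List[str]) -> Dict[str, str]:
    """Return the partition that sorts last when compared key by key.

    Assumption: values are zero-padded strings (e.g. '09', not '9'),
    so plain string comparison agrees with chronological order.
    """
    return max(partitions, key=lambda p: tuple(p[k] for k in keys))


def needs_reload(last_load: datetime, now: datetime, interval: timedelta) -> bool:
    """True once at least one reload interval has elapsed since the last load."""
    return now - last_load >= interval


partitions = [
    {"pt_year": "2020", "pt_month": "09", "pt_day": "15"},
    {"pt_year": "2020", "pt_month": "10", "pt_day": "01"},
    {"pt_year": "2019", "pt_month": "12", "pt_day": "31"},
]
latest = max_partition(partitions, ["pt_year", "pt_month", "pt_day"])
```

With a one-hour interval, the lookup side would recompute `latest` whenever needs_reload returns true and then reload only that partition.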
Regarding 'lookup.join.partition' = 'pt_year=2020;pt_month=09;pt_day=15',
IIUC, this option is used to prune partitions for the lookup source? Why not
support this feature through a WHERE condition and the
{{SupportsPartitionPushDown}} interface?
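As a rough illustration of why the two are equivalent, the sketch below (plain Python, not the {{SupportsPartitionPushDown}} API) parses a spec string of the form shown above and keeps only the partitions that match it, which is the same filtering a planner could derive from equality predicates in a WHERE clause. The helper names parse_partition_spec and prune_partitions are hypothetical.

```python
from typing import Dict, List


def parse_partition_spec(spec: str) -> Dict[str, str]:
    """Parse 'k1=v1;k2=v2' into a mapping of partition key to value."""
    result: Dict[str, str] = {}
    for part in spec.split(";"):
        key, _, value = part.partition("=")
        result[key.strip()] = value.strip()
    return result


def prune_partitions(
    partitions: List[Dict[str, str]], wanted: Dict[str, str]
) -> List[Dict[str, str]]:
    """Keep partitions whose values equal every key/value pair in `wanted`.

    This mirrors equality predicates such as
    WHERE pt_year = '2020' AND pt_month = '09'.
    """
    return [p for p in partitions if all(p.get(k) == v for k, v in wanted.items())]


all_partitions = [
    {"pt_year": "2020", "pt_month": "09", "pt_day": "14"},
    {"pt_year": "2020", "pt_month": "09", "pt_day": "15"},
    {"pt_year": "2020", "pt_month": "10", "pt_day": "01"},
]
spec = parse_partition_spec("pt_year=2020;pt_month=09;pt_day=15")
remaining = prune_partitions(all_partitions, spec)
```

A filter on only some of the keys, for example {"pt_year": "2020", "pt_month": "09"}, would keep both September partitions, which a fixed option string cannot express as naturally as a WHERE condition.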
> Support read specific partition of Hive table in temporal join
> --------------------------------------------------------------
>
> Key: FLINK-19644
> URL: https://issues.apache.org/jira/browse/FLINK-19644
> Project: Flink
> Issue Type: Improvement
> Components: Connectors / Hive, Table SQL / Ecosystem
> Reporter: Leonard Xu
> Assignee: Leonard Xu
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.12.0
>
>
> It's a common case to use a Hive partitioned table as a dimension table.
> Currently, a Hive table only supports loading all data. It would be helpful
> if we could support reading a user-specified partition in a temporal table.