[
https://issues.apache.org/jira/browse/FLINK-35233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated FLINK-35233:
-----------------------------------
Labels: pull-request-available (was: )
> HBase lookup result is wrong when lookup cache is enabled
> ---------------------------------------------------------
>
> Key: FLINK-35233
> URL: https://issues.apache.org/jira/browse/FLINK-35233
> Project: Flink
> Issue Type: Bug
> Components: Connectors / HBase
> Affects Versions: hbase-3.0.0
> Reporter: tanjialiang
> Priority: Critical
> Labels: pull-request-available
>
> HBase table
> ||rowkey||name||age||
> |1|ben|18|
> |2|ken|19|
> |3|mark|20|
>
> FlinkSQL lookup join with lookup cahce
> {code:java}
> CREATE TABLE dim_user (
> rowkey STRING,
> info ROW<name STRING, age STRING>,
> PRIMARY KEY (rowkey) NOT ENFORCED
> ) WITH (
> 'connector' = 'hbase-2.2',
> 'zookeeper.quorum' = 'localhost:2181',
> 'zookeeper.znode.parent' = '/hbase',
> 'table-name' = 'default:test',
> 'lookup.cache' = 'PARTIAL',
> 'lookup.partial-cache.max-rows' = '1000',
> 'lookup.partial-cache.expire-after-write' = '1h'
> );
> CREATE VIEW user_click AS
> SELECT user_id, proctime() AS proc_time
> FROM (
> VALUES('1'), ('2'), ('3'), ('1'), ('2')
> ) AS t (user_id);
> SELECT
> user_id,
> info.name,
> info.age
> FROM user_click INNER JOIN dim_user
> FOR SYSTEM_TIME AS OF user_click.proc_time
> ON dim_user.rowkey = user_click.user_id;{code}
>
> Expect Result
> ||rowkey||name||age||
> |1|ben|18|
> |2|ken|19|
> |3|mark|20|
> |1|ben|18|
> |2|ken|19|
>
> Actual Result
> ||rowkey||name||age||
> |1|ben|18|
> |2|ken|19|
> |3|mark|20|
> |1|mark|20|
> |2|mark|20|
>
> Wrong result when we lookup user_id 1 and 2 the second time.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)