[ 
https://issues.apache.org/jira/browse/FLINK-14663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kurt Young reassigned FLINK-14663:
----------------------------------

    Assignee: Terry Wang

> Distinguish unknown column stats and zero
> -----------------------------------------
>
>                 Key: FLINK-14663
>                 URL: https://issues.apache.org/jira/browse/FLINK-14663
>             Project: Flink
>          Issue Type: Bug
>          Components: Connectors / Hive, Table SQL / API
>            Reporter: Kurt Young
>            Assignee: Terry Wang
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> When converting from hive stats to flink's column stats, we didn't check 
> whether some columns stats is really set or just an initial value. For 
> example:
> {code:java}
> // code placeholder
> LongColumnStatsData longColStats = stats.getLongStats();
> return new CatalogColumnStatisticsDataLong(
>       longColStats.getLowValue(),
>       longColStats.getHighValue(),
>       longColStats.getNumDVs(),
>       longColStats.getNumNulls());
> {code}
>  Hive `LongColumnStatsData` actually has information whether some stats is 
> set through APIs like `isSetNumDVs()`. And the initial values are all 0, it 
> will confuse us is it really 0 or just an initial value. 
>  
> We can use -1 to represent UNKNOWN value for column stats. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to