[ 
https://issues.apache.org/jira/browse/HIVE-18079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gopal V updated HIVE-18079:
---------------------------
    Attachment: HIVE-18079.5.patch

Rebase + explain one change to a test.

{code}
2018-01-02T20:25:52,726 DEBUG [36da1c38-207d-4b70-bb09-ff320a0062a3 main] 
calcite.sql2rel: Plan after trimming unused fields
HiveProject(key=[$0], key1=[$3])
  HiveJoin(condition=[=($1, $4)], joinType=[inner], algorithm=[none], cost=[not 
available])
    HiveJoin(condition=[=($0, $2)], joinType=[inner], algorithm=[none], 
cost=[not available])
      HiveFilter(condition=[AND(IS NOT NULL($0), IS NOT NULL($1))])
        HiveProject(key=[$0], value=[$1])
          HiveTableScan(table=[[default.tab_part]], table:alias=[a])
      HiveFilter(condition=[IS NOT NULL($0)])
        HiveProject(key=[$0])
          HiveTableScan(table=[[default.tab_part]], table:alias=[c])
    HiveFilter(condition=[IS NOT NULL($1)])
      HiveProject(key=[$0], value=[$1])
        HiveTableScan(table=[[default.tab_part]], table:alias=[b])
{code}

became 

{code}
2018-01-02T20:25:52,925 DEBUG [36da1c38-207d-4b70-bb09-ff320a0062a3 main] 
translator.PlanModifierForASTConv: Original plan for PlanModifier
 HiveProject(key=[$0], key1=[$2])
  HiveJoin(condition=[=($0, $4)], joinType=[inner], algorithm=[none], cost=[not 
available])
    HiveJoin(condition=[=($1, $3)], joinType=[inner], algorithm=[none], 
cost=[not available])
      HiveProject(key=[$0], value=[$1])
        HiveFilter(condition=[AND(IS NOT NULL($0), IS NOT NULL($1))])
          HiveTableScan(table=[[default.tab_part]], table:alias=[a])
      HiveProject(key=[$0], value=[$1])
        HiveFilter(condition=[IS NOT NULL($1)])
          HiveTableScan(table=[[default.tab_part]], table:alias=[b])
    HiveProject(key=[$0])
      HiveFilter(condition=[IS NOT NULL($0)])
        HiveTableScan(table=[[default.tab_part]], table:alias=[c]
{code}

> Statistics: Allow HyperLogLog to be merged to the lowest-common-denominator 
> bit-size
> ------------------------------------------------------------------------------------
>
>                 Key: HIVE-18079
>                 URL: https://issues.apache.org/jira/browse/HIVE-18079
>             Project: Hive
>          Issue Type: Improvement
>          Components: Standalone Metastore, Statistics
>    Affects Versions: 3.0.0
>            Reporter: Gopal V
>            Assignee: Gopal V
>         Attachments: HIVE-18079.1.patch, HIVE-18079.2.patch, 
> HIVE-18079.4.patch, HIVE-18079.5.patch
>
>
> HyperLogLog can merge a 14 bit HLL into a 10 bit HLL bitset, because of its 
> mathematical hash distribution & construction.
> Allow the squashing of a 14 bit HLL -> 10 bit HLL without needing a second 
> scan over the data-set.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to