[ 
https://issues.apache.org/jira/browse/IMPALA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong updated IMPALA-1706:
----------------------------------
    Component/s:     (was: Perf Investigation)
                 Frontend

> Join returning single distinct column unnecessarily computes cross product
> --------------------------------------------------------------------------
>
>                 Key: IMPALA-1706
>                 URL: https://issues.apache.org/jira/browse/IMPALA-1706
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Frontend
>    Affects Versions: Impala 2.0
>            Reporter: Dileep Kumar
>            Priority: Minor
>              Labels: performance
>         Attachments: profile, summary
>
>
> While joining two simple table's on a columns with many duplicate values 
> never completes. The query page show 100% progress but no result returned.
> The query is :
> {{select distinct ca_country from customer_address, customer where 
> c_birth_country = upper(ca_country);}}
> The table stats are as belowe:
> customer:
> {code}
> Query: show table stats customer
> +---------+--------+----------+--------------+---------+-------------------+
> | #Rows   | #Files | Size     | Bytes Cached | Format  | Incremental stats |
> +---------+--------+----------+--------------+---------+-------------------+
> | 3850000 | 21     | 214.42MB | NOT CACHED   | PARQUET | false             |
> +---------+--------+----------+--------------+---------+-------------------+
> customer_address:
> Query: show table stats customer_address
> +---------+--------+----------+--------------+--------+-------------------+
> | #Rows   | #Files | Size     | Bytes Cached | Format | Incremental stats |
> +---------+--------+----------+--------------+--------+-------------------+
> | 1925000 | 21     | 208.55MB | NOT CACHED   | TEXT   | false             |
> +---------+--------+----------+--------------+--------+-------------------+
> {code}
> The query profile and execution summary is attached.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to