[ 
https://issues.apache.org/jira/browse/IMPALA-12737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17824506#comment-17824506
 ] 

Michael Smith commented on IMPALA-12737:
----------------------------------------

If we identified just columns that were missing stats, would that be 
sufficient? Working on how to name this, maybe "columns_missing_stats" or 
"key_columns".

> Include List of Referenced Columns in Query Log Table
> -----------------------------------------------------
>
>                 Key: IMPALA-12737
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12737
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Manish Maheshwari
>            Assignee: Michael Smith
>            Priority: Major
>              Labels: workload-management
>
> In the Impala query log table where completed queries are stored, add a list 
> of all columns that were referenced in the query. The purpose behind this 
> functionality is to know which columns are part of 
>  * Select clause
>  * Where clause
>  * Join clause
>  * Aggegrate clause
> With this information, we will feed into compute stats command to collect 
> stats only on the required columns that are using in joins / filters and 
> aggegrates and not on all the table columns.
> The information can be collected as an array of 
> [db1.table1.column1,db1.table1.column2]
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to