[
https://issues.apache.org/jira/browse/IMPALA-12737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17856589#comment-17856589
]
Jason Fehr commented on IMPALA-12737:
-------------------------------------
[[email protected]] -- I'm picking up this Jira again. I see that
[~MikaelSmith] had a couple of unanswered questions. Can you please take a
look and answer them?
> Include List of Referenced Columns in Query Log Table
> -----------------------------------------------------
>
> Key: IMPALA-12737
> URL: https://issues.apache.org/jira/browse/IMPALA-12737
> Project: IMPALA
> Issue Type: Improvement
> Reporter: Manish Maheshwari
> Assignee: Jason Fehr
> Priority: Major
> Labels: workload-management
>
> In the Impala query log table where completed queries are stored, add lists
> of columns that were referenced in the query. The purpose behind this
> functionality is to know which columns are part of
> * Select clause
> * Where clause
> * Join clause
> * Aggegrate clause
> * Order by clause
> There should be a column for each type of clause, so that decisions can be
> made based on specific usage or on the union of those clauses.
> With this information, we will feed into compute stats command to collect
> stats only on the required columns that are using in joins / filters and
> aggegrates and not on all the table columns.
> The information can be collected as an array of
> [db1.table1.column1,db1.table1.column2]
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]