[
https://issues.apache.org/jira/browse/IMPALA-12981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17851719#comment-17851719
]
Jason Fehr edited comment on IMPALA-12981 at 6/3/24 4:24 PM:
-------------------------------------------------------------
Use Case -- customers can run compute stats on the weekends for the most
popular columns on the most popular tables. This use case must be accomplished
using only SQL, without a scripting language.
was (Author: JIRAUSER298428):
Use Case -- customers can run compute stats on the weekends for the most
popular columns on the most popular tables.
> Support a column list in compute stats that is retrieved via a subquery
> -------------------------------------------------------------------------
>
> Key: IMPALA-12981
> URL: https://issues.apache.org/jira/browse/IMPALA-12981
> Project: IMPALA
> Issue Type: Improvement
> Components: Backend, Frontend
> Reporter: Manish Maheshwari
> Priority: Major
>
> Support a column list in compute stats that is retrieved via a subquery.
> Specifically, we want to use the Impala query history tables, where we collect
> the columns in a table that are used for joins, aggregates, filters, etc., and
> pass them into the compute stats command -
> Suggested Syntax -
> {code:java}
> compute stats db.tbl (
> select distinct join_columns
> from sys.impala_query_log
> where contains(tables_queried, "db.tbl")
> and query_dttm >current_timestamp()-7
> and join_columns rlike 'db.tbl'
> ) {code}
>
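For comparison, the following is a minimal sketch of the client-side step that the proposed syntax would make unnecessary: turning `join_columns` values fetched from the query log into a compute stats statement with an explicit column list. The helper name `build_compute_stats` is hypothetical, and the sketch assumes each `join_columns` value is a comma-separated list of fully qualified names such as `db.tbl.col`; neither detail is confirmed by the issue.

```python
def build_compute_stats(table, join_columns_rows):
    """Build a compute stats statement with an explicit column list.

    table: fully qualified table name, e.g. "db.tbl".
    join_columns_rows: iterable of strings (or None), each ASSUMED to be a
    comma-separated list of fully qualified columns, e.g. "db.tbl.c1,db.x.c2".
    Returns None when no column of `table` is found, so the caller never
    falls back to computing stats on every column by accident.
    """
    prefix = table.lower() + "."
    columns = []
    for row in join_columns_rows:
        for name in (row or "").split(","):
            name = name.strip().lower()
            if not name.startswith(prefix):
                continue  # column belongs to a different table
            col = name[len(prefix):]
            # Skip empty names and anything still containing a dot
            # (e.g. nested fields); keep first-seen order, no duplicates.
            if col and "." not in col and col not in columns:
                columns.append(col)
    if not columns:
        return None
    quoted = ", ".join(f"`{c}`" for c in columns)
    return f"compute stats {table} ({quoted})"
```

The returned string would still have to be submitted by a script or scheduler, which is exactly the dependency the use case above wants to avoid.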