Quanlong Huang created IMPALA-14698:
---------------------------------------
Summary: Support mutiple partitions in ALTER TABLE statement
Key: IMPALA-14698
URL: https://issues.apache.org/jira/browse/IMPALA-14698
Project: IMPALA
Issue Type: New Feature
Reporter: Quanlong Huang
Assignee: Quanlong Huang
Users might need to update tblproperties of multiple partitions, e.g.
explicitly setting numRows and STATS_GENERATED_VIA_STATS_TASK=true in an ETL
pipeline:
{code:sql}
alter table my_tbl partition (p=1) set tblproperties('numRows'='1035',
'STATS_GENERATED_VIA_STATS_TASK'='true');
alter table my_tbl partition (p=2) set tblproperties('numRows'='1030',
'STATS_GENERATED_VIA_STATS_TASK'='true');
alter table my_tbl partition (p=3) set tblproperties('numRows'='1000',
'STATS_GENERATED_VIA_STATS_TASK'='true');
{code}
Similar to IMPALA-14089, support doing this in a single statement helps to
reduce the lock contention in catalogd side. Also the table version won't
change frequently which avoids query planning retries (IMPALA-14695). Modifying
multiple partitions in a single operation also improve the metadata reloading
on them since they will be reloaded in parallel.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]