[
https://issues.apache.org/jira/browse/IMPALA-7854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Armstrong updated IMPALA-7854:
----------------------------------
Priority: Major (was: Critical)
> Slow ALTER TABLE and LOAD DATA statements for tables with large number of
> partitions
> ------------------------------------------------------------------------------------
>
> Key: IMPALA-7854
> URL: https://issues.apache.org/jira/browse/IMPALA-7854
> Project: IMPALA
> Issue Type: Improvement
> Components: Catalog
> Affects Versions: Impala 2.12.0
> Environment: 14 Nodes
> Table in question has 20 columns, 3 partition columns, and 57,475 partitions
> Reporter: vietn
> Priority: Major
> Labels: impala, performance
>
> ALTER TABLE and LOAD DATA statements take minutes (9 minutes for ALTER TABLE
> and 6 minutes for LOAD DATA) for tables with a large number of partitions.
> Our workaround was to use Hive to perform the LOAD DATA and then perform a
> REFRESH PARTITION using Impala.
> * 14 Nodes
> * Table in question has 20 columns, 3 partition columns, and 57,475
> partitions
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]