[
https://issues.apache.org/jira/browse/HIVE-14803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15566568#comment-15566568
]
Siddharth Seth commented on HIVE-14803:
---------------------------------------
I'm reverting this patch, and re-submitting it to jenkins. Ran the entire
test-suite with and without the patch. Quite a few additional failures show up
with the patch only.
We really need to be running everything through Jenkins at this point.
> S3: Stats gathering for insert queries can be expensive for partitioned
> dataset
> -------------------------------------------------------------------------------
>
> Key: HIVE-14803
> URL: https://issues.apache.org/jira/browse/HIVE-14803
> Project: Hive
> Issue Type: Improvement
> Components: Metastore
> Affects Versions: 2.1.0
> Reporter: Rajesh Balamohan
> Assignee: Rajesh Balamohan
> Priority: Minor
> Fix For: 2.2.0
>
> Attachments: HIVE-14803.1.patch
>
>
> StatsTask's aggregateStats populates stats details for all partitions by
> checking the file sizes which turns out to be expensive when larger number of
> partitions are inserted.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)