[
https://issues.apache.org/jira/browse/HIVE-19995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16523835#comment-16523835
]
Zoltan Haindrich commented on HIVE-19995:
-----------------------------------------
This is only about basic stats: like rowcount.
It seems OrccRecordUpdater already provides rowCountDelta correctly (and it
can be negative) ; so it will keep track of deletes as well - for free :D
I think this will probably work very reliably in general for acid tables.
> Aggregate row traffic for acid tables
> -------------------------------------
>
> Key: HIVE-19995
> URL: https://issues.apache.org/jira/browse/HIVE-19995
> Project: Hive
> Issue Type: Sub-task
> Components: Statistics, Transactions
> Reporter: Zoltan Haindrich
> Assignee: Zoltan Haindrich
> Priority: Major
>
> for transactional tables we store basic stats in case of explicit
> analyze/rewrite; but doesn't do anything in other cases....which may even
> lead to plans which oom...
> It would be better to aggregate the total row traffic...because that is
> already available; so that operator tree estimations could work with a real
> upper bound of the row numbers.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)