[
https://issues.apache.org/jira/browse/HIVE-2471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231603#comment-13231603
]
Phabricator commented on HIVE-2471:
-----------------------------------
njain has commented on the revision "HIVE-2471 [jira] Add timestamp column with
index to the partition stats table.".
INLINE COMMENTS
ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsSetupConstants.java:26
Write a big comment here that it is the users responsibility
to delete the old table
ql/src/java/org/apache/hadoop/hive/ql/stats/jdbc/JDBCStatsUtils.java:128 I am
not sure this will work -
I am assuming this is invoked by StatsAggregator, but the data is
inserted by StatsPublisher.
The timestamp will be different in the 2 places
REVISION DETAIL
https://reviews.facebook.net/D2367
> Add timestamp column to the partition stats table.
> --------------------------------------------------
>
> Key: HIVE-2471
> URL: https://issues.apache.org/jira/browse/HIVE-2471
> Project: Hive
> Issue Type: Improvement
> Reporter: Kevin Wilfong
> Assignee: Kevin Wilfong
> Attachments: HIVE-2471.1.patch.txt, HIVE-2471.D2367.1.patch,
> HIVE-2471.D2367.2.patch
>
>
> Occasionally, when entries are added to the partition stats table the program
> is halted before it can delete those entries, by an exception, keyboard
> interrupt, etc. These build up to the point where the table gets very large,
> and it hurts the performance of the update statement which is often called.
> In order to fix this, I am adding a column to the table which is
> auto-populated with the current timestamp. This will allow us to create
> scripts that go through periodically and clean out old entries from the table.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira