László Pintér created HIVE-25842:
------------------------------------
Summary: Reimplement delta file metric collection
Key: HIVE-25842
URL: https://issues.apache.org/jira/browse/HIVE-25842
Project: Hive
Issue Type: Improvement
Reporter: László Pintér
Assignee: László Pintér
FUNCTIONALITY: Metrics are collected only when a Tez query runs a table (select
* and select count( * ) don't update the metrics)
Metrics aren't updated after compaction or cleaning after compaction, so users
will probably see "issues" with compaction (like many active or obsolete or
small deltas) that don't exist.
RISK: Metrics are collected during queries – we tried to put a try-catch around
each method in DeltaFilesMetricsReporter but of course this isn't foolproof.
This is a HUGE performance and functionality liability. Tests caught some
issues, but our tests aren't perfect.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)