Himanshu Gwalani created HBASE-28328:
----------------------------------------
Summary: Add an option to count different types of Delete Markers
in RowCounter
Key: HBASE-28328
URL: https://issues.apache.org/jira/browse/HBASE-28328
Project: HBase
Issue Type: Improvement
Components: mapreduce
Reporter: Himanshu Gwalani
Assignee: Himanshu Gwalani
Add an option (count-delete-markers) to the
[RowCounter|https://github.com/apache/hbase/blob/8a9ad0736621fa1b00b5ae90529ca6065f88c67f/hbase-mapreduce/src/main/java/org/apache/hadoop/hbase/mapreduce/RowCounter.java#L240C62-L240C75]
tool to count the number of delete markers of each type, i.e. DELETE,
DELETE_COLUMN, DELETE_FAMILY, and DELETE_FAMILY_VERSION.
We already have such a feature within our internal implementation of RowCounter
and it's very useful.
Implementation Ideas:
1. If the option is passed, initialize empty job counters for all four delete
types.
2. Within the mapper, increment the respective delete counter while processing
each row.
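
The two steps above can be sketched roughly as follows. This is only a minimal, self-contained illustration and not the proposed patch: the CellType enum, the DeleteMarkerCountSketch class, and the plain map of counters are hypothetical stand-ins for HBase cell types and MapReduce job counters, so the snippet runs without HBase or Hadoop on the classpath.

```java
import java.util.EnumMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch; names here are illustrative and not part of HBase.
class DeleteMarkerCountSketch {

  // Stand-in for the cell types a mapper could see in one row.
  enum CellType { PUT, DELETE, DELETE_COLUMN, DELETE_FAMILY, DELETE_FAMILY_VERSION }

  // Step 1: create one zero-valued counter per delete marker type.
  static Map<CellType, Long> initCounters() {
    Map<CellType, Long> counters = new EnumMap<>(CellType.class);
    for (CellType type : CellType.values()) {
      if (type != CellType.PUT) {
        counters.put(type, 0L);
      }
    }
    return counters;
  }

  // Step 2: for each cell of a row, increment the matching delete counter.
  // Non-delete cells (PUT) have no counter and are skipped.
  static void countRow(List<CellType> rowCells, Map<CellType, Long> counters) {
    for (CellType type : rowCells) {
      counters.computeIfPresent(type, (key, value) -> value + 1);
    }
  }

  public static void main(String[] args) {
    Map<CellType, Long> counters = initCounters();
    // Two made-up rows standing in for mapper input.
    countRow(List.of(CellType.PUT, CellType.DELETE, CellType.DELETE_FAMILY), counters);
    countRow(List.of(CellType.DELETE, CellType.PUT), counters);
    System.out.println(counters);
  }
}
```

In the real tool the map would presumably be replaced by job counters incremented from the mapper, and the scan would need to return delete markers at all (for example a raw scan); both points are assumptions about the eventual implementation.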