Himanshu Gwalani created HBASE-28328:
----------------------------------------

             Summary: Add an option to count different types of Delete Markers 
in RowCounter
                 Key: HBASE-28328
                 URL: https://issues.apache.org/jira/browse/HBASE-28328
             Project: HBase
          Issue Type: Improvement
          Components: mapreduce
            Reporter: Himanshu Gwalani
            Assignee: Himanshu Gwalani


Add an option (count-delete-markers) to the 
[RowCounter|https://github.com/apache/hbase/blob/8a9ad0736621fa1b00b5ae90529ca6065f88c67f/hbase-mapreduce/src/main/java/org/apache/hadoop/hbase/mapreduce/RowCounter.java#L240C62-L240C75]
 tool to count the number of Delete Markers of all types, i.e. (DELETE, 
DELETE_COLUMN, DELETE_FAMILY,DELETE_FAMILY_VERSION)

We already have such a feature within our internal implementation of RowCounter 
and it's very useful.

Implementation Ideas:
1. If the option is passed, initialize the empty job counters for all 4 types 
of deletes.
2. Within mapper, increase the respective delete counts while processing each 
row.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to