Gary Helmling created HBASE-15773:
-------------------------------------

             Summary: CellCounter improvements
                 Key: HBASE-15773
                 URL: https://issues.apache.org/jira/browse/HBASE-15773
             Project: HBase
          Issue Type: Improvement
          Components: mapreduce
            Reporter: Gary Helmling


The CellCounter MapReduce job could be improved in a few areas:

* It does not currently support setting scan batching.  This is important when 
we're fetching all versions for columns, since batching caps the number of 
cells returned per Result.  Ideally, it would support all of the scan 
configuration currently provided in TableInputFormat.
* Generating job counters whose names contain row keys and column qualifiers 
is guaranteed to blow up on anything but the smallest table, because the 
number of counters grows with the table.  This is not usable, and it is 
redundant when the same counts are already in the job output.  The row- and 
qualifier-specific counters should be dropped.
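
To make the second point concrete, here is a rough sketch of how the number of
distinct counters grows when one counter is registered per row key and one per
(row, qualifier) pair. This is a simplified model, not CellCounter's actual
code: the class name, method name, and counter name prefixes are hypothetical,
and the limit of 120 assumes Hadoop's usual default for
mapreduce.job.counters.max, which may differ on a given cluster.

```java
import java.util.HashSet;
import java.util.Set;

public class CounterGrowthSketch {
    // Assumed default cap on counters per job (mapreduce.job.counters.max).
    static final int ASSUMED_COUNTER_LIMIT = 120;

    // Models a job that registers one counter per row key and one counter
    // per (row key, qualifier) pair, and returns how many distinct counter
    // names that produces.
    static int distinctCounterNames(int rows, int qualifiersPerRow) {
        Set<String> names = new HashSet<>();
        for (int r = 0; r < rows; r++) {
            String rowKey = "row-" + r;
            names.add("ROW:" + rowKey);
            for (int q = 0; q < qualifiersPerRow; q++) {
                names.add("QL:" + rowKey + ":q" + q);
            }
        }
        return names.size();
    }

    public static void main(String[] args) {
        int counters = distinctCounterNames(1000, 5);
        System.out.println(counters + " counters vs. assumed limit of "
                + ASSUMED_COUNTER_LIMIT);
    }
}
```

Even a table of 1000 rows with 5 qualifiers each needs 1000 + 1000 * 5 = 6000
counter names in this model, far beyond the assumed limit, while the same
per-row and per-qualifier counts fit naturally in the job output.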



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
