Gary Helmling created HBASE-15773: ------------------------------------- Summary: CellCounter improvements Key: HBASE-15773 URL: https://issues.apache.org/jira/browse/HBASE-15773 Project: HBase Issue Type: Improvement Components: mapreduce Reporter: Gary Helmling
Looking at the CellCounter map reduce, it seems like it can be improved in a few areas: * it does not currently support setting scan batching. This is important when we're fetching all versions for columns. Actually, it would be nice to support all of the scan configuration currently provided in TableInputFormat. * generating job counters containing row keys and column qualifiers is guaranteed to blow up on anything but the smallest table. This is not usable and doesn't make any sense when the same counts are in the job output. The row and qualifier specific counters should be dropped. -- This message was sent by Atlassian JIRA (v6.3.4#6332)