Thanks again, Russ! Regarding "but it might not be in this case if most of the data has already been combined": does this mean Accumulo actually combines the entries and persists the combined result after the scan or compaction (depending on which operation the combiner is applied to)? For example, let's say there are 100 rows of [foo, 1]; will they actually be combined into a single row [foo, 100]? If that is the case, then the combiner is not expensive.
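To make the example concrete, here is a minimal sketch of the summing behaviour described above. It is plain Python, not Accumulo's Java iterator API; the function name `combine_sum` and the tuple layout are assumptions made only for illustration. It models the arithmetic only and says nothing about when, or whether, Accumulo persists the combined entry.

```python
from collections import defaultdict


def combine_sum(entries):
    """Fold (key, value) pairs into one summed value per key.

    Hypothetical helper that mimics what a summing combiner computes;
    it is not part of any Accumulo API.
    """
    totals = defaultdict(int)
    for key, value in entries:
        totals[key] += value
    # Return the combined entries in sorted key order.
    return sorted(totals.items())


# 100 entries of [foo, 1], as in the example above.
entries = [("foo", 1)] * 100
combined = combine_sum(entries)
print(combined)  # [('foo', 100)]
```

If the stored data really does end up in this folded form, later reads only have to touch one entry per key instead of 100.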
Wow, the -1 approach is brilliant! I hadn't even thought of it. Yes, this will work for my case because I only need to know the count.

Thanks,
Z
