Yes. It will be less useful if you can't scan only the newest data, as you'll be recombining the same pieces of data on subsequent runs.
On Fri, May 16, 2014 at 1:54 PM, David Medinets <[email protected]>wrote: > If I have the following simple set of data: > > NAME John > NAME Jake > NAME John > NAME Mary > > I want to end up with the following: > > NAME 3 > > I'm thinking that perhaps a HyperLogLog approach should work. See > http://en.wikipedia.org/wiki/HyperLogLog for more information. > > Has anyone done this before in Accumulo? >
