On Wed, Jul 2, 2014 at 12:07 AM, Carl Austin <[email protected]> wrote:
> Thanks for taking the time to look and comment, and glad it sounds
> interesting.
>
> The reason I started on this was that I'm using Accumulo and want to make
> an application usable on both HBase and Accumulo with the same codebase. I
> do a lot of aggregations on data and I feel the Accumulo iterator mechanism
> is superior for this use case; it's one of the main reasons I went with
> Accumulo and one of the only remaining major differences between the two
> applications now that HBase has implemented cell-level ACLs.
>
> For example, as I am ingesting a main table of data I am creating many
> other question-focused tables that keep answers like: how many times did I
> see combinations of values, when was the last time I saw combinations
> together, how many distinct values were in this field for each combination
> (using probabilistic counting of course), and many more. All of these things
> are well suited to Accumulo iterators for performance at scale because
> they run at compaction time across key/values that are already being
> read at that point, rather than having to update the answers to these
> questions on every single insert.
>
> This use case won't be for everyone, but the iterator mechanism is pretty
> neat, powerful and a real differentiator in Accumulo (of course there are
> many differentiators in HBase too!).
>

Thank you for sharing your experience. I'm watching your repo. Feel free to
ping me off-list if you want an opinion on how to hbase it or if you want a
review.

Thanks Carl,
St.Ack
