Hi, Interesting, we did a similar thing for indexing HBase content into Solr, you can find it here:
https://github.com/NGDATA/hbase-indexer The part which picks up on the HBase replication stream is available as standalone library: https://github.com/NGDATA/hbase-indexer/tree/master/hbase-sep You could consider reusing that. Here's an example of how to use it: https://github.com/NGDATA/hbase-indexer/blob/master/hbase-sep/hbase-sep-demo/src/main/java/com/ngdata/sep/demo/LoggingConsumer.java There's also a blog on this at http://www.ngdata.com/the-hbase-side-effect-processor-and-hbase-replication-monitoring/ On Tue, Aug 6, 2013 at 12:05 AM, Alex Newman <[email protected]> wrote: > Based on the previous work using async libraries to index HBase into > elastic search, I've created. > > https://github.com/posix4e/Elasticsearch-HBase-River > > This river uses the replication feature in HBase to replicate into > elastic search which means it's harder to setup, but works way more > efficiently. It hasn't been tested in production so please be careful, > but I'd love feedback. If you like the project make sure to fork/star > it on github. Any suggestions for next steps? > -- Bruno Dumon NGDATA - Making Sense of Data http://www.ngdata.com/
