All,

I wanted to "announce" some work that I've been doing on allowing Pig to interface with Accumulo [1]. The code is available in the ACCUMULO-1783 branch [2] but still relies on some changes I made to Accumulo that aren't yet in a released version of Accumulo.

The origins of this work have been around for some time in a Git repo underneath the Accumulo "umbrella" but had mostly ignored as of late. My recent efforts have been to bring it up to speed up upstream Accumulo releases and ensure a full breadth of Pig Latin functionality.

Much of the design was modeled off of how HBaseStorage works, with some differences between how HBase and Accumulo themselves differ. I've tried to make a decent write up on what currently works, a high-level view on API/usage, an actual example with non-contrived data, and where I see future work leading with the integration [3].

A few questions for the Pig community: would you be interested in including some or all of this functionality into Pig itself? I'd be happy to work with you all to take this out of an "Accumulo" repo and into Pig itself. Additionally, any feedback on convention, style and/or best practices would be greatly appreciated as I'm still relatively new to working with Pig.

Thanks for your time!

- Josh


[1] http://accumulo.apache.org
[2] https://git-wip-us.apache.org/repos/asf?p=accumulo-pig.git
[3] http://people.apache.org/~elserj/accumulo-pig/

Reply via email to