Hi Mike, It would be great to have a StoreFunc for HBase! There is a rewrite underway for the Load/Store stuff that will make that a lot easier -- see https://issues.apache.org/jira/browse/PIG-966 . You may want to consider writing it for the load-store redesign branch. This is what's probably going to be in 0.7. The first step would be to open a jira and look at the existing StoreFunc implementations.
-D On Thu, Jan 14, 2010 at 9:59 PM, Michael Dalton <mwdal...@gmail.com> wrote: > Hi all, > > I was looking at the current Pig code in SVN, and it seems like HBase is > supported for loading, but not for storing. If this is the case, I'd like to > add support for writing to HBase to Pig. Is there anyone else working on > this, and if not is this something that you'd like contributed? Based on a > cursory evaluation of the StoreFunc interface, it looks like the APIs there > are pretty file-centric and may need to be modified to accomodate HBase's > table-based design. For example, you aren't going to be serializing your > output to an OutputStream object in all likelihood. > > I haven't contributed to Pig before, and I wanted to see if this is > something that would be beneficial to the rest of the Pig community, and if > so what next steps I should take (like starting a JIRA) to get the ball > rolling. Thanks > > Best regards, > > Mike >