[
https://issues.apache.org/jira/browse/PIG-2921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13872900#comment-13872900
]
Harsh J commented on PIG-2921:
------------------------------
Hi Nezih,
Thanks for the ping! I've not got adequate time to work on it myself at the
moment unfortunately. I'm un-assigning from self so its back in the pick-up
queue. Please feel free to post a patch if you intend to work on this!
> Provide a bulkloadable option in HBaseStorage
> ---------------------------------------------
>
> Key: PIG-2921
> URL: https://issues.apache.org/jira/browse/PIG-2921
> Project: Pig
> Issue Type: New Feature
> Components: data
> Affects Versions: 0.9.2
> Reporter: Harsh J
> Assignee: Harsh J
>
> Right now, the Pig HBaseStorage writes Puts directly into HBase. This is slow
> for bulk operations (such as the ones Pig exactly does). The Puts/Deletes are
> more meant for realtime operations, so it would be nice if Pig had an
> automatic mechanism to prepare bulkloadable HFiles for the target table, and
> bulkload it in right at the end of the job.
> For compatibility reasons, this can be optional and turned off by default
> until it is agreed that this must be default (but can continue to provide a
> turn-off option).
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)