[ https://issues.apache.org/jira/browse/HBASE-3967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13264252#comment-13264252 ]
Kannan Muthukkaruppan commented on HBASE-3967:
----------------------------------------------
The point of this JIRA was simply to provide a way to bulk import delete
mutations in addition to put mutations. We solved this on the 89-fb branch by
introducing a RowMutation (which extends Row) whose constructor can take either
a "Put" or a "Delete", and by using a RowMutationSortReducer (a variant of
PutSortReducer that handles both Deletes and Puts). I will dig up the commit
revs on the 89-fb branch and try to post the links shortly for you to take a
look. Unless there are any technical objections, we should just port the same
approach to trunk.
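
To make the idea concrete, here is a rough, self-contained sketch of the
approach in plain Java. It is not the 89-fb code and it does not use any HBase
classes: BulkMutationSketch, Cell, Type and this RowMutation are hypothetical
stand-ins, and the sort order (row, then column, then newest timestamp first,
then a delete ahead of a put on a tie) is an assumption chosen so that a delete
marker precedes the put it masks. The reduce step collects all mutations for a
row into a sorted set, much as a sort reducer would before cells are written
out in order.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.TreeSet;

// Illustrative stand-ins only; none of these are the real HBase or 89-fb classes.
final class BulkMutationSketch {

    enum Type { PUT, DELETE }

    // One cell-level edit. A DELETE cell is a tombstone and carries no value.
    static final class Cell implements Comparable<Cell> {
        final String row;
        final String column;
        final long timestamp;
        final Type type;
        final String value;

        Cell(String row, String column, long timestamp, Type type, String value) {
            this.row = row;
            this.column = column;
            this.timestamp = timestamp;
            this.type = type;
            this.value = value;
        }

        // Assumed order: row and column ascending, newest timestamp first,
        // and a DELETE ahead of a PUT when everything else ties.
        @Override
        public int compareTo(Cell other) {
            int c = row.compareTo(other.row);
            if (c != 0) return c;
            c = column.compareTo(other.column);
            if (c != 0) return c;
            c = Long.compare(other.timestamp, timestamp);
            if (c != 0) return c;
            return other.type.compareTo(type);
        }

        @Override
        public String toString() {
            return row + "/" + column + "/" + timestamp + "/" + type
                    + (type == Type.PUT ? "=" + value : "");
        }
    }

    // A single wrapper type that can hold either kind of edit for one row.
    static final class RowMutation {
        final String row;
        final Type type;
        final List<Cell> cells = new ArrayList<>();

        RowMutation(String row, Type type) {
            this.row = row;
            this.type = type;
        }

        // Adds a cell of this mutation's type; the value is dropped for deletes.
        RowMutation add(String column, long timestamp, String value) {
            cells.add(new Cell(row, column, timestamp, type,
                    type == Type.PUT ? value : null));
            return this;
        }
    }

    // Reducer step: merge every mutation seen for a row into one sorted cell list.
    static List<Cell> reduce(Iterable<RowMutation> mutations) {
        TreeSet<Cell> sorted = new TreeSet<>();
        for (RowMutation m : mutations) {
            sorted.addAll(m.cells);
        }
        return new ArrayList<>(sorted);
    }

    public static void main(String[] args) {
        RowMutation put = new RowMutation("r1", Type.PUT)
                .add("cf:b", 7L, "v2")
                .add("cf:a", 5L, "v1");
        RowMutation delete = new RowMutation("r1", Type.DELETE)
                .add("cf:a", 5L, null);
        for (Cell cell : reduce(Arrays.asList(put, delete))) {
            System.out.println(cell);
        }
    }
}
```

In this sketch the mapper side would emit RowMutation values keyed by row, and
the only change relative to a put-only reducer is that the sorted set may now
contain tombstone cells alongside regular ones.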
> Support deletes in HFileOutputFormat based bulk import mechanism
> ----------------------------------------------------------------
>
> Key: HBASE-3967
> URL: https://issues.apache.org/jira/browse/HBASE-3967
> Project: HBase
> Issue Type: Sub-task
> Reporter: Kannan Muthukkaruppan
> Priority: Critical
> Fix For: 0.96.0
>
> Attachments: diff.patch
>
>
> During bulk imports, it'll be useful to be able to do delete mutations
> (either to delete data that already exists in HBase or was inserted earlier
> during this run of the import).
> For example, we have a use case where we are processing a log of data that
> may contain both inserts and deletes, and we want to upload it into HBase
> using the bulk import mechanism.