[ 
https://issues.apache.org/jira/browse/HBASE-3967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13264252#comment-13264252
 ] 

Kannan Muthukkaruppan commented on HBASE-3967:
----------------------------------------------

The point of this JIRA was really just to provide a way to bulk import 
delete mutations in addition to put mutations. We solved this on the 89-fb 
branch by introducing a RowMutation (which extends Row) whose constructor can 
take a "Put" or a "Delete", and by using a RowMutationSortReducer (a variant 
of PutSortReducer that handles both Deletes and Puts). I will dig up the 
commit revs on the 89-fb branch and try to post the links shortly for you to 
take a look. Unless there are any technical objections, we should just port the 
same approach to trunk.
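
Until the commit links are posted, the idea can be illustrated with a rough, 
self-contained Java sketch. This is not the 89-fb code and it does not use the 
HBase API: the Cell, Type and RowMutation classes and the sortRow method are 
hypothetical stand-ins, each mutation carries a single cell, and the ordering 
(column, then newest timestamp first, then delete before put) is a simplified 
assumption modeled on how HBase orders KeyValues.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.TreeSet;

public class RowMutationSketch {

    /** Simplified cell type. DELETE is declared first so it sorts ahead of PUT. */
    public enum Type { DELETE, PUT }

    /** Hypothetical stand-in for a KeyValue: one cell of one row. */
    public static final class Cell {
        public final String row;
        public final String column;
        public final long timestamp;
        public final Type type;
        public final String value; // null for delete markers

        Cell(String row, String column, long timestamp, Type type, String value) {
            this.row = row;
            this.column = column;
            this.timestamp = timestamp;
            this.type = type;
            this.value = value;
        }
    }

    /** Hypothetical wrapper that carries either a put or a delete. */
    public static final class RowMutation {
        public final Cell cell;

        private RowMutation(Cell cell) {
            this.cell = cell;
        }

        public static RowMutation put(String row, String column, long ts, String value) {
            return new RowMutation(new Cell(row, column, ts, Type.PUT, value));
        }

        public static RowMutation delete(String row, String column, long ts) {
            return new RowMutation(new Cell(row, column, ts, Type.DELETE, null));
        }
    }

    /** Assumed ordering within a row: column, newest timestamp first, delete before put. */
    static final Comparator<Cell> CELL_ORDER = Comparator
            .comparing((Cell c) -> c.column)
            .thenComparing(Comparator.comparingLong((Cell c) -> c.timestamp).reversed())
            .thenComparing((Cell c) -> c.type);

    /**
     * Reducer-style step: collect every cell of one row, puts and deletes alike,
     * into a sorted set and return them in the order they would be written out.
     * Cells that compare equal are collapsed by the TreeSet.
     */
    public static List<Cell> sortRow(Iterable<RowMutation> mutationsForOneRow) {
        TreeSet<Cell> sorted = new TreeSet<>(CELL_ORDER);
        for (RowMutation m : mutationsForOneRow) {
            sorted.add(m.cell);
        }
        return new ArrayList<>(sorted);
    }
}
```

The only point of the sketch is that the sort step treats both mutation kinds 
uniformly, so a delete marker ends up next to the put it masks in the output.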
                
> Support deletes in HFileOutputFormat based bulk import mechanism
> ----------------------------------------------------------------
>
>                 Key: HBASE-3967
>                 URL: https://issues.apache.org/jira/browse/HBASE-3967
>             Project: HBase
>          Issue Type: Sub-task
>            Reporter: Kannan Muthukkaruppan
>            Priority: Critical
>             Fix For: 0.96.0
>
>         Attachments: diff.patch
>
>
> During bulk imports, it would be useful to be able to issue delete mutations 
> (either to delete data that already exists in HBase or data that was inserted 
> earlier during the same run of the import). 
> For example, we have a use case where we are processing a log of data that 
> may contain both inserts and deletes, and we want to upload it into 
> HBase using the bulk import mechanism.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
