[ 
https://issues.apache.org/jira/browse/HBASE-13153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16290715#comment-16290715
 ] 

Akshay Gehi commented on HBASE-13153:
-------------------------------------

Understood. You are saying that the documentation should be updated since the 
limitation no longer applies to Bulk load but this section should not be 
removed. If someone uses an API such as Put.setDurability(SKIP_WAL), the 
limitation would still apply. 

My second question was related to the bulk load feature. Are there any other 
known problems using Bulk load feature on a regular basis on a production 
system. I have seen a lot of problems in various articles such as the following 
link but those articles seem to be dated and those limitations don't seem to 
apply any more. 

http://blog.cloudera.com/blog/2013/09/how-to-use-hbase-bulk-loading-and-why/ 

Some snippets from the (perhaps outdated) article:

*Potential Issues*
* Recently deleted data reappearing
* Bulk-loaded data cannot be overwritten by another bulk load

We intend to use bulk load on a production system on a daily basis since we 
expect to receive millions of insert/updates a day. 

> Bulk Loaded HFile Replication
> -----------------------------
>
>                 Key: HBASE-13153
>                 URL: https://issues.apache.org/jira/browse/HBASE-13153
>             Project: HBase
>          Issue Type: New Feature
>          Components: Replication
>            Reporter: sunhaitao
>            Assignee: Ashish Singhi
>             Fix For: 2.0.0, 1.3.0
>
>         Attachments: HBASE-13153-branch-1-v20.patch, 
> HBASE-13153-branch-1-v21.patch, HBASE-13153-v1.patch, HBASE-13153-v10.patch, 
> HBASE-13153-v11.patch, HBASE-13153-v12.patch, HBASE-13153-v13.patch, 
> HBASE-13153-v14.patch, HBASE-13153-v15.patch, HBASE-13153-v16.patch, 
> HBASE-13153-v17.patch, HBASE-13153-v18.patch, HBASE-13153-v19.patch, 
> HBASE-13153-v2.patch, HBASE-13153-v20.patch, HBASE-13153-v21.patch, 
> HBASE-13153-v3.patch, HBASE-13153-v4.patch, HBASE-13153-v5.patch, 
> HBASE-13153-v6.patch, HBASE-13153-v7.patch, HBASE-13153-v8.patch, 
> HBASE-13153-v9.patch, HBASE-13153.patch, HBase Bulk Load 
> Replication-v1-1.pdf, HBase Bulk Load Replication-v2.pdf, HBase Bulk Load 
> Replication-v3.pdf, HBase Bulk Load Replication.pdf, HDFS_HA_Solution.PNG
>
>
> Currently we plan to use HBase Replication feature to deal with disaster 
> tolerance scenario.But we encounter an issue that we will use bulkload very 
> frequently,because bulkload bypass write path, and will not generate WAL, so 
> the data will not be replicated to backup cluster. It's inappropriate to 
> bukload twice both on active cluster and backup cluster. So i advise do some 
> modification to bulkload feature to enable bukload to both active cluster and 
> backup cluster



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to