[ https://issues.apache.org/jira/browse/HBASE-13153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16290715#comment-16290715 ]
Akshay Gehi commented on HBASE-13153:
-------------------------------------
Understood. You are saying that the documentation should be updated, since the
limitation no longer applies to bulk load, but that this section should not be
removed: if someone uses an API such as Put.setDurability(Durability.SKIP_WAL),
the limitation would still apply.
My second question relates to the bulk load feature itself. Are there any other
known problems with using bulk load on a regular basis on a production system?
I have seen many problems described in various articles, such as the one linked
below, but those articles seem dated and the limitations they describe no
longer seem to apply.
http://blog.cloudera.com/blog/2013/09/how-to-use-hbase-bulk-loading-and-why/
Some snippets from the (perhaps outdated) article:
*Potential Issues*
* Recently deleted data reappearing
* Bulk-loaded data cannot be overwritten by another bulk load
We intend to use bulk load on a production system on a daily basis, since we
expect to receive millions of inserts/updates a day.
> Bulk Loaded HFile Replication
> -----------------------------
>
> Key: HBASE-13153
> URL: https://issues.apache.org/jira/browse/HBASE-13153
> Project: HBase
> Issue Type: New Feature
> Components: Replication
> Reporter: sunhaitao
> Assignee: Ashish Singhi
> Fix For: 2.0.0, 1.3.0
>
> Attachments: HBASE-13153-branch-1-v20.patch,
> HBASE-13153-branch-1-v21.patch, HBASE-13153-v1.patch, HBASE-13153-v10.patch,
> HBASE-13153-v11.patch, HBASE-13153-v12.patch, HBASE-13153-v13.patch,
> HBASE-13153-v14.patch, HBASE-13153-v15.patch, HBASE-13153-v16.patch,
> HBASE-13153-v17.patch, HBASE-13153-v18.patch, HBASE-13153-v19.patch,
> HBASE-13153-v2.patch, HBASE-13153-v20.patch, HBASE-13153-v21.patch,
> HBASE-13153-v3.patch, HBASE-13153-v4.patch, HBASE-13153-v5.patch,
> HBASE-13153-v6.patch, HBASE-13153-v7.patch, HBASE-13153-v8.patch,
> HBASE-13153-v9.patch, HBASE-13153.patch, HBase Bulk Load
> Replication-v1-1.pdf, HBase Bulk Load Replication-v2.pdf, HBase Bulk Load
> Replication-v3.pdf, HBase Bulk Load Replication.pdf, HDFS_HA_Solution.PNG
>
>
> Currently we plan to use the HBase Replication feature to handle disaster
> tolerance scenarios. But we have encountered an issue: we will use bulk load
> very frequently, and because bulk load bypasses the write path and does not
> generate WAL entries, the data will not be replicated to the backup cluster.
> It is inappropriate to run the bulk load twice, once on the active cluster
> and once on the backup cluster. So I suggest modifying the bulk load feature
> so that a bulk load is applied to both the active cluster and the backup
> cluster.