[ https://issues.apache.org/jira/browse/HBASE-11339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14118654#comment-14118654 ]
Andrew Purtell commented on HBASE-11339:
----------------------------------------
bq. we cannot store 10mb Cells in HBase then that's something to address.
We can store 10 MB cells in HBase. It is true that beyond some
use-case-dependent threshold we risk OOME under load with very large cells.
This is because the complete cell contents are materialized on the server for
RPC, as you mention.
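As a rough back-of-the-envelope illustration of that threshold, assume each in-flight RPC holds a fixed number of full copies of one cell on the heap. The class name, the copy factor, and the handler count used below are hypothetical values chosen for the arithmetic, not measurements or HBase defaults.

```java
// Hypothetical estimate only; this class is not part of HBase.
final class LargeCellHeapEstimate {

    /**
     * Worst-case bytes held on the heap when every in-flight RPC has
     * fully materialized one cell, possibly in several copies.
     * Throws ArithmeticException on overflow instead of wrapping.
     */
    static long worstCaseBytes(int concurrentRpcs, long cellBytes, int copiesPerRpc) {
        long perRpc = Math.multiplyExact(cellBytes, (long) copiesPerRpc);
        return Math.multiplyExact((long) concurrentRpcs, perRpc);
    }

    /** True if the estimated demand fits in the heap share set aside for RPC payloads. */
    static boolean fitsInBudget(long neededBytes, long budgetBytes) {
        return neededBytes <= budgetBytes;
    }
}
```

With 30 concurrent RPCs, 10 MB cells, and an assumed 2 copies per RPC, `worstCaseBytes(30, 10L * 1024 * 1024, 2)` comes to 600 MB of transient heap. Whether that is tolerable depends on the heap size and on everything else competing for it, which is why the threshold is use-case dependent.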
bq. The fact that we cannot stream into and out of HBase needs to be addressed,
that is the real problem anyway.
Definitely the lack of a streaming API is an issue worth looking at.
Related, the MOB design also attempts to avoid write amplification of large
cells during compaction by segregating large values into a separate set of
files outside the normal compaction process. Instead of normal compaction, an
external MapReduce-based tool compacts the MOB files. HBase has never
*required* MapReduce before, and we should think hard before introducing
such a change. Are we *sure* the desired objectives cannot be met with a
pluggable compaction policy?
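To make the question concrete, here is a minimal sketch of the kind of selection rule such a policy might apply: leave files above a size cap out of the compaction candidates so that their bytes are not rewritten. The class and method names are hypothetical and are not HBase APIs. The sketch models only file selection by size, not the MOB design's separation of individual large values.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; these names are not HBase APIs.
final class SizeCappedSelection {

    /** Returns the sizes of the files eligible for compaction: those at or below maxBytes. */
    static List<Long> select(List<Long> fileSizes, long maxBytes) {
        List<Long> picked = new ArrayList<>();
        for (long size : fileSizes) {
            if (size <= maxBytes) {
                picked.add(size);
            }
        }
        return picked;
    }

    /** Total bytes that compacting the given files would read and rewrite. */
    static long bytesRewritten(List<Long> fileSizes) {
        long total = 0;
        for (long size : fileSizes) {
            total += size;
        }
        return total;
    }
}
```

For files of 64, 2048, 128, and 4096 bytes with a 1024-byte cap, only the 64-byte and 128-byte files are selected, so a compaction rewrites 192 bytes instead of 6336. The large files are never merged under this rule, which is the trade-off any such policy would have to address.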
> HBase MOB
> ---------
>
> Key: HBASE-11339
> URL: https://issues.apache.org/jira/browse/HBASE-11339
> Project: HBase
> Issue Type: Umbrella
> Components: regionserver, Scanners
> Reporter: Jingcheng Du
> Assignee: Jingcheng Du
> Attachments: HBase MOB Design-v2.pdf, HBase MOB Design-v3.pdf, HBase
> MOB Design-v4.pdf, HBase MOB Design.pdf, MOB user guide.docx, MOB user
> guide_v2.docx, hbase-11339-in-dev.patch
>
>
> It's quite useful to store medium-sized binary data, such as images and
> documents, in Apache HBase. Unfortunately, directly saving a binary MOB
> (medium object) to HBase degrades performance because of frequent splits
> and compactions.
> In this design, MOB data is stored in a more efficient way, which
> maintains high write/read performance and guarantees data consistency in
> Apache HBase.