[ https://issues.apache.org/jira/browse/HBASE-2655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michele Catasta updated HBASE-2655:
-----------------------------------
Attachment: HBASE-2655.patch
@stack: I addressed the user experience problem you were talking about in
HBASE-2681. I updated the patch so that it depends on that code change, and
made this JIRA issue depend on it as well.
> 2-pass compression support
> --------------------------
>
> Key: HBASE-2655
> URL: https://issues.apache.org/jira/browse/HBASE-2655
> Project: HBase
> Issue Type: New Feature
> Components: io
> Reporter: Michele Catasta
> Priority: Minor
> Fix For: 0.21.0
>
> Attachments: HBASE-2655.patch
>
>
> Quoting from the Bigtable paper: "Many clients use a two-pass custom compression
> scheme. The first pass uses Bentley and McIlroy's scheme, which compresses
> long common strings across a large window. The second pass uses a fast
> compression algorithm that looks for repetitions in a small 16 KB window of
> the data. Both compression passes are very fast—they encode at 100-200 MB/s,
> and decode at 400-1000 MB/s on modern machines."
> The goal of this patch is to integrate a similar compression scheme in HBase.
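
To make the quoted two-pass idea concrete, here is a minimal sketch. It is not the code in the attached patch. The first pass is a simplified block-fingerprint scheme in the spirit of Bentley and McIlroy: it indexes aligned blocks of already-seen input and replaces later repeats with (position, length) copies. The second pass uses Deflate from java.util.zip as a stand-in for the fast small-window compressor (Deflate's window is 32 KB, not 16 KB). The class name TwoPassSketch, the 32-byte block size and the token format are assumptions made for illustration only.

```java
import java.io.ByteArrayOutputStream;
import java.nio.ByteBuffer;
import java.util.HashMap;
import java.util.Map;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

final class TwoPassSketch {
    static final int BLOCK = 32; // hypothetical fingerprint block size

    static byte[] compress(byte[] in) { return deflate(longRangeEncode(in)); }
    static byte[] decompress(byte[] in) { return longRangeDecode(inflate(in)); }

    // Pass 1: emit literal runs (tag 0) and copies of earlier input (tag 1).
    static byte[] longRangeEncode(byte[] in) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        writeInt(out, in.length);
        Map<Integer, Integer> seen = new HashMap<>(); // block hash -> block start
        int i = 0, litStart = 0, nextIndexed = 0;
        while (i + BLOCK <= in.length) {
            // Index every aligned block that lies entirely before position i.
            for (; nextIndexed + BLOCK <= i; nextIndexed += BLOCK) seen.put(hash(in, nextIndexed), nextIndexed);
            Integer src = seen.get(hash(in, i));
            if (src != null && sameBlock(in, src, i)) {
                int len = BLOCK; // extend the verified match forward
                while (i + len < in.length && in[src + len] == in[i + len]) len++;
                flushLiteral(out, in, litStart, i);
                out.write(1); writeInt(out, src); writeInt(out, len);
                i += len;
                litStart = i;
            } else {
                i++;
            }
        }
        flushLiteral(out, in, litStart, in.length);
        return out.toByteArray();
    }

    static byte[] longRangeDecode(byte[] enc) {
        ByteBuffer in = ByteBuffer.wrap(enc);
        byte[] out = new byte[in.getInt()];
        int pos = 0;
        while (pos < out.length) {
            if (in.get() == 0) {
                int len = in.getInt();
                in.get(out, pos, len);
                pos += len;
            } else {
                int src = in.getInt(), len = in.getInt();
                // Byte-by-byte so a copy may overlap the bytes it is producing.
                for (int k = 0; k < len; k++) out[pos + k] = out[src + k];
                pos += len;
            }
        }
        return out;
    }

    // Pass 2: Deflate at its fastest level.
    static byte[] deflate(byte[] in) {
        Deflater d = new Deflater(Deflater.BEST_SPEED);
        d.setInput(in);
        d.finish();
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[8192];
        while (!d.finished()) out.write(buf, 0, d.deflate(buf));
        d.end();
        return out.toByteArray();
    }

    static byte[] inflate(byte[] in) {
        Inflater inf = new Inflater();
        inf.setInput(in);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[8192];
        try {
            while (!inf.finished()) {
                int n = inf.inflate(buf);
                if (n == 0 && (inf.needsInput() || inf.needsDictionary())) throw new IllegalArgumentException("truncated input");
                out.write(buf, 0, n);
            }
        } catch (DataFormatException e) {
            throw new IllegalArgumentException(e);
        } finally {
            inf.end();
        }
        return out.toByteArray();
    }

    // Recomputed at every position for brevity; a rolling hash would avoid the repeated work.
    private static int hash(byte[] a, int p) {
        int h = 0;
        for (int k = 0; k < BLOCK; k++) h = 31 * h + a[p + k];
        return h;
    }

    private static boolean sameBlock(byte[] a, int p, int q) {
        for (int k = 0; k < BLOCK; k++) if (a[p + k] != a[q + k]) return false;
        return true;
    }

    private static void flushLiteral(ByteArrayOutputStream out, byte[] a, int from, int to) {
        if (to > from) { out.write(0); writeInt(out, to - from); out.write(a, from, to - from); }
    }

    private static void writeInt(ByteArrayOutputStream out, int v) {
        out.write(v >>> 24); out.write(v >>> 16); out.write(v >>> 8); out.write(v);
    }
}
```

The point of the split is that the first pass can find a repeat that is arbitrarily far back in the input, which a small-window compressor cannot see, while the second pass cleans up short-range redundancy in whatever the first pass leaves behind. TwoPassSketch.decompress(TwoPassSketch.compress(data)) should return the original bytes.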