[ 
https://issues.apache.org/jira/browse/HBASE-4608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13148679#comment-13148679
 ] 

Kannan Muthukkaruppan commented on HBASE-4608:
----------------------------------------------

Li wrote: <<< The current bottleneck to HBase write speed is replicating the 
WAL appends across different datanodes. We can speed up this process by 
compressing the HLog. >>> 

Compression potentially adds some time, but yes, you save elsewhere by reducing 
the amount of data DFS has to handle. I am curious what kind of improvement you 
are seeing with your changes. Without "sync" (deferred log flushing), the win 
might be even bigger. Could you share some numbers with and without "sync"?


                
> HLog Compression
> ----------------
>
>                 Key: HBASE-4608
>                 URL: https://issues.apache.org/jira/browse/HBASE-4608
>             Project: HBase
>          Issue Type: New Feature
>            Reporter: Li Pi
>            Assignee: Li Pi
>         Attachments: 4608v1.txt
>
>
> The current bottleneck to HBase write speed is replicating the WAL appends 
> across different datanodes. We can speed up this process by compressing the 
> HLog. Current plan involves using a dictionary to compress table name, region 
> id, cf name, and possibly other bits of repeated data. Also, HLog format may 
> be changed in other ways to produce a smaller HLog.
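
To make the dictionary idea in the description concrete, here is a minimal 
sketch in Python. It is not the code in 4608v1.txt. The class and function 
names (WalDictionary, encode_fields, decode_fields) and the byte layout (a 
1-byte flag followed by a 2-byte length or index) are assumptions made purely 
for illustration. The first time a value such as the table name is written, it 
goes out in full; every later occurrence is replaced by a short index.

```python
import struct


class WalDictionary:
    """Hypothetical dictionary mapping repeated byte strings to small ids."""

    def __init__(self):
        self._ids = {}      # value -> index
        self._values = []   # index -> value

    def encode(self, value: bytes) -> bytes:
        idx = self._ids.get(value)
        if idx is not None:
            # Seen before: flag 1, then the 2-byte dictionary index.
            return b"\x01" + struct.pack(">H", idx)
        # First occurrence: flag 0, 2-byte length, raw bytes; remember it.
        # (No eviction here, so the sketch only supports up to 65536 entries.)
        self._ids[value] = len(self._values)
        self._values.append(value)
        return b"\x00" + struct.pack(">H", len(value)) + value

    def decode(self, buf: bytes, pos: int):
        """Return (value, next_pos), learning new entries as the writer did."""
        flag = buf[pos]
        (n,) = struct.unpack_from(">H", buf, pos + 1)
        pos += 3
        if flag == 1:
            return self._values[n], pos
        value = buf[pos:pos + n]
        self._ids[value] = len(self._values)
        self._values.append(value)
        return value, pos + n


def encode_fields(dictionary, fields):
    """Encode the repeated fields of one log entry (table, region, cf)."""
    return b"".join(dictionary.encode(f) for f in fields)


def decode_fields(dictionary, buf, pos, count):
    """Decode `count` fields starting at `pos`; return (fields, next_pos)."""
    out = []
    for _ in range(count):
        value, pos = dictionary.decode(buf, pos)
        out.append(value)
    return out, pos
```

With this layout, the first entry for a given table, region, and column family 
costs 3 bytes of overhead per field, and each later entry with the same values 
costs 9 bytes in total regardless of how long the names are. The reader rebuilds 
the same dictionary while decoding, so nothing extra needs to be stored.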


        
