[ 
https://issues.apache.org/jira/browse/HBASE-5403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13208752#comment-13208752
 ] 

Li Pi commented on HBASE-5403:
------------------------------

Rolling the log would just reset the dictionary, which means performance will 
be degraded for a bit until the dictionary was built back up again. 

I'm assuming checkpointing would involve dumping the contents of the dictionary 
at certain points - but the max size of the dictionary can be quite large, up 
to 32 megabytes or so in extreme cases. This has its own problems.
                
> Checkpoint the compressed HLog
> ------------------------------
>
>                 Key: HBASE-5403
>                 URL: https://issues.apache.org/jira/browse/HBASE-5403
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Liyin Tang
>            Assignee: Liyin Tang
>
> Let's assume that HBase replication can be based on replaying the HLog in the 
> replica cluster.
> The replica process could be crash during the replay. Obviously, the replica 
> process need a way to start from the lastest check point in the HLog, even 
> the HLog is compressed.
> So the proposal is to write a series of checkpoints within the HLog. 
> Each each checkpoint will have a header with some special sequence of bytes.
> And between each checkpoints, HLog should use new dictionaries to compress.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to