[
https://issues.apache.org/jira/browse/CASSANDRA-47?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12786578#action_12786578
]
Tatu Saloranta commented on CASSANDRA-47:
-----------------------------------------
I just submitted a slightly rewritten version of LZF to Voldemort, optimized for
one specific use case: straight byte[] -> byte[], with no streaming. If that is
the model that would be used here (i.e. the input size is fully known and
streaming is not needed), it should work pretty well. Adding streaming is not
difficult; it just needs a bit more code to work well with all chunk sizes.
From my testing, LZF seems like a very promising candidate for these use cases:
it is very fast, with overhead generally no larger than the parsing overhead for
stored content, and the code is very simple, so it is a good algorithm to start with.
If more compression is needed, other algorithms (such as basic deflate, the
algorithm behind gzip) could be plugged in later on.
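To make the byte[] -> byte[] model and the "plug in later" idea concrete, here
is a minimal sketch, not a definitive implementation. BlockCodec and
DeflateBlockCodec are hypothetical names used only for illustration; they are
not part of Cassandra or Voldemort. The sketch uses deflate from java.util.zip
instead of LZF, simply because deflate ships with the JDK; an LZF codec would
sit behind the same interface.

```java
import java.io.ByteArrayOutputStream;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

/** Hypothetical plug-in point: whole block in, whole block out, no streaming. */
interface BlockCodec {
    byte[] compress(byte[] input);

    byte[] decompress(byte[] compressed) throws DataFormatException;
}

/** Deflate-backed codec (assumption: stands in for LZF in this sketch). */
final class DeflateBlockCodec implements BlockCodec {
    @Override
    public byte[] compress(byte[] input) {
        Deflater deflater = new Deflater(Deflater.BEST_SPEED);
        try {
            // The full input is known up front, so hand it over in one call.
            deflater.setInput(input);
            deflater.finish();
            ByteArrayOutputStream out =
                    new ByteArrayOutputStream(Math.max(32, input.length / 2));
            byte[] buf = new byte[4096];
            while (!deflater.finished()) {
                int n = deflater.deflate(buf);
                out.write(buf, 0, n);
            }
            return out.toByteArray();
        } finally {
            deflater.end(); // release native zlib memory
        }
    }

    @Override
    public byte[] decompress(byte[] compressed) throws DataFormatException {
        Inflater inflater = new Inflater();
        try {
            inflater.setInput(compressed);
            ByteArrayOutputStream out =
                    new ByteArrayOutputStream(Math.max(32, compressed.length * 2));
            byte[] buf = new byte[4096];
            while (!inflater.finished()) {
                int n = inflater.inflate(buf);
                // No progress and not finished: the block is truncated or
                // needs a preset dictionary, which this sketch does not support.
                if (n == 0 && !inflater.finished()
                        && (inflater.needsInput() || inflater.needsDictionary())) {
                    throw new DataFormatException("truncated or unsupported block");
                }
                out.write(buf, 0, n);
            }
            return out.toByteArray();
        } finally {
            inflater.end();
        }
    }
}
```

A caller would hold a BlockCodec reference and call compress() on a complete
block before writing it and decompress() after reading it, so swapping the
algorithm would only mean swapping the implementation behind the interface.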
Anyway, the main problem is obviously not the codec, but if you need any help
with the LZF codec, just let me know (or ask others on the Voldemort list).
> SSTable compression
> -------------------
>
> Key: CASSANDRA-47
> URL: https://issues.apache.org/jira/browse/CASSANDRA-47
> Project: Cassandra
> Issue Type: New Feature
> Components: Core
> Reporter: Jonathan Ellis
> Assignee: Eric Evans
> Priority: Minor
> Fix For: 0.9
>
>
> We should be able to do SSTable compression which would trade CPU for I/O
> (almost always a good trade).