[
https://issues.apache.org/jira/browse/HBASE-5626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14067807#comment-14067807
]
Nitish Upreti commented on HBASE-5626:
--------------------------------------
I am a newbie (student), learning about HBase and want to get started
contributing to the project. I have been scanning through the HBase "noob" tag
and found this issue interesting to work on. As this issue was last updated on
24/Jan/13 21:51, is the community still interested in this Task?
I understand the overall concepts of log-structured merge tree and reducing the
maximum number of disk seeks needed by compaction. I also understand how HBase
has pluggable a compaction component where we can exploit performance benefits
by knowing our data and request patterns in depth.
What are the relevant packages / source files / API References I should look
into for this task? Any general pointers from the community for working on this
task will be of great help. :)
> Compactions simulator tool for proofing algorithms
> --------------------------------------------------
>
> Key: HBASE-5626
> URL: https://issues.apache.org/jira/browse/HBASE-5626
> Project: HBase
> Issue Type: Task
> Reporter: stack
> Priority: Minor
> Labels: noob
> Attachments: cf_compact.py
>
>
> A tool to run compaction simulations would be a nice to have. We could use
> it to see how well an algo ran under different circumstances loaded w/
> different value types with different rates of flushes and splits, etc.
> HBASE-2462 had one (see in patch). Or we could try doing it using something
> like this: http://en.wikipedia.org/wiki/Discrete_event_simulation
--
This message was sent by Atlassian JIRA
(v6.2#6252)