[ 
https://issues.apache.org/jira/browse/HBASE-5626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14067807#comment-14067807
 ] 

Nitish Upreti commented on HBASE-5626:
--------------------------------------

I am a newbie (a student) learning about HBase, and I want to get started
contributing to the project. I have been scanning through the issues tagged
"noob" and found this one interesting to work on. As this issue was last
updated on 24/Jan/13 21:51, is the community still interested in this task?

I understand the overall concepts of log-structured merge trees and how
compaction reduces the maximum number of disk seeks needed for a read. I also
understand that HBase has a pluggable compaction component, which lets us gain
performance by knowing our data and request patterns in depth.

What are the relevant packages / source files / API references I should look
into for this task? Any general pointers from the community for working on this
task would be a great help. :)
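
To make the idea concrete, a simulator of this kind might look roughly like the
sketch below. It is only a toy under stated assumptions, not HBase code: the
names simulate and compact_all_at_threshold, the fixed flush size and interval,
and the "merge everything at N files" policy are all hypothetical and chosen
for illustration. It uses a time-ordered event queue (the discrete event
simulation approach) and reports how many bytes compactions rewrote and the
largest number of store files that existed at once.

```python
import heapq


def compact_all_at_threshold(files, threshold=3):
    """Hypothetical policy: select every file once `threshold` files exist."""
    return list(range(len(files))) if len(files) >= threshold else []


def simulate(policy, flush_interval=10, flush_size=64, end_time=100):
    """Toy discrete-event loop driven by a time-ordered heap of events."""
    # Pre-schedule one flush every `flush_interval` time units.
    events = [(t, "flush")
              for t in range(flush_interval, end_time + 1, flush_interval)]
    heapq.heapify(events)

    files = []           # sizes of the store files currently on disk
    bytes_rewritten = 0  # total bytes written by compactions
    max_files = 0        # largest number of files seen at any moment

    while events:
        now, kind = heapq.heappop(events)
        if kind == "flush":
            files.append(flush_size)
            max_files = max(max_files, len(files))
            # Ask the policy to look at the files right after each flush.
            heapq.heappush(events, (now, "compact"))
        elif kind == "compact":
            selected = set(policy(files))
            if selected:
                merged = sum(files[i] for i in selected)
                bytes_rewritten += merged
                # Replace the selected files with one merged file.
                files = [size for i, size in enumerate(files)
                         if i not in selected] + [merged]

    return {"files": files,
            "bytes_rewritten": bytes_rewritten,
            "max_files": max_files}


if __name__ == "__main__":
    print(simulate(compact_all_at_threshold))
```

Swapping in a different policy function with the same signature (a list of file
sizes in, a list of indices to merge out) would let two algorithms be compared
on the same flush pattern. Splits, variable value sizes, and compaction
duration are left out of this sketch.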

> Compactions simulator tool for proofing algorithms
> --------------------------------------------------
>
>                 Key: HBASE-5626
>                 URL: https://issues.apache.org/jira/browse/HBASE-5626
>             Project: HBase
>          Issue Type: Task
>            Reporter: stack
>            Priority: Minor
>              Labels: noob
>         Attachments: cf_compact.py
>
>
> A tool to run compaction simulations would be nice to have. We could use
> it to see how well an algorithm runs under different circumstances: loaded
> with different value types, with different rates of flushes and splits, etc.
> HBASE-2462 had one (see the patch there). Or we could try doing it using
> something like this: http://en.wikipedia.org/wiki/Discrete_event_simulation



--
This message was sent by Atlassian JIRA
(v6.2#6252)
