[ 
https://issues.apache.org/jira/browse/CASSANDRA-234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sandeep Tata updated CASSANDRA-234:
-----------------------------------

    Attachment: commons-math-1.1.jar
                234.patch

Here's a tool that loosely mimics the workload of one of the apps we have 
running.

1. The code is in Java should anyone choose to use a "heavy client" that knows 
where to send a given request ... this typically cuts latency in half.
2. The workload consists of 2 phases -- 
The first is a load phase that does a whole bunch of batch inserts.
The second phase is a mix of inserts (new data and replacement of old data), 
get_columns and get_slices. There is also the option to do deletes that is 
turned off by default.
3. To keep things simple, only one column family is used, and no supercolumns 
are written/read.
4. In the end, the total write bandwidth and a bunch of latency stats are 
reported. I pulled in commons-math to get percentiles (median, 99th percentile 
latency numbers).
5. All the configuration parameters are specified as in conf/workload.xml

I've found a simpler version of this this tool rather useful in the past in 
locating bugs and now for testing performance. 

> Performance regression tests
> ----------------------------
>
>                 Key: CASSANDRA-234
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-234
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Tools
>         Environment: all
>            Reporter: Sandeep Tata
>            Assignee: Sandeep Tata
>         Attachments: 234.patch, commons-math-1.1.jar
>
>
> Since we have a few performance/efficiency tickets open 
> (https://issues.apache.org/jira/browse/CASSANDRA-233, 
> https://issues.apache.org/jira/browse/CASSANDRA-174), we should probably 
> start using some basic benchmark for regression. 
> VPork's an option.
> Johan has a description of how to get going with it: 
> http://blog.oskarsson.nu/2009/05/vpork.html
> Jon Travis has a description at http://wiki.github.com/trav/vpork/vpork
> A performance graph that keeps getting better as we move forward is always 
> fun :-)
> Can apache provide resources for a perf regression service? How do other 
> projects set this up? Suggestions?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to