[ 
https://issues.apache.org/jira/browse/CASSANDRA-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14273931#comment-14273931
 ] 

Ariel Weisberg commented on CASSANDRA-7933:
-------------------------------------------

Brought over from CASSANDRA-8597. Stress needs additional documentation to 
introduce concepts and vocabulary as well as common workflows. Most first-time 
users will not be familiar with C* terminology.

There are two dimensions: data distribution and access distribution. Knowing 
all the knobs for describing data distribution (# of partitions, # of 
cells/rows, size of cells) and access distribution (across partitions, across 
cells within partitions, random vs. sequential strides, # of rows/cells to 
select) would be helpful. I think these parameters are sometimes linked as 
well. Understanding data and access distribution is key to realistically 
simulating workloads.
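
To make the two dimensions concrete, here is a minimal Python sketch. It is 
not how cassandra-stress works internally and it uses none of the tool's 
option names; every function and parameter name below is hypothetical. It 
only shows that the shape of the data (partitions, rows, cell sizes) and the 
shape of the access pattern (which partitions are hit, how many rows are 
selected) are described separately, and that they can still be linked: a read 
cannot select more rows than the partition holds.

```python
import random


def make_dataset(num_partitions, rows_per_partition, cell_size, seed=0):
    """Data distribution: partition count, rows per partition, bytes per cell."""
    rng = random.Random(seed)
    data = {}
    for pk in range(num_partitions):
        n_rows = rows_per_partition(rng)
        # Each row is represented only by the size of its single cell.
        data[pk] = [cell_size(rng) for _ in range(n_rows)]
    return data


def sample_reads(data, pick_partition, rows_to_select, num_ops, seed=1):
    """Access distribution: which partition is read, and how many rows."""
    rng = random.Random(seed)
    keys = sorted(data)
    ops = []
    for _ in range(num_ops):
        pk = pick_partition(rng, keys)
        # Linked parameters: the selection is capped by the partition's size.
        limit = min(rows_to_select(rng), len(data[pk]))
        ops.append((pk, limit))
    return ops


def skewed(rng, keys):
    """Hypothetical hot-spot access: low-numbered partitions are read most."""
    idx = min(int(rng.expovariate(1 / 10)), len(keys) - 1)
    return keys[idx]


# Data side: 100 partitions, 1-20 rows each, cell sizes around 64 bytes.
data = make_dataset(
    num_partitions=100,
    rows_per_partition=lambda rng: rng.randint(1, 20),
    cell_size=lambda rng: max(1, int(rng.gauss(64, 16))),
)

# Access side: skewed partition choice, 1-5 rows selected per read.
ops = sample_reads(data, skewed, lambda rng: rng.randint(1, 5), num_ops=1000)
```

Swapping `skewed` for a uniform choice changes the access pattern without 
touching the data, which is the separation the documentation would need to 
explain.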

It would also help to have a recipe for doing a bulk load and then 
transitioning to some other mixed read/write pattern with a matching schema, 
plus a recipe for single-column rows as well as multi-column or very wide rows.

It's also worth mentioning in the distribution section that you can get the 
tool to print a summary of distributions so you know what the parameters are 
doing.

> Update cassandra-stress README
> ------------------------------
>
>                 Key: CASSANDRA-7933
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7933
>             Project: Cassandra
>          Issue Type: Task
>            Reporter: Benedict
>            Assignee: Philip Thompson
>            Priority: Minor
>             Fix For: 2.1.3
>
>         Attachments: CASSANDRA-7933.txt
>
>
> There is a README in the tools/stress directory. It is completely out of date.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
