Stefania created CASSANDRA-11542:
------------------------------------
Summary: Create a benchmark to compare HDFS and Cassandra bulk read times
Key: CASSANDRA-11542
URL: https://issues.apache.org/jira/browse/CASSANDRA-11542
Project: Cassandra
Issue Type: Sub-task
Components: Testing
Reporter: Stefania
Assignee: Stefania
Fix For: 3.x
I propose creating a benchmark to compare the bulk read performance of
Cassandra and HDFS. Data will be imported into Spark, which will run very
simple queries, and the end-to-end duration (import plus query) will be
measured. Example queries are the max or min of a column, or a count(*).
The benchmark should make it possible to measure the impact of:
* partition size
* number of clustering columns
* number of value columns (cells)
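
As a rough illustration of how the three parameters above could be swept, here
is a minimal Python sketch of a timing harness. It is not the proposed
implementation: the names Schema, schema_grid, time_run and fake_source are
hypothetical, and fake_source is only a stand-in for the real work of reading
from Cassandra or HDFS into Spark and running a count(*).

```python
import itertools
import time
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List


@dataclass(frozen=True)
class Schema:
    """One benchmark configuration (hypothetical field names)."""
    rows_per_partition: int
    clustering_columns: int
    value_columns: int


def schema_grid(partition_sizes: Iterable[int],
                clustering_counts: Iterable[int],
                value_counts: Iterable[int]) -> List[Schema]:
    """Build every combination of the three parameters under test."""
    return [Schema(p, c, v)
            for p, c, v in itertools.product(partition_sizes,
                                             clustering_counts,
                                             value_counts)]


def time_run(load_and_query: Callable[[Schema], object],
             schema: Schema,
             repeats: int = 3) -> Dict[str, object]:
    """Time the whole run (import plus query) and keep the fastest repeat."""
    durations = []
    result = None
    for _ in range(repeats):
        start = time.perf_counter()
        result = load_and_query(schema)
        durations.append(time.perf_counter() - start)
    return {"schema": schema, "result": result, "best_seconds": min(durations)}


def fake_source(schema: Schema) -> int:
    """Placeholder for 'load into Spark, then count(*)'; counts in-memory rows."""
    width = schema.clustering_columns + schema.value_columns
    rows = ([0] * width for _ in range(schema.rows_per_partition))
    return sum(1 for _ in rows)


if __name__ == "__main__":
    for schema in schema_grid([100, 10_000], [1, 3], [1, 10]):
        report = time_run(fake_source, schema)
        print(report["schema"], report["result"],
              f"{report['best_seconds']:.6f}s")
```

In a real benchmark, fake_source would be replaced by one callable per storage
backend, so that the same grid of schemas is timed against both Cassandra and
HDFS.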
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)