Implement a Cassandra aware Hadoop mapreduce.Partitioner
--------------------------------------------------------
Key: CASSANDRA-1473
URL: https://issues.apache.org/jira/browse/CASSANDRA-1473
Project: Cassandra
Issue Type: Improvement
Components: Hadoop
Reporter: Stu Hood
When using a IPartitioner that does not sort data in byte order
(RandomPartitioner for example) with Cassandra's Hadoop integration, Hadoop is
unaware of the output order of the data.
We can make Hadoop aware of the proper order of the output data by implementing
Hadoop's mapreduce.Partitioner interface: then Hadoop will handle sorting all
of the data according to Cassandra's IPartitioner, and the writing clients will
be able to connect to smaller numbers of Cassandra nodes.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.