Implement a Cassandra aware Hadoop mapreduce.Partitioner
--------------------------------------------------------

                 Key: CASSANDRA-1473
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1473
             Project: Cassandra
          Issue Type: Improvement
          Components: Hadoop
            Reporter: Stu Hood


When using a IPartitioner that does not sort data in byte order 
(RandomPartitioner for example) with Cassandra's Hadoop integration, Hadoop is 
unaware of the output order of the data.

We can make Hadoop aware of the proper order of the output data by implementing 
Hadoop's mapreduce.Partitioner interface: then Hadoop will handle sorting all 
of the data according to Cassandra's IPartitioner, and the writing clients will 
be able to connect to smaller numbers of Cassandra nodes.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to