Hello,

Can we assume that values sent to the reduce functions, are sorted in the order of their associated keys? From my experiments it appears to be true, but is this likely to change in the future?

Incase I'm not asking the right question, here's what I'm trying to do.

my map function produces output like:

[X, Y, 0]  -> Object_A
[X, Y, 1]  -> Object_B1
[X, Y, 1]  -> Object_B1
[X, Y, 1]  -> Object_B1
[Z, Q, 0] ....

Here I apply group_level=2, and use a ranged query ( [X, 0] to [X, [] ] ) since Y >= 0


Now during the reduce phase, I combine together Object_A's and associated Object_B's. Can I assume that the first of the values sent to 'reduce' is Object_A? Furthermore, if a rereduce is required, will the first 'rereduced' value be the 'left most' object produced during the previous reduction. In other words, does the reduce function fold left over it's input data?


Thanks for reading this,

- Tom

Reply via email to