Hi Christoph,
The only unit test which exhibits this problem is the one which runs the
full MR job (testCanopyEuclideanMRJob()). This is darn hard to debug and
is doubly baffling since all the vectors should be read from Writable
format into new, distinct instances. If you have a small dataset which
exhibits the problem while running the reference implementation it would
be very nice if you could share it.
Jeff
Jeff Eastman wrote:
I added some test code to detect duplicate boundPoint entries and can
duplicate the issue in a unit test. I will see what is happening and
let you know.
Jeff