Long story short, the performance issue appeared with a recompiled
version of the source TGZ downloaded from the Spark website.
The problem disappears with 1.6.2-SNAPSHOT (branch-1.6).
Guillaume
Do you have code that can reproduce this performance drop in
treeReduce? It would help with debugging. In the 1.6 release, we
profiled it via the various MLlib algorithms and did not see
performance drops.
That would be difficult, but if we cannot find the cause, we'll design
a small example to test it. I first have to check with the latest git
version. I have to recompile Spark with the LGPL version of netlib.