Re: Theoretical complexity of a coGroup

2015-07-22 Thread Kostas Tzoumas
Hi Andra, CoGroup is not implemented as a Cartesian product, so O(V*E) is not a very accurate approximation. All this depends on what you count. Let's assume single-node execution and that everything fits in memory, and let's count comparisons and UDFs on groups. Then, coGroup sorts both inputs,

Theoretical complexity of a coGroup

2015-07-22 Thread Andra Lungu
Hi everyone, I am not 100% sure about this one, so I thought that I could set my thoughts straight via the mailing list. Here's the use case. You coGroup a data set of vertices with a data set of edges. That gives you a complexity of* O(|V| * |E|)*, where |V| is the total number of vertices and |