Hi folks,
I use replicated joins, and recently I ran into an issue: my
rightmost relation seems to have become too big and, even though I
don't get any "Java heap space" error, the time the maps take to
finish becomes exponentially longer (I cannot figure out exactly why).
Removing "replicated" fixes the issue, but it raises several questions.
In Alan's book (Figure 8.1, "Choosing a Join Implementation") it is
said that replicated joins should NOT BE USED for outer joins.
Nevertheless, it seems to work in the following case, and it is faster
than a regular join. Why is that?
    sessions = JOIN sessions BY locid LEFT, locations BY locid
               USING 'replicated';
(not all sessions have a location in this case)
Thanks for your advice.