> Date: Mon, 24 Nov 2008 15:55:36 -0500
> To:
> Cc: Brandon Dimcheff <[EMAIL PROTECTED]>
> Subject: RE-using intermediate data
>
Hi,

I have a script roughly analogous to this:

users = LOAD '/users.tsv' AS (id);
sessions = LOAD '/sessions.tsv' AS (id, userid, duration, day);
user_sessions = JOIN users BY id INNER, sessions BY userid INNER;
intermediate_aggregate = FOREACH (GROUP user_sessions BY (userid, day))
{