Hi all,

once again I can't wrap my head around how to approach a problem in Pig. I'm
trying to count a number of elements in a timespan if they are the first
that match a criterion. So let's say I have tuples with a date, a user and a
purchase, and now I want to count the users that made their first purchase
in the year 2010 (my timespan), but it's this "first purchase" which
troubles me, because usually I'd filter by date, then aggregate purchases by
users and then count the resulting rows. As it is right now I'd have to to
the above and then repeat the steps for the timespan before and then
subtract this set to the resulting set.
Isn't there an easier way?

Regards,
Chris

Reply via email to