Using a DISTINCT before the GROUP BY does not guarantee the date order. I need to keep the earliest occurrence of 'a' and discard all later occurrences of 'a'.
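
To make the requirement concrete, here is a minimal sketch in plain Python of the behavior described above. The function name `keep_earliest` and the list-of-tuples representation of the bag are assumptions made for illustration, not anything from Pig itself. It also assumes the dates are ISO-formatted strings (YYYY-MM-DD), which sort chronologically as plain text.

```python
def keep_earliest(bag):
    """Return only the earliest (date, value) tuple for each distinct value.

    `bag` is assumed to be a list of (date, value) tuples with ISO dates.
    """
    seen = set()
    result = []
    # Sort by date first so the first tuple seen for a value is the earliest.
    for date, value in sorted(bag, key=lambda t: t[0]):
        if value not in seen:
            seen.add(value)
            result.append((date, value))
    return result


# Input deliberately out of date order to show that ordering is handled.
bag = [("2012-12-17", "a"), ("2012-12-15", "a"), ("2012-12-23", "c")]
print(keep_earliest(bag))  # [('2012-12-15', 'a'), ('2012-12-23', 'c')]
```

Since Pig can register Python functions as Jython UDFs, logic along these lines could probably be wrapped as a UDF and applied to each bag, though that wiring is not shown or tested here.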

On Fri, Mar 8, 2013 at 2:10 PM, Norbert Burger <[email protected]> wrote:

> Looking at your sample, it seems you have a GROUPBY generating these
> bags...? Could you just insert a DISTINCT before this GROUP BY?
>
> Norbert
>
> On Fri, Mar 8, 2013 at 5:00 PM, Chan, Tim <[email protected]> wrote:
>
> > If I have a bag and would like to remove dupes, while saving the first
> > occurrence, is this possible?
> >
> > For example, for the following bag:
> >
> > (group_1,{(2012-12-15,a),(2012-12-17,a),(2012-12-23,c)})
> >
> > I would like my result to be the following:
> >
> > (group_1,{(2012-12-15,a),(2012-12-23,c)})
