Yes, I did try it, and you are right: it behaves the same so far. I am not sure
how it's going to behave for large data sets, though. I don't see anything in the
documentation confirming this behavior.
--
Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/
I think that's how it should behave. Did you try it out and see?
On Tue, Oct 16, 2018 at 5:11 AM mhussain wrote:
> Hi,
>
> I have a dataframe which looks like
>
> +--------+---+-----+----+
> |group_id| id| text|type|
> +--------+---+-----+----+
> |       1|  1|  one|   a|
> |       1|  1|