Re: forcing dataframe groupby partitioning
If there's no built in local groupBy, You could do something like that: df.groupby(C1,C2).agg(...).flatmap(x=>x.groupBy(C1)).agg Thank you. Daniel On 29 Jan 2017, at 18:33, Mendelson, Assaf> wrote: Hi, Consider the following example:
Re: is this something to worry about? HADOOP_HOME or hadoop.home.dir are not set
No Thank you. Daniel On 20 Jan 2017, at 23:28, kant kodali> wrote: Hi, I am running spark standalone with no storage. when I use spark-submit to submit my job I get the following Exception and I wonder if this is something to worry about?