Re: forcing dataframe groupby partitioning

2017-01-29 Thread Haviv, Daniel
If there's no built in local groupBy, You could do something like that: df.groupby(C1,C2).agg(...).flatmap(x=>x.groupBy(C1)).agg Thank you. Daniel On 29 Jan 2017, at 18:33, Mendelson, Assaf > wrote: Hi, Consider the following example:

Re: is this something to worry about? HADOOP_HOME or hadoop.home.dir are not set

2017-01-21 Thread Haviv, Daniel
No Thank you. Daniel On 20 Jan 2017, at 23:28, kant kodali > wrote: Hi, I am running spark standalone with no storage. when I use spark-submit to submit my job I get the following Exception and I wonder if this is something to worry about?