Does it work on a smaller file?

Mayur Rustagi
Ph: +1 (760) 203 3257
http://www.sigmoidanalytics.com
@mayur_rustagi <https://twitter.com/mayur_rustagi>

On Sat, Mar 22, 2014 at 4:50 AM, Ryan Compton <compton.r...@gmail.com> wrote:

> Does it work without .distinct()?
>
> Possibly related issue I ran into:
>
> https://mail-archives.apache.org/mod_mbox/spark-user/201401.mbox/%3CCAMgYSQ-3YNwD=veb1ct9jro_jetj40rj5ce_8exgsrhm7jb...@mail.gmail.com%3E
>
> On Sat, Mar 22, 2014 at 12:45 AM, Kane <kane.ist...@gmail.com> wrote:
> > It's 0.9.0
> >
> > --
> > View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/distinct-on-huge-dataset-tp3025p3027.html
> > Sent from the Apache Spark User List mailing list archive at Nabble.com.