AW: [Spark R]: dapply only works for very small datasets

2017-11-29 Thread Kunft, Andreas
?Thanks alot. I will have a lock at the issues Von: Felix Cheung <felixcheun...@hotmail.com> Gesendet: Mittwoch, 29. November 2017 04:47 An: Kunft, Andreas; user@spark.apache.org Betreff: Re: [Spark R]: dapply only works for very small datasets You can fin

Re: [Spark R]: dapply only works for very small datasets

2017-11-28 Thread Felix Cheung
; Sent: Tuesday, November 28, 2017 3:11 AM Subject: AW: [Spark R]: dapply only works for very small datasets To: Felix Cheung <felixcheun...@hotmail.com>, <user@spark.apache.org> Thanks for the fast reply. I tried it locally, with 1 - 8 slots on a 8 core machine w/ 25GB memory as w

AW: [Spark R]: dapply only works for very small datasets

2017-11-28 Thread Kunft, Andreas
ay, November 27, 2017 10:27:33 AM To: user@spark.apache.org Subject: [Spark R]: dapply only works for very small datasets Hello, I tried to execute some user defined functions with R using the airline arrival performance dataset. While the examples from the documentation for the `<-` appl

Re: [Spark R]: dapply only works for very small datasets

2017-11-27 Thread Felix Cheung
ber 27, 2017 10:27:33 AM To: user@spark.apache.org Subject: [Spark R]: dapply only works for very small datasets Hello, I tried to execute some user defined functions with R using the airline arrival performance dataset. While the examples from the documentation for the `<-` apply oper

[Spark R]: dapply only works for very small datasets

2017-11-27 Thread Kunft, Andreas
Hello, I tried to execute some user defined functions with R using the airline arrival performance dataset. While the examples from the documentation for the `<-` apply operator work perfectly fine on a size ~9GB, the `dapply` operator fails to finish even after ~4 hours. I'm using a