[GitHub] spark issue #20919: Feature/apply func to rdd

2018-03-28 Thread gianmarcodonetti
Github user gianmarcodonetti commented on the issue: https://github.com/apache/spark/pull/20919 For who says RDD is deprecated: cit: https://databricks.com/blog/2016/07/14/a-tale-of-three-apache-spark-apis-rdds-dataframes-and-datasets.html What happens to RDDs

[GitHub] spark pull request #20919: Feature/apply func to rdd

2018-03-28 Thread gianmarcodonetti
GitHub user gianmarcodonetti opened a pull request: https://github.com/apache/spark/pull/20919 Feature/apply func to rdd ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How was this patch tested? (Please

[GitHub] spark pull request #20258: [SPARK-23060][Python] New feature - apply method ...

2018-02-26 Thread gianmarcodonetti
Github user gianmarcodonetti closed the pull request at: https://github.com/apache/spark/pull/20258 --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h

[GitHub] spark issue #20258: [SPARK-23060][Python] New feature - apply method to exte...

2018-01-13 Thread gianmarcodonetti
Github user gianmarcodonetti commented on the issue: https://github.com/apache/spark/pull/20258 @HyukjinKwon in my opinion, it helps a lot. My goal is to avoid this case: `final_rdd = func_3(func_2(func_1(initial_rdd)))` And admit this: `final_rdd

[GitHub] spark pull request #20258: [SPARK-23060][Python] New feature - apply method ...

2018-01-13 Thread gianmarcodonetti
GitHub user gianmarcodonetti opened a pull request: https://github.com/apache/spark/pull/20258 [SPARK-23060][Python] New feature - apply method to extend rdd's functionality ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix