Github user gianmarcodonetti commented on the issue:
https://github.com/apache/spark/pull/20919
For who says RDD is deprecated:
cit:
https://databricks.com/blog/2016/07/14/a-tale-of-three-apache-spark-apis-rdds-dataframes-and-datasets.html
What happens to RDDs
GitHub user gianmarcodonetti opened a pull request:
https://github.com/apache/spark/pull/20919
Feature/apply func to rdd
## What changes were proposed in this pull request?
(Please fill in changes proposed in this fix)
## How was this patch tested?
(Please
Github user gianmarcodonetti closed the pull request at:
https://github.com/apache/spark/pull/20258
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h
Github user gianmarcodonetti commented on the issue:
https://github.com/apache/spark/pull/20258
@HyukjinKwon in my opinion, it helps a lot.
My goal is to avoid this case:
`final_rdd = func_3(func_2(func_1(initial_rdd)))`
And admit this:
`final_rdd
GitHub user gianmarcodonetti opened a pull request:
https://github.com/apache/spark/pull/20258
[SPARK-23060][Python] New feature - apply method to extend rdd's
functionality
## What changes were proposed in this pull request?
(Please fill in changes proposed in this fix