GitHub user HyukjinKwon commented on a diff in the pull request:
https://github.com/apache/spark/pull/22011#discussion_r208092650
--- Diff: python/pyspark/rdd.py ---
@@ -2429,6 +2441,29 @@ def _wrap_function(sc, func, deserializer, serializer, profiler=None):
sc.pythonVer, broadcast_vars,
sc._javaAccumulator)
+class RDDBarrier(object):
+
+ """
+ .. note:: Experimental
+
+ An RDDBarrier turns an RDD into a barrier RDD, which forces Spark to launch tasks of the stage
+ contains this RDD together.
--- End diff --
Ditto, let's add `.. versionadded:: 2.4.0` at the end of the docstring.
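For illustration, a minimal sketch of how the docstring might look with the directive appended. The docstring wording is copied as-is from the diff above. The class body is left empty here and the trailing check is only an assumption for demonstration, not part of the actual patch.

```python
class RDDBarrier(object):

    """
    .. note:: Experimental

    An RDDBarrier turns an RDD into a barrier RDD, which forces Spark to launch tasks of the stage
    contains this RDD together.

    .. versionadded:: 2.4.0
    """


# Illustrative check (not in the PR): the Sphinx directive is the
# last non-blank line of the class docstring.
last_line = RDDBarrier.__doc__.strip().splitlines()[-1].strip()
print(last_line)
```

Putting the directive last, separated by a blank line, keeps it apart from the description paragraph so Sphinx renders it as its own "New in version" note.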