Github user JoshRosen commented on a diff in the pull request:
https://github.com/apache/spark/pull/4205#discussion_r23556520
--- Diff: python/pyspark/__init__.py ---
@@ -46,10 +46,12 @@
from pyspark.broadcast import Broadcast
from pyspark.serializers import MarshalSerializer, PickleSerializer
-# for back compatibility
-from pyspark.sql import SQLContext, HiveContext, SchemaRDD, Row
+from pyspark.graphx.vertex import VertexRDD
+from pyspark.graphx.edge import EdgeRDD, Edge
+from pyspark.graphx.graph import Graph
__all__ = [
"SparkConf", "SparkContext", "SparkFiles", "RDD", "StorageLevel",
"Broadcast",
"Accumulator", "AccumulatorParam", "MarshalSerializer",
"PickleSerializer",
-]
+ "VertexRDD", "EdgeRDD", "Edge", "Graph"]
--- End diff --
Similarly, I'm not sure that we want to add the GraphX methods to `__all__`
here since it doesn't include the SQL ones.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]