Chris Kanich created SPARK-15061:
------------------------------------

             Summary: Upgrade Py4J to 0.10.1
                 Key: SPARK-15061
                 URL: https://issues.apache.org/jira/browse/SPARK-15061
             Project: Spark
          Issue Type: Improvement
          Components: PySpark
            Reporter: Chris Kanich


Py4J 0.10.1 has not been released yet, but it will likely bring a significant 
performance improvement for PySpark, and for MLlib in particular. More details 
are available at https://github.com/bartdag/py4j/issues/201

The syscall overhead described in that issue was likely also the reason that 
https://issues.apache.org/jira/browse/SPARK-6728 was reported. Dropping the 
base64 encoding will help too, but I expect this fix to help more.
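
As a rough illustration of the base64 cost mentioned above, here is a minimal 
sketch that only measures how much base64 inflates a binary payload. It does 
not exercise PySpark or Py4J, and the helper name base64_overhead is 
hypothetical, introduced only for this example.

```python
import base64
import os


def base64_overhead(num_bytes):
    """Return (raw_size, encoded_size) for a random payload of num_bytes."""
    payload = os.urandom(num_bytes)
    # base64 maps every 3 input bytes to 4 output characters,
    # padding the final group with '=' when needed.
    encoded = base64.b64encode(payload)
    return len(payload), len(encoded)


if __name__ == "__main__":
    raw, enc = base64_overhead(3_000_000)
    print("raw bytes:", raw)
    print("encoded bytes:", enc)
    print("ratio:", enc / raw)
```

The size increase of about one third is independent of the payload contents; 
the CPU time spent encoding and decoding comes on top of that and is not 
measured here.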



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
