[GitHub] spark issue #21043: [SPARK-23963] [SQL] Properly handle large number of colu...

Tagar Wed, 18 Apr 2018 09:28:09 -0700

Github user Tagar commented on the issue:

    https://github.com/apache/spark/pull/21043
  
    @gatorsmile could you please backport this to a Spark 2.2 branch as well?
    This PR gives 24x improvement on 6000 columns as @bersprockets discovered, 
so I think this 1-line change should be fairly safely applied to Spark 2.2 as 
well. We see the same performance degradation on wide dataframes in Spark 2.2 
as well. Thanks both!




---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #21043: [SPARK-23963] [SQL] Properly handle large number of colu...

Reply via email to