subject:"spark git commit\: \[SPARK\-17514\] df.take\(1\) and df.limit\(1\).collect\(\) should perform the same in Python"

spark git commit: [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in Python

2016-09-14 Thread davies

Repository: spark Updated Branches: refs/heads/branch-2.0 5493107d9 -> 6fe5972e6 [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in Python ## What changes were proposed in this pull request? In PySpark, `df.take(1)` runs a single-stage job which computes only one

spark git commit: [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in Python

2016-09-14 Thread davies

Repository: spark Updated Branches: refs/heads/master 52738d4e0 -> 6d06ff6f7 [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in Python ## What changes were proposed in this pull request? In PySpark, `df.take(1)` runs a single-stage job which computes only one part