spark git commit: [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in Python

2016-09-14 Thread davies
Repository: spark
Updated Branches:
  refs/heads/branch-2.0 5493107d9 -> 6fe5972e6

[SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in Python

## What changes were proposed in this pull request?

In PySpark, `df.take(1)` runs a single-stage job which computes only one

spark git commit: [SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in Python

2016-09-14 Thread davies
Repository: spark
Updated Branches:
  refs/heads/master 52738d4e0 -> 6d06ff6f7

[SPARK-17514] df.take(1) and df.limit(1).collect() should perform the same in Python

## What changes were proposed in this pull request?

In PySpark, `df.take(1)` runs a single-stage job which computes only one part
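The commit message is cut off in this archive view, so the following is only a toy illustration of why the two calls might cost different amounts. It assumes the truncated sentence refers to partitions, and it assumes that a limit-then-collect path touches every partition before merging; neither assumption is confirmed by the excerpt. Plain Python lists stand in for partitions, and `make_partitions`, `take_incremental`, and `limit_then_collect` are hypothetical names, not Spark APIs.

```python
# Toy model only: plain Python lists stand in for DataFrame partitions.
# Nothing here is Spark code; all function names are hypothetical.

def make_partitions(num_rows, num_partitions):
    """Split range(num_rows) into contiguous chunks."""
    size = -(-num_rows // num_partitions)  # ceiling division
    return [list(range(i, min(i + size, num_rows)))
            for i in range(0, num_rows, size)]

def take_incremental(partitions, n):
    """Scan partitions one at a time and stop once n rows are collected."""
    rows, scanned = [], 0
    for part in partitions:
        if len(rows) >= n:
            break
        scanned += 1
        rows.extend(part[: n - len(rows)])
    return rows, scanned

def limit_then_collect(partitions, n):
    """Take up to n rows from every partition, then merge and truncate."""
    scanned = len(partitions)              # every partition is touched
    local = [part[:n] for part in partitions]
    merged = [row for chunk in local for row in chunk]
    return merged[:n], scanned

parts = make_partitions(1000, 8)
rows_take, scanned_take = take_incremental(parts, 1)
rows_limit, scanned_limit = limit_then_collect(parts, 1)

print(rows_take, scanned_take)    # same row, one partition scanned
print(rows_limit, scanned_limit)  # same row, all eight partitions scanned
```

In this model both paths return the same row, but the second does work proportional to the number of partitions, which is the kind of gap the commit title says should not exist.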