[jira] [Commented] (SPARK-35622) DataFrame's count function do not need groupBy and avoid shuffle

2021-06-15 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17364074#comment-17364074 ] Apache Spark commented on SPARK-35622: User 'StefanXiepj' has created a pull request for this

[jira] [Commented] (SPARK-35622) DataFrame's count function do not need groupBy and avoid shuffle

2021-06-15 Thread xiepengjie (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17363605#comment-17363605 ] xiepengjie commented on SPARK-35622: [~hyukjin.kwon] [~dc-heros], sorry, I was on vacation last week

[jira] [Commented] (SPARK-35622) DataFrame's count function do not need groupBy and avoid shuffle

2021-06-14 Thread dgd_contributor (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17363336#comment-17363336 ] dgd_contributor commented on SPARK-35622: Run a benchmark on my computer, df.rdd.count()

[jira] [Commented] (SPARK-35622) DataFrame's count function do not need groupBy and avoid shuffle

2021-06-13 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17362683#comment-17362683 ] Hyukjin Kwon commented on SPARK-35622: IIRC, it already works the same as, or similarly to, RDD's count.
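
To make the comparison in this comment concrete, here is a toy model in plain Python. It is not Spark code and does not reproduce Spark's internals. It assumes, as a simplification, that an RDD-style count sums per-partition sizes on the driver, while a DataFrame-style count computes one partial count per partition, moves those partial rows to a single partition, and sums them there. The function names and the list-of-lists "partitions" are hypothetical, chosen only for illustration.

```python
from typing import List, Sequence, Tuple


def rdd_style_count(partitions: Sequence[Sequence[object]]) -> int:
    """Toy model: each task returns its partition size, the driver sums them."""
    per_partition_sizes = [sum(1 for _ in part) for part in partitions]
    return sum(per_partition_sizes)


def dataframe_style_count(partitions: Sequence[Sequence[object]]) -> Tuple[int, int]:
    """Toy model: partial count per partition, exchange, then final sum.

    Returns (total_count, rows_exchanged). In this model only one small
    partial-count row per partition is exchanged, never the raw records.
    """
    # Step 1 (assumed): partial aggregation, one row per input partition.
    partial_rows: List[int] = [sum(1 for _ in part) for part in partitions]
    # Step 2 (assumed): move all partial rows into a single partition.
    single_partition = list(partial_rows)
    # Step 3 (assumed): final aggregation over the partial rows.
    return sum(single_partition), len(single_partition)


if __name__ == "__main__":
    data = [[1, 2, 3], [], [4, 5]]
    print(rdd_style_count(data))
    print(dataframe_style_count(data))
```

In this model both paths return the same total, and the only data moved in the second path is one number per partition, which is one way to read "works the same or similarly".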

[jira] [Commented] (SPARK-35622) DataFrame's count function do not need groupBy and avoid shuffle

2021-06-13 Thread dgd_contributor (Jira)
[ https://issues.apache.org/jira/browse/SPARK-35622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17362681#comment-17362681 ] dgd_contributor commented on SPARK-35622: Hi, could you explain more about this issue? Where