[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16712414#comment-16712414
]
Dongjoon Hyun commented on SPARK-20184:
---
If this exists up to 2.4.0, could you update the `Affects
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16611717#comment-16611717
]
Kazuaki Ishizaki commented on SPARK-20184:
--
In {{branch-2.4}}, we still see the performance
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16611502#comment-16611502
]
Kazuaki Ishizaki commented on SPARK-20184:
--
Although I created another JIRA
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16610655#comment-16610655
]
Wenchen Fan commented on SPARK-20184:
-
[~kiszk] did you create another JIRA to replace this one?
>
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15985319#comment-15985319
]
Kazuaki Ishizaki commented on SPARK-20184:
--
When # of the aggregated columns gets large, I saw
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15976897#comment-15976897
]
Kazuaki Ishizaki commented on SPARK-20184:
--
The root cause is overhead in Java code generated by
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15976251#comment-15976251
]
Takeshi Yamamuro commented on SPARK-20184:
--
When #aggregated columns gets large, it seems we get
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15974682#comment-15974682
]
Kazuaki Ishizaki commented on SPARK-20184:
--
I succeeded to reproduce this...
{code}
% git log |
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15966300#comment-15966300
]
Tejas Patil commented on SPARK-20184:
-
Out of curiosity, I tried out a query with ~20 columns
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15965735#comment-15965735
]
Fei Wang commented on SPARK-20184:
--
Also use the master branch to test my test case:
1. Java version
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15965670#comment-15965670
]
Herman van Hovell commented on SPARK-20184:
---
I just tried your example using the master branch,
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15965597#comment-15965597
]
Fei Wang commented on SPARK-20184:
--
try this :
1. create table
[code]
val df = (1 to 50).map(x =>
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15965308#comment-15965308
]
Fei Wang commented on SPARK-20184:
--
Tested with a smaller table 100,000 rows.
Codegen on: 2.6s
Codegen
[
https://issues.apache.org/jira/browse/SPARK-20184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15956420#comment-15956420
]
Kazuaki Ishizaki commented on SPARK-20184:
--
If the number of rows are smaller, how about the
14 matches
Mail list logo