[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-12-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15035164#comment-15035164 ] Xiao Li commented on SPARK-12030: - I did verify the fix using my test cases. It works!

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-12-01 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15034936#comment-15034936 ] Yin Huai commented on SPARK-12030: -- I also merged the patch to branch 1.5. Please note t

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-12-01 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15034652#comment-15034652 ] Yin Huai commented on SPARK-12030: -- Yes, it will be in 1.6.0. > Incorrect results when

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-12-01 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15034641#comment-15034641 ] Maciej Bryński commented on SPARK-12030: Will the fix be included in 1.6.0 ? > I

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-12-01 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15034615#comment-15034615 ] Davies Liu commented on SPARK-12030: I also figured out the root cause last night, th

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-12-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15034141#comment-15034141 ] Apache Spark commented on SPARK-12030: -- User 'nongli' has created a pull request for

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-12-01 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15033356#comment-15033356 ] Xiao Li commented on SPARK-12030: - [~nongli] Thank you very much! Your finding sounds r

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-12-01 Thread Nong Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15033321#comment-15033321 ] Nong Li commented on SPARK-12030: - I think I tracked it down. The bug is from this PR whi

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15032939#comment-15032939 ] Xiao Li commented on SPARK-12030: - Let me post a simple case that can trigger the data co

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-30 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15032927#comment-15032927 ] Yin Huai commented on SPARK-12030: -- [~smilegator] Can you post the case that triggers th

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15032902#comment-15032902 ] Xiao Li commented on SPARK-12030: - I already excluded Exchange and Partitioning. It shoul

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15032887#comment-15032887 ] Xiao Li commented on SPARK-12030: - [SPARK-7542][SQL] Support off-heap index/sort buffer h

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-30 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15032857#comment-15032857 ] Davies Liu commented on SPARK-12030: [~smilegator] Could you post the related PRs her

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15032724#comment-15032724 ] Xiao Li commented on SPARK-12030: - I believe I already found which PRs introduced the reg

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-29 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15031206#comment-15031206 ] Xiao Li commented on SPARK-12030: - I can reproduced a similar issue in a Sort. I think th

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-29 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15031060#comment-15031060 ] Xiao Li commented on SPARK-12030: - [~maver1ck] Yeah, the problem was introduced in 1.6.0.

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-29 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030887#comment-15030887 ] Maciej Bryński commented on SPARK-12030: [~smilegator] I tested 1.5.2 (binaries f

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-29 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030883#comment-15030883 ] Maciej Bryński commented on SPARK-12030: [~smilegator] Problem is not only with d

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030694#comment-15030694 ] Xiao Li commented on SPARK-12030: - I can reproduce it now. Will take a look at it and try

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030652#comment-15030652 ] Xiao Li commented on SPARK-12030: - Thank you! [~maver1ck] That will be great if we can k

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030643#comment-15030643 ] Maciej Bryński commented on SPARK-12030: I tried following things: - disable kryo

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030638#comment-15030638 ] Xiao Li commented on SPARK-12030: - Trying to reproduce it using your parquet files. Thank

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030633#comment-15030633 ] Maciej Bryński commented on SPARK-12030: And spark-defaults.conf: {code} spark.ma

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030632#comment-15030632 ] Maciej Bryński commented on SPARK-12030: When I cache joined the result of distin

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030618#comment-15030618 ] Xiao Li commented on SPARK-12030: - If you cache `joined`, can you see the same issue? >

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread Xiu(Joe) Guo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030615#comment-15030615 ] Xiu(Joe) Guo commented on SPARK-12030: -- I tried your scenario with some TPCDS table

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030612#comment-15030612 ] Maciej Bryński commented on SPARK-12030: id1, id2 and fk1 are integers. > Incorr

[jira] [Commented] (SPARK-12030) Incorrect results when aggregate joined data

2015-11-28 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15030430#comment-15030430 ] Xiao Li commented on SPARK-12030: - What is the data type of id1? > Incorrect results whe