[jira] [Updated] (SPARK-1468) The hash method used by partitionBy in Pyspark doesn't deal with None correctly.

2014-04-10 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Selin updated SPARK-1468: -- Component/s: PySpark Affects Version/s: 0.9.0 Summary: The hash method used by

[jira] [Updated] (SPARK-12089) java.lang.NegativeArraySizeException when growing BufferHolder

2015-12-02 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Selin updated SPARK-12089: --- Description: When running a large spark sql query including multiple joins I see tasks failing with

[jira] [Comment Edited] (SPARK-12089) java.lang.NegativeArraySizeException when growing BufferHolder

2015-12-02 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036015#comment-15036015 ] Erik Selin edited comment on SPARK-12089 at 12/2/15 4:15 PM: - I can make that

[jira] [Comment Edited] (SPARK-12089) java.lang.NegativeArraySizeException when growing BufferHolder

2015-12-02 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036015#comment-15036015 ] Erik Selin edited comment on SPARK-12089 at 12/2/15 4:07 PM: - I can make that

[jira] [Commented] (SPARK-12089) java.lang.NegativeArraySizeException when growing BufferHolder

2015-12-02 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036015#comment-15036015 ] Erik Selin commented on SPARK-12089: I can make that change if it is that easy. I'm just wondering if

[jira] [Updated] (SPARK-12089) java.lang.NegativeArraySizeException when growing BufferHolder

2015-12-02 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Selin updated SPARK-12089: --- Description: When running a large spark sql query including multiple joins I see tasks failing with

[jira] [Commented] (SPARK-12089) java.lang.NegativeArraySizeException when growing BufferHolder

2015-12-02 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036247#comment-15036247 ] Erik Selin commented on SPARK-12089: There shouldn't be a single record larger than 1G no. But I'm

[jira] [Commented] (SPARK-12089) java.lang.NegativeArraySizeException when growing BufferHolder

2015-12-02 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15036327#comment-15036327 ] Erik Selin commented on SPARK-12089: It's a bunch of table joins followed by a group by on multiple

[jira] [Created] (SPARK-12089) java.lang.NegativeArraySizeException when growing BufferHolder

2015-12-01 Thread Erik Selin (JIRA)
Erik Selin created SPARK-12089: -- Summary: java.lang.NegativeArraySizeException when growing BufferHolder Key: SPARK-12089 URL: https://issues.apache.org/jira/browse/SPARK-12089 Project: Spark

[jira] [Created] (SPARK-12268) pyspark shell uses execfile which breaks python3 compatibility

2015-12-10 Thread Erik Selin (JIRA)
Erik Selin created SPARK-12268: -- Summary: pyspark shell uses execfile which breaks python3 compatibility Key: SPARK-12268 URL: https://issues.apache.org/jira/browse/SPARK-12268 Project: Spark

[jira] [Commented] (SPARK-12268) pyspark shell uses execfile which breaks python3 compatibility

2016-01-04 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15081317#comment-15081317 ] Erik Selin commented on SPARK-12268: Any reason to keep this from getting merged? The PR is good to

[jira] [Commented] (SPARK-23855) Performing a Join after a CrossJoin can lead to data corruption

2018-04-13 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437327#comment-16437327 ] Erik Selin commented on SPARK-23855: +1, from our investigations it looks like we've also hit this