[jira] [Created] (SPARK-20632) Allow 'Column.getItem()' API to accept Vector columns

2017-05-07 Thread Kevin Ushey (JIRA)
Kevin Ushey created SPARK-20632: Summary: Allow 'Column.getItem()' API to accept Vector columns Key: SPARK-20632 URL: https://issues.apache.org/jira/browse/SPARK-20632 Project: Spark Issue

[jira] [Comment Edited] (SPARK-14393) monotonicallyIncreasingId not monotonically increasing with downstream coalesce

2016-10-10 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15562887#comment-15562887 ] Kevin Ushey edited comment on SPARK-14393 at 10/10/16 5:27 PM: Am I

[jira] [Commented] (SPARK-14393) monotonicallyIncreasingId not monotonically increasing with downstream coalesce

2016-10-10 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15562888#comment-15562888 ] Kevin Ushey commented on SPARK-14393: Am I correct to understand that I should simply expect

[jira] [Issue Comment Deleted] (SPARK-14393) monotonicallyIncreasingId not monotonically increasing with downstream coalesce

2016-10-10 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Ushey updated SPARK-14393: Comment: was deleted (was: Am I correct to understand that I should simply expect

[jira] [Commented] (SPARK-14393) monotonicallyIncreasingId not monotonically increasing with downstream coalesce

2016-10-10 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15562887#comment-15562887 ] Kevin Ushey commented on SPARK-14393: Am I correct to understand that I should simply expect

[jira] [Created] (SPARK-17833) 'monotonicallyIncreasingId()' should probably be deterministic

2016-10-07 Thread Kevin Ushey (JIRA)
Kevin Ushey created SPARK-17833: Summary: 'monotonicallyIncreasingId()' should probably be deterministic Key: SPARK-17833 URL: https://issues.apache.org/jira/browse/SPARK-17833 Project: Spark

[jira] [Commented] (SPARK-17752) Spark returns incorrect result when 'collect()'ing a cached Dataset with many columns

2016-10-05 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15549759#comment-15549759 ] Kevin Ushey commented on SPARK-17752: It looks like this was tracked + fixed in

[jira] [Commented] (SPARK-17752) Spark returns incorrect result when 'collect()'ing a cached Dataset with many columns

2016-10-03 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15543045#comment-15543045 ] Kevin Ushey commented on SPARK-17752: I can confirm that everything is okay with Spark

[jira] [Updated] (SPARK-17752) Spark returns incorrect result when 'collect()'ing a cached Dataset with many columns

2016-10-01 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Ushey updated SPARK-17752: Description: Run the following code (modify SPARK_HOME to point to a Spark 2.0.0 installation as

[jira] [Updated] (SPARK-17752) Spark returns incorrect result when 'collect()'ing a cached Dataset with many columns

2016-09-30 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Ushey updated SPARK-17752: Description: Run the following code (modify SPARK_HOME to point to a Spark 2.0.0 installation as

[jira] [Updated] (SPARK-17752) Spark returns incorrect result when 'collect()'ing a cached Dataset with many columns

2016-09-30 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Ushey updated SPARK-17752: Description: Run the following code (modify SPARK_HOME to point to a Spark 2.0.0 installation as

[jira] [Updated] (SPARK-17752) Spark returns incorrect result when 'collect()'ing a cached Dataset with many columns

2016-09-30 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Ushey updated SPARK-17752: Description: Run the following code (modify SPARK_HOME to point to a Spark 2.0.0 installation as

[jira] [Created] (SPARK-17752) Spark returns incorrect result when 'collect()'ing a cached Dataset with many columns

2016-09-30 Thread Kevin Ushey (JIRA)
Kevin Ushey created SPARK-17752: Summary: Spark returns incorrect result when 'collect()'ing a cached Dataset with many columns Key: SPARK-17752 URL: https://issues.apache.org/jira/browse/SPARK-17752

[jira] [Commented] (SPARK-12965) Indexer setInputCol() doesn't resolve column names like DataFrame.col()

2016-07-13 Thread Kevin Ushey (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15375995#comment-15375995 ] Kevin Ushey commented on SPARK-12965: I'm also seeing this issue. > Indexer setInputCol() doesn't