[jira] [Commented] (SPARK-1834) NoSuchMethodError when invoking JavaPairRDD.reduce() in Java

2014-08-04 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14085567#comment-14085567 ] Franklyn Dsouza commented on SPARK-1834: There is no reduce function in

[jira] [Created] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-07 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-15811: --- Summary: UDFs do not work in Spark 2.0-preview built with scala 2.10 Key: SPARK-15811 URL: https://issues.apache.org/jira/browse/SPARK-15811 Project: Spark

[jira] [Updated] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-07 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-15811: Description: I've built spark-2.0-preview (8f5a04b) with scala-2.10 using the following

[jira] [Updated] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-07 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-15811: Description: I've built spark-2.0-preview (8f5a04b) with scala-2.10 using the following

[jira] [Updated] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-07 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-15811: Description: I've built spark-2.0-preview (8f5a04b) with scala-2.10 using the following

[jira] [Updated] (SPARK-15811) UDFs do not work in Spark 2.0-preview built with scala 2.10

2016-06-09 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-15811: Shepherd: Davies Liu > UDFs do not work in Spark 2.0-preview built with scala 2.10 >

[jira] [Created] (SPARK-13410) unionAll throws error with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-13410: --- Summary: unionAll throws error with DataFrames containing UDT columns. Key: SPARK-13410 URL: https://issues.apache.org/jira/browse/SPARK-13410 Project: Spark

[jira] [Updated] (SPARK-13410) unionAll throws error with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13410: Description: Unioning two DataFrames that contain UDTs fails with {quote}

[jira] [Updated] (SPARK-13410) unionAll AnalysisException with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13410: Summary: unionAll AnalysisException with DataFrames containing UDT columns. (was:

[jira] [Updated] (SPARK-13410) unionAll throws error with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13410: Description: Unioning two DataFrames that contain UDTs fails with {quote}

[jira] [Updated] (SPARK-13410) unionAll throws error with DataFrames containing UDT columns.

2016-02-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13410: Description: Unioning two DataFrames that contain UDTs fails with {quote}

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to

[jira] [Created] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-14117: --- Summary: write.partitionBy retains partitioning column when outputting Parquet Key: SPARK-14117 URL: https://issues.apache.org/jira/browse/SPARK-14117 Project:

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to

[jira] [Updated] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-14117: Description: When writing a Dataframe as parquet using a partitionBy on the writer to

[jira] [Closed] (SPARK-14117) write.partitionBy retains partitioning column when outputting Parquet

2016-03-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza closed SPARK-14117. --- Resolution: Fixed > write.partitionBy retains partitioning column when outputting Parquet >

[jira] [Updated] (SPARK-13730) Nulls in dataframes getting converted to 0 with spark 2.0 SNAPSHOT

2016-03-08 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13730: Description: Basically I'm putting nulls into a non-nullable LongType column and doing a

[jira] [Updated] (SPARK-13730) Nulls in dataframes getting converted to 0 with spark 2.0 SNAPSHOT

2016-03-07 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-13730: Description: Basically I'm putting nulls into a non-nullable LongType column and doing a

[jira] [Created] (SPARK-13730) Nulls in dataframes getting converted to 0 with spark 2.0 SNAPSHOT

2016-03-07 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-13730: --- Summary: Nulls in dataframes getting converted to 0 with spark 2.0 SNAPSHOT Key: SPARK-13730 URL: https://issues.apache.org/jira/browse/SPARK-13730 Project:

[jira] [Created] (SPARK-16629) UDTs can not be compared to DataTypes in dataframes.

2016-07-19 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-16629: --- Summary: UDTs can not be compared to DataTypes in dataframes. Key: SPARK-16629 URL: https://issues.apache.org/jira/browse/SPARK-16629 Project: Spark

[jira] [Created] (SPARK-19440) Window in pyspark doesn't have attributes unboundedPreceding, unboundedFollowing and currentRow

2017-02-02 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-19440: --- Summary: Window in pyspark doesn't have attributes unboundedPreceding, unboundedFollowing and currentRow Key: SPARK-19440 URL:

[jira] [Created] (SPARK-19388) Reading an empty folder as parquet causes an Analysis Exception

2017-01-27 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-19388: --- Summary: Reading an empty folder as parquet causes an Analysis Exception Key: SPARK-19388 URL: https://issues.apache.org/jira/browse/SPARK-19388 Project: Spark

[jira] [Closed] (SPARK-19388) Reading an empty folder as parquet causes an Analysis Exception

2017-01-27 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza closed SPARK-19388. --- Resolution: Fixed > Reading an empty folder as parquet causes an Analysis Exception >

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and

[jira] [Created] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-19 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-19299: --- Summary: Nulls in non nullable columns causes data corruption in parquet Key: SPARK-19299 URL: https://issues.apache.org/jira/browse/SPARK-19299 Project: Spark

[jira] [Commented] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15832048#comment-15832048 ] Franklyn Dsouza commented on SPARK-19299: - These issues also are very likely reproducible in

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Priority: Critical (was: Major) > Nulls in non nullable columns causes data corruption in

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a no-nullable field and

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a non-nullable field and

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Summary: Nulls in non nullable columns causes data corruption in parquet (was: Nulls in

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns cause data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Summary: Nulls in non nullable columns cause data corruption in parquet (was: Nulls in

[jira] [Updated] (SPARK-19299) Nulls in non nullable columns causes data corruption in parquet

2017-01-20 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-19299: Description: The problem we're seeing is that if a null occurs in a non-nullable field and

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:24 PM: --- The

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:22 PM: --- The

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:37 PM: --- The

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:21 PM: --- The

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:29 PM: --- The

[jira] [Comment Edited] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761741#comment-15761741 ] Franklyn Dsouza edited comment on SPARK-18589 at 12/19/16 5:20 PM: --- The

[jira] [Commented] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-12-19 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15761741#comment-15761741 ] Franklyn Dsouza commented on SPARK-18589: - The sequence of steps that causes this are: {code}

[jira] [Created] (SPARK-19844) UDF in when control function is executed before the when clause is evaluated.

2017-03-06 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-19844: --- Summary: UDF in when control function is executed before the when clause is evaluated. Key: SPARK-19844 URL: https://issues.apache.org/jira/browse/SPARK-19844

[jira] [Created] (SPARK-21199) Its not possible to impute Vector types

2017-06-23 Thread Franklyn Dsouza (JIRA)
Franklyn Dsouza created SPARK-21199: --- Summary: Its not possible to impute Vector types Key: SPARK-21199 URL: https://issues.apache.org/jira/browse/SPARK-21199 Project: Spark Issue Type:

[jira] [Commented] (SPARK-12806) Support SQL expressions extracting values from VectorUDT

2017-06-23 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16061718#comment-16061718 ] Franklyn Dsouza commented on SPARK-12806: - [~mengxr] can we get this fixed ? > Support SQL

[jira] [Updated] (SPARK-21199) Its not possible to impute Vector types

2017-06-23 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Franklyn Dsouza updated SPARK-21199: Description: There are cases where nulls end up in vector columns in dataframes. Currently

[jira] [Comment Edited] (SPARK-21199) Its not possible to impute Vector types

2017-06-26 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16063416#comment-16063416 ] Franklyn Dsouza edited comment on SPARK-21199 at 6/26/17 5:16 PM: -- For

[jira] [Comment Edited] (SPARK-21199) Its not possible to impute Vector types

2017-06-26 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16063416#comment-16063416 ] Franklyn Dsouza edited comment on SPARK-21199 at 6/26/17 5:16 PM: -- For

[jira] [Commented] (SPARK-21199) Its not possible to impute Vector types

2017-06-26 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16063416#comment-16063416 ] Franklyn Dsouza commented on SPARK-21199: - For this particular scenario I have a table with two

[jira] [Comment Edited] (SPARK-21199) Its not possible to impute Vector types

2017-06-26 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16063416#comment-16063416 ] Franklyn Dsouza edited comment on SPARK-21199 at 6/26/17 5:15 PM: -- For

[jira] [Comment Edited] (SPARK-12806) Support SQL expressions extracting values from VectorUDT

2017-06-24 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16061718#comment-16061718 ] Franklyn Dsouza edited comment on SPARK-12806 at 6/24/17 2:54 PM: --

[jira] [Comment Edited] (SPARK-12806) Support SQL expressions extracting values from VectorUDT

2017-06-25 Thread Franklyn Dsouza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16061718#comment-16061718 ] Franklyn Dsouza edited comment on SPARK-12806 at 6/25/17 11:32 PM: ---