[jira] [Commented] (SPARK-5722) Infer_schema_type incorrect for Integers in pyspark

2015-02-11 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14316940#comment-14316940 ] Don Drake commented on SPARK-5722: -- Hi, I've submitted 2 pull requests for branch-1.2 and

[jira] [Updated] (SPARK-5722) Infer_schema_type incorrect for Integers in pyspark

2015-02-10 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Don Drake updated SPARK-5722: - Description: The Integers datatype in Python does not match what a Scala/Java integer is defined as.

[jira] [Updated] (SPARK-5722) Infer_schema_type incorrect for Integers in pyspark

2015-02-10 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Don Drake updated SPARK-5722: - Summary: Infer_schema_type incorrect for Integers in pyspark (was: Infer_schma_type incorrect for

[jira] [Created] (SPARK-5722) Infer_schma_type incorrect for Integers in pyspark

2015-02-10 Thread Don Drake (JIRA)
Don Drake created SPARK-5722: Summary: Infer_schma_type incorrect for Integers in pyspark Key: SPARK-5722 URL: https://issues.apache.org/jira/browse/SPARK-5722 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-7182) [SQL] Can't remove columns from DataFrame or save DataFrame from a join due to duplicate columns

2015-04-27 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Don Drake updated SPARK-7182: - Summary: [SQL] Can't remove columns from DataFrame or save DataFrame from a join due to duplicate columns

[jira] [Updated] (SPARK-7182) [SQL] Can't remove or save DataFrame from a join due to duplicate columns

2015-04-27 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Don Drake updated SPARK-7182: - Description: I'm having trouble saving a dataframe as parquet after performing a simple table join.

[jira] [Created] (SPARK-7182) [SQL] Can't remove or save DataFrame from a join due to duplicate columns

2015-04-27 Thread Don Drake (JIRA)
Don Drake created SPARK-7182: Summary: [SQL] Can't remove or save DataFrame from a join due to duplicate columns Key: SPARK-7182 URL: https://issues.apache.org/jira/browse/SPARK-7182 Project: Spark

[jira] [Created] (SPARK-7781) GradientBoostedTrees.trainRegressor is missing maxBins parameter in pyspark

2015-05-20 Thread Don Drake (JIRA)
Don Drake created SPARK-7781: Summary: GradientBoostedTrees.trainRegressor is missing maxBins parameter in pyspark Key: SPARK-7781 URL: https://issues.apache.org/jira/browse/SPARK-7781 Project: Spark

[jira] [Commented] (SPARK-8368) ClassNotFoundException in closure for map

2015-06-22 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14596001#comment-14596001 ] Don Drake commented on SPARK-8368: -- I've verified through a nightly build that this

[jira] [Created] (SPARK-8365) pyspark does not retain --packages or --jars passed on the command line as of 1.4.0

2015-06-14 Thread Don Drake (JIRA)
Don Drake created SPARK-8365: Summary: pyspark does not retain --packages or --jars passed on the command line as of 1.4.0 Key: SPARK-8365 URL: https://issues.apache.org/jira/browse/SPARK-8365 Project:

[jira] [Commented] (SPARK-8365) pyspark does not retain --packages or --jars passed on the command line as of 1.4.0

2015-06-17 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14590290#comment-14590290 ] Don Drake commented on SPARK-8365: -- Is there a workaround that you are aware of?

[jira] [Commented] (SPARK-11085) Add support for HTTP proxy

2015-10-13 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14954989#comment-14954989 ] Don Drake commented on SPARK-11085: --- Neither of the options work. > Add support for HTTP proxy >

[jira] [Commented] (SPARK-10441) Cannot write timestamp to JSON

2015-09-08 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14735690#comment-14735690 ] Don Drake commented on SPARK-10441: --- I see that PR 8597 was merged into master. Does master represent

[jira] [Commented] (SPARK-10441) Cannot write timestamp to JSON

2015-09-09 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14736986#comment-14736986 ] Don Drake commented on SPARK-10441: --- Got it, thanks for the clarification. > Cannot write timestamp to

[jira] [Commented] (SPARK-15467) Getting stack overflow when attempting to query a wide Dataset (>200 fields)

2016-05-22 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15295512#comment-15295512 ] Don Drake commented on SPARK-15467: --- Vishnu, the 22 field limitation is with Scala 2.10.x, Spark 2.0

[jira] [Created] (SPARK-15467) Getting stack overflow when attempting to query a wide Dataset (>200 fields)

2016-05-21 Thread Don Drake (JIRA)
Don Drake created SPARK-15467: - Summary: Getting stack overflow when attempting to query a wide Dataset (>200 fields) Key: SPARK-15467 URL: https://issues.apache.org/jira/browse/SPARK-15467 Project:

[jira] [Reopened] (SPARK-19477) [SQL] Datasets created from a Dataframe with extra columns retain the extra columns

2017-02-07 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Don Drake reopened SPARK-19477: --- I'm struggling with this answer. I thought the point of Datasets, was to have a strongly typed

[jira] [Created] (SPARK-19477) [SQL] Datasets created from a Dataframe with extra columns retain the extra columns

2017-02-06 Thread Don Drake (JIRA)
Don Drake created SPARK-19477: - Summary: [SQL] Datasets created from a Dataframe with extra columns retain the extra columns Key: SPARK-19477 URL: https://issues.apache.org/jira/browse/SPARK-19477

[jira] [Commented] (SPARK-19477) [SQL] Datasets created from a Dataframe with extra columns retain the extra columns

2017-02-11 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15862560#comment-15862560 ] Don Drake commented on SPARK-19477: --- How does lazy apply here? If I read/create a dataframe with extra

[jira] [Commented] (SPARK-17341) Can't read Parquet data with fields containing periods "."

2016-08-31 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15453806#comment-15453806 ] Don Drake commented on SPARK-17341: --- I just downloaded the nightly build from 8/31/2016 and gave it a

[jira] [Commented] (SPARK-13721) Add support for LATERAL VIEW OUTER explode()

2016-09-02 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15459490#comment-15459490 ] Don Drake commented on SPARK-13721: --- My nested structures aren't simple types, they are structs (case

[jira] [Created] (SPARK-17384) SQL - Running query with outer join from 1.6 fails

2016-09-02 Thread Don Drake (JIRA)
Don Drake created SPARK-17384: - Summary: SQL - Running query with outer join from 1.6 fails Key: SPARK-17384 URL: https://issues.apache.org/jira/browse/SPARK-17384 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-17341) Can't read Parquet data with fields containing periods "."

2016-08-31 Thread Don Drake (JIRA)
Don Drake created SPARK-17341: - Summary: Can't read Parquet data with fields containing periods "." Key: SPARK-17341 URL: https://issues.apache.org/jira/browse/SPARK-17341 Project: Spark Issue

[jira] [Commented] (SPARK-13721) Add support for LATERAL VIEW OUTER explode()

2016-08-31 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15452846#comment-15452846 ] Don Drake commented on SPARK-13721: --- Spark 2.0 has deprecated this function, what workarounds are

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-03 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15543555#comment-15543555 ] Don Drake commented on SPARK-16845: --- I just hit this bug as well. Are there any suggested workarounds?

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-19 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15590256#comment-15590256 ] Don Drake commented on SPARK-16845: --- [~lwlin] I saw your PR, but noticed it's failing some tests. Just

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-22 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15598442#comment-15598442 ] Don Drake commented on SPARK-16845: --- Update: It turns out that I am still getting this exception.

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-20 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15592862#comment-15592862 ] Don Drake commented on SPARK-16845: --- I compiled your branch and ran my large job and it finished

[jira] [Commented] (SPARK-18207) class "org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB

2016-11-12 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15659978#comment-15659978 ] Don Drake commented on SPARK-18207: --- Hi, I was able to download a nightly SNAPSHOT release and verify

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-27 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15614100#comment-15614100 ] Don Drake commented on SPARK-16845: --- I'm struggling to get a simple case created. I'm curious though,

[jira] [Updated] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-27 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Don Drake updated SPARK-16845: -- Attachment: error.txt.zip Does this generated code help in resolving this? >

[jira] [Created] (SPARK-18207) class "org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB

2016-11-01 Thread Don Drake (JIRA)
Don Drake created SPARK-18207: - Summary: class "org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB Key: SPARK-18207 URL:

[jira] [Updated] (SPARK-18207) class "org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB

2016-11-01 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Don Drake updated SPARK-18207: -- Description: I have 2 wide dataframes that contain nested data structures, when I explode one of the

[jira] [Updated] (SPARK-18207) class "org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB

2016-11-01 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Don Drake updated SPARK-18207: -- Attachment: spark-18207.txt Please read the comments at the top of the attachment, you need to :paste

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-11-01 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626664#comment-15626664 ] Don Drake commented on SPARK-16845: --- I've been struggling to duplicate this and finally came up with a

[jira] [Commented] (SPARK-18207) class "org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB

2016-11-02 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15629140#comment-15629140 ] Don Drake commented on SPARK-18207: --- I opened it based on [~lwlin]'s suggestion in the comments of

[jira] [Comment Edited] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-11-01 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15626664#comment-15626664 ] Don Drake edited comment on SPARK-16845 at 11/2/16 12:32 AM: - I've been

[jira] [Commented] (SPARK-18207) class "org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificUnsafeProjection" grows beyond 64 KB

2016-11-02 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15629177#comment-15629177 ] Don Drake commented on SPARK-18207: --- The difference with my case versus the other test cases is that my

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-11 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15565592#comment-15565592 ] Don Drake commented on SPARK-16845: --- Unfortunately, it does not work around it. 16/10/10 18:19:47

[jira] [Commented] (SPARK-16845) org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificOrdering" grows beyond 64 KB

2016-10-11 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15567358#comment-15567358 ] Don Drake commented on SPARK-16845: --- I can't at the moment, mine is not simple. But this JIRA has

[jira] [Commented] (SPARK-16745) Spark job completed however have to wait for 13 mins (data size is small)

2017-03-24 Thread Don Drake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15940654#comment-15940654 ] Don Drake commented on SPARK-16745: --- I just came across the same exception running Spark 2.1.0 on my