[jira] [Updated] (SPARK-24216) Spark TypedAggregateExpression uses getSimpleName that is not safe in scala

2018-05-08 Thread Fangshi Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fangshi Li updated SPARK-24216: --- Description: When we create a aggregator object within a function in scala and pass the aggregator

[jira] [Assigned] (SPARK-24219) Improve the docker build script to avoid copying everything in example

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24219: Assignee: Apache Spark > Improve the docker build script to avoid copying everything in

[jira] [Assigned] (SPARK-24219) Improve the docker build script to avoid copying everything in example

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24219: Assignee: (was: Apache Spark) > Improve the docker build script to avoid copying

[jira] [Commented] (SPARK-24219) Improve the docker build script to avoid copying everything in example

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468408#comment-16468408 ] Apache Spark commented on SPARK-24219: -- User 'jerryshao' has created a pull request for this issue:

[jira] [Created] (SPARK-24219) Improve the docker build script to avoid copying everything in example

2018-05-08 Thread Saisai Shao (JIRA)
Saisai Shao created SPARK-24219: --- Summary: Improve the docker build script to avoid copying everything in example Key: SPARK-24219 URL: https://issues.apache.org/jira/browse/SPARK-24219 Project: Spark

[jira] [Created] (SPARK-24218) Allow Configuration of DynamoDbEndpointUrl in KinesisReceiver

2018-05-08 Thread Cole Murray (JIRA)
Cole Murray created SPARK-24218: --- Summary: Allow Configuration of DynamoDbEndpointUrl in KinesisReceiver Key: SPARK-24218 URL: https://issues.apache.org/jira/browse/SPARK-24218 Project: Spark

[jira] [Assigned] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some nodes.

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24217: Assignee: Apache Spark > Power Iteration Clustering is not displaying cluster indices

[jira] [Commented] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some nodes.

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468337#comment-16468337 ] Apache Spark commented on SPARK-24217: -- User 'shahidki31' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some nodes.

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24217: Assignee: (was: Apache Spark) > Power Iteration Clustering is not displaying cluster

[jira] [Commented] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some nodes.

2018-05-08 Thread spark_user (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468338#comment-16468338 ] spark_user commented on SPARK-24217: I am working on this issue > Power Iteration Clustering is not

[jira] [Created] (SPARK-24217) Power Iteration Clustering is not displaying cluster indices corresponding to some nodes.

2018-05-08 Thread spark_user (JIRA)
spark_user created SPARK-24217: -- Summary: Power Iteration Clustering is not displaying cluster indices corresponding to some nodes. Key: SPARK-24217 URL: https://issues.apache.org/jira/browse/SPARK-24217

[jira] [Resolved] (SPARK-23972) Upgrade to Parquet 1.10

2018-05-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23972. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21070

[jira] [Assigned] (SPARK-23972) Upgrade to Parquet 1.10

2018-05-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23972: --- Assignee: Ryan Blue > Upgrade to Parquet 1.10 > --- > >

[jira] [Assigned] (SPARK-24132) Instrumentation improvement for classification

2018-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-24132: - Assignee: Lu Wang > Instrumentation improvement for classification >

[jira] [Resolved] (SPARK-24132) Instrumentation improvement for classification

2018-05-08 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-24132. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 21204

[jira] [Commented] (SPARK-21033) fix the potential OOM in UnsafeExternalSorter

2018-05-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468231#comment-16468231 ] Wenchen Fan commented on SPARK-21033: - The followup is just a code cleanup, I think we should not

[jira] [Commented] (SPARK-24208) Cannot resolve column in self join after applying Pandas UDF

2018-05-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468190#comment-16468190 ] Hyukjin Kwon commented on SPARK-24208: -- Just for clarification, is it specific to Pandas UDF? >

[jira] [Assigned] (SPARK-24068) CSV schema inferring doesn't work for compressed files

2018-05-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-24068: Assignee: Maxim Gekk > CSV schema inferring doesn't work for compressed files >

[jira] [Resolved] (SPARK-24068) CSV schema inferring doesn't work for compressed files

2018-05-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24068. -- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21182

[jira] [Assigned] (SPARK-24216) Spark TypedAggregateExpression uses getSimpleName that is not safe in scala

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24216: Assignee: (was: Apache Spark) > Spark TypedAggregateExpression uses getSimpleName

[jira] [Assigned] (SPARK-24216) Spark TypedAggregateExpression uses getSimpleName that is not safe in scala

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24216: Assignee: Apache Spark > Spark TypedAggregateExpression uses getSimpleName that is not

[jira] [Commented] (SPARK-24216) Spark TypedAggregateExpression uses getSimpleName that is not safe in scala

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468146#comment-16468146 ] Apache Spark commented on SPARK-24216: -- User 'fangshil' has created a pull request for this issue:

[jira] [Created] (SPARK-24216) Spark TypedAggregateExpression uses getSimpleName that is not safe in scala

2018-05-08 Thread Fangshi Li (JIRA)
Fangshi Li created SPARK-24216: -- Summary: Spark TypedAggregateExpression uses getSimpleName that is not safe in scala Key: SPARK-24216 URL: https://issues.apache.org/jira/browse/SPARK-24216 Project:

[jira] [Updated] (SPARK-24215) Implement __repr__ and _repr_html_ for dataframes in PySpark

2018-05-08 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-24215: Target Version/s: 2.4.0 > Implement __repr__ and _repr_html_ for dataframes in PySpark >

[jira] [Updated] (SPARK-24215) Implement __repr__ and _repr_html_ for dataframes in PySpark

2018-05-08 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-24215: Component/s: Spark Core > Implement __repr__ and _repr_html_ for dataframes in PySpark >

[jira] [Updated] (SPARK-24215) Implement __repr__ and _repr_html_ for dataframes in PySpark

2018-05-08 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-24215: Fix Version/s: (was: 2.4.0) > Implement __repr__ and _repr_html_ for dataframes in PySpark >

[jira] [Updated] (SPARK-24215) Implement __repr__ and _repr_html_ for dataframes in PySpark

2018-05-08 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated SPARK-24215: -- Description: To help people that are new to Spark get feedback more easily, we should implement the

[jira] [Updated] (SPARK-24215) Implement __repr__ and _repr_html_ for dataframes in PySpark

2018-05-08 Thread Ryan Blue (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ryan Blue updated SPARK-24215: -- Description: To help people that are new to Spark get feedback more easily, we should implement the

[jira] [Commented] (SPARK-14146) Imported implicits can't be found in Spark REPL in some cases

2018-05-08 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468094#comment-16468094 ] Shixiong Zhu commented on SPARK-14146: -- [~ashawley] Yeah, we can close this ticket when we upgrade

[jira] [Created] (SPARK-24215) Implement __repr__ and _repr_html_ for dataframes in PySpark

2018-05-08 Thread Ryan Blue (JIRA)
Ryan Blue created SPARK-24215: - Summary: Implement __repr__ and _repr_html_ for dataframes in PySpark Key: SPARK-24215 URL: https://issues.apache.org/jira/browse/SPARK-24215 Project: Spark

[jira] [Comment Edited] (SPARK-23945) Column.isin() should accept a single-column DataFrame as input

2018-05-08 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468045#comment-16468045 ] Nicholas Chammas edited comment on SPARK-23945 at 5/8/18 10:22 PM: ---

[jira] [Commented] (SPARK-23945) Column.isin() should accept a single-column DataFrame as input

2018-05-08 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468045#comment-16468045 ] Nicholas Chammas commented on SPARK-23945: -- > So in the grand scheme of things I'd expect

[jira] [Comment Edited] (SPARK-23945) Column.isin() should accept a single-column DataFrame as input

2018-05-08 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16433316#comment-16433316 ] Nicholas Chammas edited comment on SPARK-23945 at 5/8/18 10:13 PM: --- I

[jira] [Assigned] (SPARK-24214) StreamingRelationV2/StreamingExecutionRelation/ContinuousExecutionRelation.toJSON should not fail

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24214: Assignee: Apache Spark (was: Shixiong Zhu) >

[jira] [Commented] (SPARK-24214) StreamingRelationV2/StreamingExecutionRelation/ContinuousExecutionRelation.toJSON should not fail

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468031#comment-16468031 ] Apache Spark commented on SPARK-24214: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24214) StreamingRelationV2/StreamingExecutionRelation/ContinuousExecutionRelation.toJSON should not fail

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24214: Assignee: Shixiong Zhu (was: Apache Spark) >

[jira] [Created] (SPARK-24214) StreamingRelationV2/StreamingExecutionRelation/ContinuousExecutionRelation.toJSON should not fail

2018-05-08 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-24214: Summary: StreamingRelationV2/StreamingExecutionRelation/ContinuousExecutionRelation.toJSON should not fail Key: SPARK-24214 URL:

[jira] [Commented] (SPARK-24213) Power Iteration Clustering in the SparkML throws exception, when the ID is IntType

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468022#comment-16468022 ] Apache Spark commented on SPARK-24213: -- User 'jkbradley' has created a pull request for this issue:

[jira] [Commented] (SPARK-24213) Power Iteration Clustering in the SparkML throws exception, when the ID is IntType

2018-05-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468018#comment-16468018 ] Joseph K. Bradley commented on SPARK-24213: --- Thanks for reporting this issue! There is

[jira] [Commented] (SPARK-17916) CSV data source treats empty string as null no matter what nullValue option is

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467998#comment-16467998 ] Apache Spark commented on SPARK-17916: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Updated] (SPARK-24213) Power Iteration Clustering in the SparkML throws exception, when the ID is IntType

2018-05-08 Thread spark_user (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] spark_user updated SPARK-24213: --- Summary: Power Iteration Clustering in the SparkML throws exception, when the ID is IntType (was:

[jira] [Updated] (SPARK-24213) Power Iteration Clustering in SparkML throws exception, when the ID is IntType

2018-05-08 Thread spark_user (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] spark_user updated SPARK-24213: --- Summary: Power Iteration Clustering in SparkML throws exception, when the ID is IntType (was: Power

[jira] [Updated] (SPARK-24191) SparkML: Example code for Power Iteration Clustering

2018-05-08 Thread spark_user (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] spark_user updated SPARK-24191: --- Fix Version/s: 2.4.0 > SparkML: Example code for Power Iteration Clustering >

[jira] [Updated] (SPARK-24213) Power Iteration Clustering in SparkML throws exception, when the ID in IntType

2018-05-08 Thread spark_user (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] spark_user updated SPARK-24213: --- Environment: (was: {code:java} {code} ) > Power Iteration Clustering in SparkML throws

[jira] [Assigned] (SPARK-24213) Power Iteration Clustering in SparkML throws exception, when the ID in IntType

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24213: Assignee: Apache Spark > Power Iteration Clustering in SparkML throws exception, when the

[jira] [Commented] (SPARK-24213) Power Iteration Clustering in SparkML throws exception, when the ID in IntType

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467944#comment-16467944 ] Apache Spark commented on SPARK-24213: -- User 'shahidki31' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24213) Power Iteration Clustering in SparkML throws exception, when the ID in IntType

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24213: Assignee: (was: Apache Spark) > Power Iteration Clustering in SparkML throws

[jira] [Commented] (SPARK-24213) Power Iteration Clustering in SparkML throws exception, when the ID in IntType

2018-05-08 Thread spark_user (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467943#comment-16467943 ] spark_user commented on SPARK-24213: Currently I am working on this issue. > Power Iteration

[jira] [Created] (SPARK-24213) Power Iteration Clustering in SparkML throws exception, when the ID in IntType

2018-05-08 Thread spark_user (JIRA)
spark_user created SPARK-24213: -- Summary: Power Iteration Clustering in SparkML throws exception, when the ID in IntType Key: SPARK-24213 URL: https://issues.apache.org/jira/browse/SPARK-24213 Project:

[jira] [Commented] (SPARK-21033) fix the potential OOM in UnsafeExternalSorter

2018-05-08 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467803#comment-16467803 ] Thomas Graves commented on SPARK-21033: --- [~cloud_fan] the followup PR

[jira] [Created] (SPARK-24212) PrefixSpan in spark.ml: user guide section

2018-05-08 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-24212: - Summary: PrefixSpan in spark.ml: user guide section Key: SPARK-24212 URL: https://issues.apache.org/jira/browse/SPARK-24212 Project: Spark Issue

[jira] [Closed] (SPARK-24145) spark.ml parity for sequential pattern mining - PrefixSpan: Python API

2018-05-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-24145. - > spark.ml parity for sequential pattern mining - PrefixSpan: Python API >

[jira] [Resolved] (SPARK-24145) spark.ml parity for sequential pattern mining - PrefixSpan: Python API

2018-05-08 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-24145. --- Resolution: Duplicate > spark.ml parity for sequential pattern mining - PrefixSpan:

[jira] [Created] (SPARK-24211) Flaky test: StreamingOuterJoinSuite

2018-05-08 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-24211: - Summary: Flaky test: StreamingOuterJoinSuite Key: SPARK-24211 URL: https://issues.apache.org/jira/browse/SPARK-24211 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-23355) convertMetastore should not ignore table properties

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467610#comment-16467610 ] Apache Spark commented on SPARK-23355: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Resolved] (SPARK-24117) Unified the getSizePerRow

2018-05-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-24117. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21189

[jira] [Assigned] (SPARK-24117) Unified the getSizePerRow

2018-05-08 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-24117: --- Assignee: Yuming Wang > Unified the getSizePerRow > - > >

[jira] [Commented] (SPARK-23894) Flaky Test: BucketedWriteWithoutHiveSupportSuite

2018-05-08 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467570#comment-16467570 ] Imran Rashid commented on SPARK-23894: -- After discussion in related PRs, SPARK-22938 should cover

[jira] [Created] (SPARK-24210) incorrect handling of boolean expressions when using column in expressions in pyspark.sql.DataFrame filter function

2018-05-08 Thread Michael H (JIRA)
Michael H created SPARK-24210: - Summary: incorrect handling of boolean expressions when using column in expressions in pyspark.sql.DataFrame filter function Key: SPARK-24210 URL:

[jira] [Resolved] (SPARK-24112) Add `spark.sql.hive.convertMetastoreTableProperty` for backward compatiblility

2018-05-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-24112. --- Resolution: Won't Fix > Add `spark.sql.hive.convertMetastoreTableProperty` for backward

[jira] [Comment Edited] (SPARK-24112) Add `spark.sql.hive.convertMetastoreTableProperty` for backward compatiblility

2018-05-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467534#comment-16467534 ] Dongjoon Hyun edited comment on SPARK-24112 at 5/8/18 3:05 PM: --- Based on

[jira] [Commented] (SPARK-24112) Add `spark.sql.hive.convertMetastoreTableProperty` for backward compatiblility

2018-05-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24112?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467534#comment-16467534 ] Dongjoon Hyun commented on SPARK-24112: --- Based on the decion on PR, I'll close this. > Add

[jira] [Commented] (SPARK-19680) Offsets out of range with no configured reset policy for partitions

2018-05-08 Thread Marcin Kuthan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467498#comment-16467498 ] Marcin Kuthan commented on SPARK-19680: --- We have started migration our spark streaming job from 1.6

[jira] [Comment Edited] (SPARK-19512) codegen for compare structs fails

2018-05-08 Thread howie yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467403#comment-16467403 ] howie yu edited comment on SPARK-19512 at 5/8/18 1:18 PM: -- Hi I think this is

[jira] [Commented] (SPARK-19512) codegen for compare structs fails

2018-05-08 Thread howie yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467403#comment-16467403 ] howie yu commented on SPARK-19512: -- Hi I think this is similar issue, may not the same. I have

[jira] [Assigned] (SPARK-24209) 0 configuration Knox gateway support in SHS

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24209: Assignee: Apache Spark > 0 configuration Knox gateway support in SHS >

[jira] [Commented] (SPARK-24209) 0 configuration Knox gateway support in SHS

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467377#comment-16467377 ] Apache Spark commented on SPARK-24209: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24209) 0 configuration Knox gateway support in SHS

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24209: Assignee: (was: Apache Spark) > 0 configuration Knox gateway support in SHS >

[jira] [Created] (SPARK-24209) 0 configuration Knox gateway support in SHS

2018-05-08 Thread Marco Gaido (JIRA)
Marco Gaido created SPARK-24209: --- Summary: 0 configuration Knox gateway support in SHS Key: SPARK-24209 URL: https://issues.apache.org/jira/browse/SPARK-24209 Project: Spark Issue Type:

[jira] [Updated] (SPARK-24208) Cannot resolve column in self join after applying Pandas UDF

2018-05-08 Thread Rafal Ganczarek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rafal Ganczarek updated SPARK-24208: Description: I noticed that after applying Pandas UDF function, a self join of resulted

[jira] [Created] (SPARK-24208) Cannot resolve column in self join after applying Pandas UDF

2018-05-08 Thread Rafal Ganczarek (JIRA)
Rafal Ganczarek created SPARK-24208: --- Summary: Cannot resolve column in self join after applying Pandas UDF Key: SPARK-24208 URL: https://issues.apache.org/jira/browse/SPARK-24208 Project: Spark

[jira] [Updated] (SPARK-24202) Separate SQLContext dependency from SparkSession.implicits

2018-05-08 Thread Gerard Maas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gerard Maas updated SPARK-24202: Description: The current implementation of the implicits in SparkSession passes the current

[jira] [Updated] (SPARK-24202) Separate SQLContext dependency from SparkSession.implicits

2018-05-08 Thread Gerard Maas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gerard Maas updated SPARK-24202: Summary: Separate SQLContext dependency from SparkSession.implicits (was: Separate SQLContext

[jira] [Resolved] (SPARK-24076) very bad performance when shuffle.partition = 8192

2018-05-08 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-24076. --- Resolution: Fixed Assignee: yucai Fix Version/s: 2.4.0 > very bad

[jira] [Commented] (SPARK-10925) Exception when joining DataFrames

2018-05-08 Thread howie yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467142#comment-16467142 ] howie yu commented on SPARK-10925: -- Same issue on Spark 2.3.0. Add checkpoint still have same error >

[jira] [Commented] (SPARK-19181) SparkListenerSuite.local metrics fails when average executorDeserializeTime is too short.

2018-05-08 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467137#comment-16467137 ] Attila Zsolt Piros commented on SPARK-19181: I am working on this. >

[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-05-08 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467056#comment-16467056 ] Takeshi Yamamuro commented on SPARK-21274: -- yea, I'm interested in the performance differences

[jira] [Assigned] (SPARK-21945) pyspark --py-files doesn't work in yarn client mode

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21945: Assignee: (was: Apache Spark) > pyspark --py-files doesn't work in yarn client mode >

[jira] [Assigned] (SPARK-21945) pyspark --py-files doesn't work in yarn client mode

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21945?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21945: Assignee: Apache Spark > pyspark --py-files doesn't work in yarn client mode >

[jira] [Commented] (SPARK-21945) pyspark --py-files doesn't work in yarn client mode

2018-05-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467052#comment-16467052 ] Apache Spark commented on SPARK-21945: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Commented] (SPARK-19512) codegen for compare structs fails

2018-05-08 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467043#comment-16467043 ] Takeshi Yamamuro commented on SPARK-19512: -- I checked in the released v2.3.0 and the master

[jira] [Commented] (SPARK-24201) IllegalArgumentException originating from ClosureCleaner in Java 9+

2018-05-08 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467023#comment-16467023 ] Takeshi Yamamuro commented on SPARK-24201: -- IIUC spark doesn't support java9+ (the doc should be

[jira] [Resolved] (SPARK-24188) /api/v1/version not working

2018-05-08 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao resolved SPARK-24188. - Resolution: Fixed Fix Version/s: 2.3.1 2.4.0 Issue resolved by pull

[jira] [Assigned] (SPARK-24188) /api/v1/version not working

2018-05-08 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao reassigned SPARK-24188: --- Assignee: Marcelo Vanzin > /api/v1/version not working > --- > >

[jira] [Commented] (SPARK-24200) Read subdirectories with out asterisks

2018-05-08 Thread kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466948#comment-16466948 ] kumar commented on SPARK-24200: --- This is not question for how, it's an improvement suggestion, i found a

[jira] [Updated] (SPARK-24200) Read subdirectories with out asterisks

2018-05-08 Thread kumar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] kumar updated SPARK-24200: -- Description: String folder = "/Users/test/data/* /* "; sparkContext.textFile(folder, 1).toJavaRDD() Is

[jira] [Created] (SPARK-24207) PrefixSpan: R API

2018-05-08 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-24207: Summary: PrefixSpan: R API Key: SPARK-24207 URL: https://issues.apache.org/jira/browse/SPARK-24207 Project: Spark Issue Type: Sub-task Components:

[jira] [Commented] (SPARK-23780) Failed to use googleVis library with new SparkR

2018-05-08 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16466920#comment-16466920 ] Felix Cheung commented on SPARK-23780: -- I suppose if you load googleVis first and then SparkR it