[jira] [Commented] (SPARK-15507) ClassCastException: SomeCaseClass cannot be cast to org.apache.spark.sql.Row

2016-06-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315331#comment-15315331 ] Reynold Xin commented on SPARK-15507: - Koert in 2.0, isn't this just rdd.toDS and then you get a

[jira] [Resolved] (SPARK-15756) Support command 'create table stored as orcfile/parquetfile/avrofile'

2016-06-03 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-15756. - Resolution: Fixed Assignee: Lianhui Wang Fix Version/s: 2.0.0 > Support command

[jira] [Assigned] (SPARK-15766) R should export is.nan

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15766: Assignee: Apache Spark > R should export is.nan > -- > >

[jira] [Commented] (SPARK-15766) R should export is.nan

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315324#comment-15315324 ] Apache Spark commented on SPARK-15766: -- User 'wangmiao1981' has created a pull request for this

[jira] [Assigned] (SPARK-15766) R should export is.nan

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15766: Assignee: (was: Apache Spark) > R should export is.nan > -- > >

[jira] [Created] (SPARK-15766) R should export is.nan

2016-06-03 Thread Miao Wang (JIRA)
Miao Wang created SPARK-15766: - Summary: R should export is.nan Key: SPARK-15766 URL: https://issues.apache.org/jira/browse/SPARK-15766 Project: Spark Issue Type: Bug Reporter: Miao

[jira] [Assigned] (SPARK-15765) Make continuous Parquet writing consistent with non-consistent Parquet writing

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15765: Assignee: (was: Apache Spark) > Make continuous Parquet writing consistent with

[jira] [Assigned] (SPARK-15765) Make continuous Parquet writing consistent with non-consistent Parquet writing

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15765: Assignee: Apache Spark > Make continuous Parquet writing consistent with non-consistent

[jira] [Commented] (SPARK-15765) Make continuous Parquet writing consistent with non-consistent Parquet writing

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315317#comment-15315317 ] Apache Spark commented on SPARK-15765: -- User 'lw-lin' has created a pull request for this issue:

[jira] [Created] (SPARK-15765) Make continuous Parquet writing consistent with non-consistent Parquet writing

2016-06-03 Thread Liwei Lin (JIRA)
Liwei Lin created SPARK-15765: - Summary: Make continuous Parquet writing consistent with non-consistent Parquet writing Key: SPARK-15765 URL: https://issues.apache.org/jira/browse/SPARK-15765 Project:

[jira] [Comment Edited] (SPARK-15507) ClassCastException: SomeCaseClass cannot be cast to org.apache.spark.sql.Row

2016-06-03 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315306#comment-15315306 ] koert kuipers edited comment on SPARK-15507 at 6/4/16 4:20 AM: --- this used

[jira] [Commented] (SPARK-15507) ClassCastException: SomeCaseClass cannot be cast to org.apache.spark.sql.Row

2016-06-03 Thread koert kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315306#comment-15315306 ] koert kuipers commented on SPARK-15507: --- this used to make it very easy to go back and forth

[jira] [Updated] (SPARK-15756) Support command 'create table stored as orcfile/parquetfile/avrofile'

2016-06-03 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-15756: - Description: Now Spark SQL can support 'create table src stored as orc/parquet/avro' for

[jira] [Updated] (SPARK-15756) Support command 'create table stored as orcfile/parquetfile/avrofile'

2016-06-03 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-15756: - Summary: Support command 'create table stored as orcfile/parquetfile/avrofile' (was: Support

[jira] [Commented] (SPARK-15763) Add DELETE FILE command support in spark

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315195#comment-15315195 ] Apache Spark commented on SPARK-15763: -- User 'kevinyu98' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15763) Add DELETE FILE command support in spark

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15763: Assignee: Apache Spark > Add DELETE FILE command support in spark >

[jira] [Assigned] (SPARK-15763) Add DELETE FILE command support in spark

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15763: Assignee: (was: Apache Spark) > Add DELETE FILE command support in spark >

[jira] [Updated] (SPARK-15632) Dataset typed filter operation changes query plan schema

2016-06-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-15632: --- Assignee: Sean Zhong > Dataset typed filter operation changes query plan schema >

[jira] [Commented] (SPARK-15507) ClassCastException: SomeCaseClass cannot be cast to org.apache.spark.sql.Row

2016-06-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315164#comment-15315164 ] Wenchen Fan commented on SPARK-15507: - `Product` is not a valid external type for struct type column.

[jira] [Commented] (SPARK-15764) Replace n^2 loop in BindReferences

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315161#comment-15315161 ] Apache Spark commented on SPARK-15764: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15764) Replace n^2 loop in BindReferences

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15764: Assignee: Apache Spark (was: Josh Rosen) > Replace n^2 loop in BindReferences >

[jira] [Assigned] (SPARK-15764) Replace n^2 loop in BindReferences

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15764: Assignee: Josh Rosen (was: Apache Spark) > Replace n^2 loop in BindReferences >

[jira] [Created] (SPARK-15764) Replace n^2 loop in BindReferences

2016-06-03 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-15764: -- Summary: Replace n^2 loop in BindReferences Key: SPARK-15764 URL: https://issues.apache.org/jira/browse/SPARK-15764 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-15657) RowEncoder should validate the data type of input object

2016-06-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-15657: Issue Type: Sub-task (was: Improvement) Parent: SPARK-15631 > RowEncoder should validate

[jira] [Commented] (SPARK-15545) R remove non-exported unused methods, like jsonRDD

2016-06-03 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315138#comment-15315138 ] Miao Wang commented on SPARK-15545: --- Yes. All RDD APIs are not exported. According to the above PR, I

[jira] [Resolved] (SPARK-15754) org.apache.spark.deploy.yarn.Client changes the credential of current user

2016-06-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-15754. Resolution: Fixed Assignee: Subroto Sanyal Fix Version/s: 2.0.0

[jira] [Resolved] (SPARK-15391) Spark executor OOM during TimSort

2016-06-03 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-15391. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13318

[jira] [Updated] (SPARK-15761) pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an Python3

2016-06-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-15761: --- Assignee: Manoj Kumar > pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an

[jira] [Created] (SPARK-15763) Add DELETE FILE command support in spark

2016-06-03 Thread kevin yu (JIRA)
kevin yu created SPARK-15763: Summary: Add DELETE FILE command support in spark Key: SPARK-15763 URL: https://issues.apache.org/jira/browse/SPARK-15763 Project: Spark Issue Type: Improvement

[jira] [Comment Edited] (SPARK-14380) Review spark.ml parity for clustering

2016-06-03 Thread Xinh Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314617#comment-15314617 ] Xinh Huynh edited comment on SPARK-14380 at 6/3/16 11:18 PM: - Existing

[jira] [Assigned] (SPARK-15762) Cache Metadata.hashCode and use a singleton for Metadata.empty

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15762: Assignee: Apache Spark (was: Josh Rosen) > Cache Metadata.hashCode and use a singleton

[jira] [Commented] (SPARK-15762) Cache Metadata.hashCode and use a singleton for Metadata.empty

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315067#comment-15315067 ] Apache Spark commented on SPARK-15762: -- User 'JoshRosen' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15762) Cache Metadata.hashCode and use a singleton for Metadata.empty

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15762: Assignee: Josh Rosen (was: Apache Spark) > Cache Metadata.hashCode and use a singleton

[jira] [Created] (SPARK-15762) Cache Metadata.hashCode and use a singleton for Metadata.empty

2016-06-03 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-15762: -- Summary: Cache Metadata.hashCode and use a singleton for Metadata.empty Key: SPARK-15762 URL: https://issues.apache.org/jira/browse/SPARK-15762 Project: Spark

[jira] [Updated] (SPARK-15762) Cache Metadata.hashCode and use a singleton for Metadata.empty

2016-06-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-15762: --- Issue Type: Improvement (was: Bug) > Cache Metadata.hashCode and use a singleton for Metadata.empty

[jira] [Updated] (SPARK-15762) Cache Metadata.hashCode and use a singleton for Metadata.empty

2016-06-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-15762: --- Target Version/s: 2.0.0 > Cache Metadata.hashCode and use a singleton for Metadata.empty >

[jira] [Commented] (SPARK-14381) Review spark.ml parity for feature transformers

2016-06-03 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315060#comment-15315060 ] Xusen Yin commented on SPARK-14381: --- I can work on this one. > Review spark.ml parity for feature

[jira] [Resolved] (SPARK-15168) Add missing params to Python's MultilayerPerceptronClassifier

2016-06-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath resolved SPARK-15168. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 12943

[jira] [Updated] (SPARK-15168) Add missing params to Python's MultilayerPerceptronClassifier

2016-06-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-15168: --- Assignee: holdenk > Add missing params to Python's MultilayerPerceptronClassifier >

[jira] [Commented] (SPARK-15545) R remove non-exported unused methods, like jsonRDD

2016-06-03 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315038#comment-15315038 ] Felix Cheung commented on SPARK-15545: -- I think that's mostly right - the RDD related methods are

[jira] [Commented] (SPARK-15761) pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an Python3

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315007#comment-15315007 ] Apache Spark commented on SPARK-15761: -- User 'MechCoder' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15761) pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an Python3

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15761: Assignee: (was: Apache Spark) > pyspark shell should load if PYSPARK_DRIVER_PYTHON is

[jira] [Assigned] (SPARK-15761) pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an Python3

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15761: Assignee: Apache Spark > pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an

[jira] [Created] (SPARK-15761) pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an Python3

2016-06-03 Thread Manoj Kumar (JIRA)
Manoj Kumar created SPARK-15761: --- Summary: pyspark shell should load if PYSPARK_DRIVER_PYTHON is ipython an Python3 Key: SPARK-15761 URL: https://issues.apache.org/jira/browse/SPARK-15761 Project:

[jira] [Commented] (SPARK-15760) Documentation missing for package-related config options

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314984#comment-15314984 ] Apache Spark commented on SPARK-15760: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15760) Documentation missing for package-related config options

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15760: Assignee: Apache Spark > Documentation missing for package-related config options >

[jira] [Assigned] (SPARK-15760) Documentation missing for package-related config options

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15760: Assignee: (was: Apache Spark) > Documentation missing for package-related config

[jira] [Commented] (SPARK-15564) App name is the main class name in Spark streaming jobs

2016-06-03 Thread Steven Lowenthal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314959#comment-15314959 ] Steven Lowenthal commented on SPARK-15564: -- Do I make the PR? > App name is the main class name

[jira] [Commented] (SPARK-15564) App name is the main class name in Spark streaming jobs

2016-06-03 Thread Steven Lowenthal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314957#comment-15314957 ] Steven Lowenthal commented on SPARK-15564: -- Saisai - It's spark standalone mode. See the code

[jira] [Comment Edited] (SPARK-14380) Review spark.ml parity for clustering

2016-06-03 Thread Xinh Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314617#comment-15314617 ] Xinh Huynh edited comment on SPARK-14380 at 6/3/16 10:13 PM: - Existing

[jira] [Commented] (SPARK-15703) Spark UI doesn't show all tasks as completed when it should

2016-06-03 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314926#comment-15314926 ] Shixiong Zhu commented on SPARK-15703: -- [~tgraves] could you check if there are any the following

[jira] [Created] (SPARK-15760) Documentation missing for package-related config options

2016-06-03 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-15760: -- Summary: Documentation missing for package-related config options Key: SPARK-15760 URL: https://issues.apache.org/jira/browse/SPARK-15760 Project: Spark

[jira] [Commented] (SPARK-15716) Memory usage of driver keeps growing up in Spark Streaming

2016-06-03 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314908#comment-15314908 ] Shixiong Zhu commented on SPARK-15716: -- [~yani.chen] could you try 1.6.1 and see if this still

[jira] [Commented] (SPARK-15344) Unable to set default log level for PySpark

2016-06-03 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314907#comment-15314907 ] Felix Cheung commented on SPARK-15344: -- There are two ways to change the default. 1) As in the

[jira] [Resolved] (SPARK-15722) Wrong data when CTAS specifies schema

2016-06-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-15722. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13490

[jira] [Commented] (SPARK-15140) encoder should make sure input object is not null

2016-06-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314853#comment-15314853 ] Cheng Lian commented on SPARK-15140: Issue resolved by pull request 13469

[jira] [Resolved] (SPARK-15140) encoder should make sure input object is not null

2016-06-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15140. Resolution: Fixed Fix Version/s: 2.0.0 Target Version/s: 2.0.0 > encoder should

[jira] [Resolved] (SPARK-15681) Allow case-insensitiveness in sc.setLogLevel

2016-06-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-15681. Resolution: Fixed Assignee: Xin Wu Fix Version/s: 2.0.0 > Allow

[jira] [Resolved] (SPARK-15547) Encoder validation is too strict for inner nested structs

2016-06-03 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian resolved SPARK-15547. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13474

[jira] [Commented] (SPARK-15740) Word2VecSuite "big model load / save" caused OOM in maven jenkins builds

2016-06-03 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314816#comment-15314816 ] Xiangrui Meng commented on SPARK-15740: --- The proposal looks good to me. Please also try to measure

[jira] [Updated] (SPARK-15286) Make the output readable for EXPLAIN CREATE TABLE and DESC EXTENDED

2016-06-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-15286: Assignee: Xiao Li > Make the output readable for EXPLAIN CREATE TABLE and DESC EXTENDED >

[jira] [Commented] (SPARK-3728) RandomForest: Learn models too large to store in memory

2016-06-03 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314795#comment-15314795 ] Xusen Yin commented on SPARK-3728: -- Hi [~josephkb], as I [surveyed on

[jira] [Resolved] (SPARK-15286) Make the output readable for EXPLAIN CREATE TABLE and DESC EXTENDED

2016-06-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-15286. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13070

[jira] [Resolved] (SPARK-15742) Reduce collections allocations in Catalyst tree transformation methods

2016-06-03 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-15742. Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13484

[jira] [Comment Edited] (SPARK-14380) Review spark.ml parity for clustering

2016-06-03 Thread Xinh Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314617#comment-15314617 ] Xinh Huynh edited comment on SPARK-14380 at 6/3/16 8:23 PM: Existing

[jira] [Commented] (SPARK-15544) Bouncing Zookeeper node causes Active spark master to exit

2016-06-03 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314730#comment-15314730 ] Shixiong Zhu commented on SPARK-15544: -- As a workaround, you can write a script to restart master if

[jira] [Resolved] (SPARK-15665) spark-submit --kill and --status are not working

2016-06-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-15665. Resolution: Fixed Assignee: Devaraj K Fix Version/s: 2.0.0 > spark-submit

[jira] [Commented] (SPARK-15746) SchemaUtils.checkColumnType with VectorUDT prints instance details in error message

2016-06-03 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314653#comment-15314653 ] Miao Wang commented on SPARK-15746: --- I see. Thanks! > SchemaUtils.checkColumnType with VectorUDT

[jira] [Commented] (SPARK-15746) SchemaUtils.checkColumnType with VectorUDT prints instance details in error message

2016-06-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314627#comment-15314627 ] Nick Pentreath commented on SPARK-15746: I'd say hold off on working on it until we decide which

[jira] [Updated] (SPARK-15677) Query with scalar sub-query in the SELECT list throws UnsupportedOperationException.

2016-06-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-15677: Assignee: Ioana Delaney > Query with scalar sub-query in the SELECT list throws >

[jira] [Commented] (SPARK-14380) Review spark.ml parity for clustering

2016-06-03 Thread Xinh Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314617#comment-15314617 ] Xinh Huynh commented on SPARK-14380: Existing algorithms * KMeans ** Param: initial model, bypassing

[jira] [Resolved] (SPARK-15677) Query with scalar sub-query in the SELECT list throws UnsupportedOperationException.

2016-06-03 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-15677. - Resolution: Fixed Fix Version/s: 2.0.0 Issue resolved by pull request 13418

[jira] [Commented] (SPARK-15746) SchemaUtils.checkColumnType with VectorUDT prints instance details in error message

2016-06-03 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314587#comment-15314587 ] Miao Wang commented on SPARK-15746: --- [~mlnick] If you are not working on this one, I can give a try.

[jira] [Assigned] (SPARK-15759) Fallback to non-codegen if fail to compile generated code

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15759: Assignee: Apache Spark (was: Davies Liu) > Fallback to non-codegen if fail to compile

[jira] [Commented] (SPARK-15759) Fallback to non-codegen if fail to compile generated code

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314529#comment-15314529 ] Apache Spark commented on SPARK-15759: -- User 'davies' has created a pull request for this issue:

[jira] [Assigned] (SPARK-15759) Fallback to non-codegen if fail to compile generated code

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15759: Assignee: Davies Liu (was: Apache Spark) > Fallback to non-codegen if fail to compile

[jira] [Assigned] (SPARK-15756) Support create table stored as orcfile/parquetfile/avrofile

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15756: Assignee: (was: Apache Spark) > Support create table stored as

[jira] [Assigned] (SPARK-15756) Support create table stored as orcfile/parquetfile/avrofile

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-15756: Assignee: Apache Spark > Support create table stored as orcfile/parquetfile/avrofile >

[jira] [Commented] (SPARK-15756) Support create table stored as orcfile/parquetfile/avrofile

2016-06-03 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314526#comment-15314526 ] Apache Spark commented on SPARK-15756: -- User 'lianhuiwang' has created a pull request for this

[jira] [Created] (SPARK-15759) Fallback to non-codegen if fail to compile generated code

2016-06-03 Thread Davies Liu (JIRA)
Davies Liu created SPARK-15759: -- Summary: Fallback to non-codegen if fail to compile generated code Key: SPARK-15759 URL: https://issues.apache.org/jira/browse/SPARK-15759 Project: Spark Issue

[jira] [Updated] (SPARK-15756) Support create table stored as orcfile/parquetfile/avrofile

2016-06-03 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-15756: - Description: in

[jira] [Updated] (SPARK-15756) Support create table stored as orcfile/parquetfile/avrofile

2016-06-03 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-15756: - Description: in

[jira] [Updated] (SPARK-15756) Support create table stored as orcfile/parquetfile/avrofile

2016-06-03 Thread Lianhui Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lianhui Wang updated SPARK-15756: - Summary: Support create table stored as orcfile/parquetfile/avrofile (was: SQL “stored as

[jira] [Commented] (SPARK-14811) ML, Graph 2.0 QA: API: New Scala APIs, docs

2016-06-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314489#comment-15314489 ] Nick Pentreath commented on SPARK-14811: Yes, that does make sense. I will take a pass through

[jira] [Issue Comment Deleted] (SPARK-15722) Wrong data when CTAS specifies schema

2016-06-03 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-15722: -- Comment: was deleted (was: User 'andrewor14' has created a pull request for this issue:

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2016-06-03 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314470#comment-15314470 ] ASF GitHub Bot commented on SPARK-2243: --- Github user bbuild11 commented on the issue:

[jira] [Comment Edited] (SPARK-15447) Performance test for ALS in Spark 2.0

2016-06-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314441#comment-15314441 ] Nick Pentreath edited comment on SPARK-15447 at 6/3/16 5:22 PM: Added a

[jira] [Commented] (SPARK-15447) Performance test for ALS in Spark 2.0

2016-06-03 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314441#comment-15314441 ] Nick Pentreath commented on SPARK-15447: Added a second tab to the sheet for testing DF-based API

[jira] [Commented] (SPARK-15545) R remove non-exported unused methods, like jsonRDD

2016-06-03 Thread Miao Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314416#comment-15314416 ] Miao Wang commented on SPARK-15545: --- I did a search on all R files with setMethods and compare with

[jira] [Resolved] (SPARK-15737) Fix Jetty server start warning

2016-06-03 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-15737. Resolution: Fixed Assignee: Bo Meng Fix Version/s: 2.0.0 > Fix Jetty

[jira] [Resolved] (SPARK-15714) Fix Flaky Test: o.a.s.scheduler.BlacklistIntegrationSuite

2016-06-03 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid resolved SPARK-15714. -- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 13454

[jira] [Commented] (SPARK-15757) Error occurs when using Spark sql "select" statement on orc file after hive sql "insert overwrite tb1 select * from sourcTb" has been executed on this orc file

2016-06-03 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314365#comment-15314365 ] Sean Owen commented on SPARK-15757: --- This doesn't look like a Spark problem, or at least, right now it

[jira] [Commented] (SPARK-15731) orc writer directory permissions

2016-06-03 Thread kevin yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314364#comment-15314364 ] kevin yu commented on SPARK-15731: -- Hi Ran: I tried on my machine for orc file, the partition

[jira] [Resolved] (SPARK-15710) Exception with WHERE clause in SQL for non-default Hive database

2016-06-03 Thread Igor Fridman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Igor Fridman resolved SPARK-15710. -- Resolution: Resolved Latest rebase of the master resolved the problem > Exception with WHERE

[jira] [Commented] (SPARK-15710) Exception with WHERE clause in SQL for non-default Hive database

2016-06-03 Thread Igor Fridman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314357#comment-15314357 ] Igor Fridman commented on SPARK-15710: -- It seems to work fine now for pyspark. Marking resolved. >

[jira] [Issue Comment Deleted] (SPARK-14381) Review spark.ml parity for feature transformers

2016-06-03 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gayathri Murali updated SPARK-14381: Comment: was deleted (was: I will work on this) > Review spark.ml parity for feature

[jira] [Issue Comment Deleted] (SPARK-15041) adding mode strategy for ml.feature.Imputer for categorical features

2016-06-03 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gayathri Murali updated SPARK-15041: Comment: was deleted (was: I can work on this) > adding mode strategy for

[jira] [Issue Comment Deleted] (SPARK-15201) Handle integer overflow correctly in hash code computation

2016-06-03 Thread Gayathri Murali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gayathri Murali updated SPARK-15201: Comment: was deleted (was: I can work on this ) > Handle integer overflow correctly in

[jira] [Commented] (SPARK-14380) Review spark.ml parity for clustering

2016-06-03 Thread Xinh Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314345#comment-15314345 ] Xinh Huynh commented on SPARK-14380: Completely missing algorithms * Power Iteration Clustering

[jira] [Commented] (SPARK-14380) Review spark.ml parity for clustering

2016-06-03 Thread Xinh Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15314288#comment-15314288 ] Xinh Huynh commented on SPARK-14380: I'll take a stab at this. > Review spark.ml parity for

  1   2   >