[jira] [Commented] (SPARK-7230) Make RDD API private in SparkR for Spark 1.4

2015-05-07 Thread Sun Rui (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532101#comment-14532101 ] Sun Rui commented on SPARK-7230: One question here is there are still some basic RDD API

[jira] [Assigned] (SPARK-7431) PySpark CrossValidatorModel needs to call parent init

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7431: --- Assignee: Apache Spark PySpark CrossValidatorModel needs to call parent init

[jira] [Updated] (SPARK-7431) PySpark CrossValidatorModel needs to call parent init

2015-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7431: - Priority: Major (was: Critical) PySpark CrossValidatorModel needs to call parent init

[jira] [Assigned] (SPARK-7431) PySpark CrossValidatorModel needs to call parent init

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7431: --- Assignee: (was: Apache Spark) PySpark CrossValidatorModel needs to call parent init

[jira] [Commented] (SPARK-7431) PySpark CrossValidatorModel needs to call parent init

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532114#comment-14532114 ] Apache Spark commented on SPARK-7431: - User 'jkbradley' has created a pull request for

[jira] [Commented] (SPARK-7393) How to improve Spark SQL performance?

2015-05-07 Thread Liang Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532201#comment-14532201 ] Liang Lee commented on SPARK-7393: -- Dear Dennis, Under a 1-node standalone spark cluster

[jira] [Updated] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into true or false directly if all element of list is Literal

2015-05-07 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7437: -- Summary: Fold literal in (item1, item2, ..., literal, ...) into true or false directly if all

[jira] [Updated] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into true or false directly if element of list is Literal

2015-05-07 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7437: -- Summary: Fold literal in (item1, item2, ..., literal, ...) into true or false directly if

[jira] [Resolved] (SPARK-7421) Online LDA cleanups

2015-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7421. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5956

[jira] [Assigned] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into false directly if not in.

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7437: --- Assignee: (was: Apache Spark) Fold literal in (item1, item2, ..., literal, ...) into

[jira] [Commented] (SPARK-7035) Drop __getattr__ on pyspark.sql.DataFrame

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532148#comment-14532148 ] Apache Spark commented on SPARK-7035: - User 'ksonj' has created a pull request for

[jira] [Commented] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into false directly if not in.

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532147#comment-14532147 ] Apache Spark commented on SPARK-7437: - User 'DoingDone9' has created a pull request

[jira] [Assigned] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into false directly if not in.

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7437: --- Assignee: Apache Spark Fold literal in (item1, item2, ..., literal, ...) into false

[jira] [Resolved] (SPARK-7430) General improvements to streaming tests to increase debuggability

2015-05-07 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-7430. -- Resolution: Fixed Fix Version/s: 1.4.0 General improvements to streaming tests to

[jira] [Updated] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into false directly if not in.

2015-05-07 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7437: -- Description: Just Fold literal in (item1, item2, ..., literal, ...) into true directly if in

[jira] [Assigned] (SPARK-7436) Cannot implement nor use custom StandaloneRecoveryModeFactory implementations

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7436: --- Assignee: Apache Spark Cannot implement nor use custom StandaloneRecoveryModeFactory

[jira] [Created] (SPARK-7436) Cannot implement nor use custom StandaloneRecoveryModeFactory implementations

2015-05-07 Thread Jacek Lewandowski (JIRA)
Jacek Lewandowski created SPARK-7436: Summary: Cannot implement nor use custom StandaloneRecoveryModeFactory implementations Key: SPARK-7436 URL: https://issues.apache.org/jira/browse/SPARK-7436

[jira] [Created] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into false directly if not in.

2015-05-07 Thread Zhongshuai Pei (JIRA)
Zhongshuai Pei created SPARK-7437: - Summary: Fold literal in (item1, item2, ..., literal, ...) into false directly if not in. Key: SPARK-7437 URL: https://issues.apache.org/jira/browse/SPARK-7437

[jira] [Resolved] (SPARK-7429) Cleanups: Params.setDefault varargs, CrossValidatorModel transformSchema

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-7429. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5960

[jira] [Updated] (SPARK-7075) Project Tungsten: Improving Physical Execution and Memory Management

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7075: --- Description: Based on our observation, the majority of Spark workloads are not bottlenecked by I/O or

[jira] [Updated] (SPARK-7075) Project Tungsten: Improving Physical Execution and Memory Management

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7075: --- Description: Based on our observation, the majority of Spark workloads are not bottlenecked by I/O or

[jira] [Commented] (SPARK-7183) Memory leak in netty shuffle with spark standalone cluster

2015-05-07 Thread Jack Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532108#comment-14532108 ] Jack Hu commented on SPARK-7183: Hi, [~sowen] Do we plan to add this to 1.3+? If there is

[jira] [Updated] (SPARK-7431) PySpark CrossValidatorModel needs to call parent init

2015-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7431: - Summary: PySpark CrossValidatorModel needs to call parent init (was: cvModel does not

[jira] [Updated] (SPARK-7436) Cannot implement nor use custom StandaloneRecoveryModeFactory implementations

2015-05-07 Thread Jacek Lewandowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Lewandowski updated SPARK-7436: - Description: At least, this code fragment is buggy ({{Master.scala}}): {code} case

[jira] [Updated] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into false directly if not in.

2015-05-07 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7437: -- Description: just Fold literal in (item1, item2, ..., literal, ...) into true directly if in

[jira] [Commented] (SPARK-7035) Drop __getattr__ on pyspark.sql.DataFrame

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532141#comment-14532141 ] Apache Spark commented on SPARK-7035: - User 'ksonj' has created a pull request for

[jira] [Assigned] (SPARK-7438) Validation Error while running countApproxDistinct with relative accuracy = 0.38

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7438: --- Assignee: (was: Apache Spark) Validation Error while running countApproxDistinct with

[jira] [Commented] (SPARK-7438) Validation Error while running countApproxDistinct with relative accuracy = 0.38

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532161#comment-14532161 ] Apache Spark commented on SPARK-7438: - User 'vinodkc' has created a pull request for

[jira] [Assigned] (SPARK-7438) Validation Error while running countApproxDistinct with relative accuracy = 0.38

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7438: --- Assignee: Apache Spark Validation Error while running countApproxDistinct with relative

[jira] [Resolved] (SPARK-7433) How to pass the parameters to spark SQL backend and set its value to the environment variable through the simba ODBC driver.

2015-05-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-7433. -- Resolution: Invalid How to pass the parameters to spark SQL backend and set its value to the

[jira] [Updated] (SPARK-7440) Binary processing for SQL Distinct operator

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7440: --- Description: We can either just rewrite distinct using groupby (i.e. aggregate operator), or rewrite

[jira] [Updated] (SPARK-7075) Project Tungsten: Improving Physical Execution and Memory Management

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7075: --- Description: Based on our observation, the majority of Spark workloads are not bottlenecked by I/O or

[jira] [Assigned] (SPARK-7262) Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7262: --- Assignee: (was: Apache Spark) Binary LogisticRegression with L1/L2 (elastic net) using

[jira] [Commented] (SPARK-7262) Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532088#comment-14532088 ] Apache Spark commented on SPARK-7262: - User 'dbtsai' has created a pull request for

[jira] [Assigned] (SPARK-7262) Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in new ML package

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7262: --- Assignee: Apache Spark Binary LogisticRegression with L1/L2 (elastic net) using OWLQN in

[jira] [Commented] (SPARK-7230) Make RDD API private in SparkR for Spark 1.4

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532106#comment-14532106 ] Reynold Xin commented on SPARK-7230: We should hide them for now. As a matter of

[jira] [Created] (SPARK-7439) Should delete temporary local directories

2015-05-07 Thread Taeyun Kim (JIRA)
Taeyun Kim created SPARK-7439: - Summary: Should delete temporary local directories Key: SPARK-7439 URL: https://issues.apache.org/jira/browse/SPARK-7439 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-7431) PySpark CrossValidatorModel needs to call parent init

2015-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-7431: Assignee: Joseph K. Bradley PySpark CrossValidatorModel needs to call parent init

[jira] [Updated] (SPARK-7438) Validation Error while running countApproxDistinct with relative accuracy = 0.38

2015-05-07 Thread Vinod KC (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinod KC updated SPARK-7438: Description: Eg Code: val a = sc.parallelize(1 to 1, 20) val b = a ++ a ++ a ++ a ++ a

[jira] [Commented] (SPARK-3904) HQL doesn't support the ConstantObjectInspector to pass into UDFs

2015-05-07 Thread baishuo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532192#comment-14532192 ] baishuo commented on SPARK-3904: please reference

[jira] [Resolved] (SPARK-7295) Add bitwise operations to DataFrame DSL

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7295. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Shiti Saxena Add bitwise

[jira] [Assigned] (SPARK-7436) Cannot implement nor use custom StandaloneRecoveryModeFactory implementations

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7436: --- Assignee: (was: Apache Spark) Cannot implement nor use custom

[jira] [Commented] (SPARK-7275) Make LogicalRelation public

2015-05-07 Thread Santiago M. Mola (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532159#comment-14532159 ] Santiago M. Mola commented on SPARK-7275: - [~gweidner] I work on a project that

[jira] [Created] (SPARK-7438) Validation Error while running countApproxDistinct with relative accuracy = 0.38

2015-05-07 Thread Vinod KC (JIRA)
Vinod KC created SPARK-7438: --- Summary: Validation Error while running countApproxDistinct with relative accuracy = 0.38 Key: SPARK-7438 URL: https://issues.apache.org/jira/browse/SPARK-7438 Project:

[jira] [Created] (SPARK-7435) Make DataFrame.show() consistent with that of Scala and pySpark

2015-05-07 Thread Sun Rui (JIRA)
Sun Rui created SPARK-7435: -- Summary: Make DataFrame.show() consistent with that of Scala and pySpark Key: SPARK-7435 URL: https://issues.apache.org/jira/browse/SPARK-7435 Project: Spark Issue

[jira] [Commented] (SPARK-7116) Intermediate RDD cached but never unpersisted

2015-05-07 Thread Kalle Jepsen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532146#comment-14532146 ] Kalle Jepsen commented on SPARK-7116: - Sure, thanks Intermediate RDD cached but

[jira] [Assigned] (SPARK-7116) Intermediate RDD cached but never unpersisted

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7116: --- Assignee: Apache Spark Intermediate RDD cached but never unpersisted

[jira] [Commented] (SPARK-7116) Intermediate RDD cached but never unpersisted

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14532160#comment-14532160 ] Apache Spark commented on SPARK-7116: - User 'ksonj' has created a pull request for

[jira] [Assigned] (SPARK-7116) Intermediate RDD cached but never unpersisted

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7116: --- Assignee: (was: Apache Spark) Intermediate RDD cached but never unpersisted

[jira] [Updated] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into true or false directly if all element of list is Literal

2015-05-07 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7437: -- Description: (was: Just Fold literal in (item1, item2, ..., literal, ...) into true

[jira] [Updated] (SPARK-7437) Fold literal in (item1, item2, ..., literal, ...) into true or false directly if all elements of list is Literal

2015-05-07 Thread Zhongshuai Pei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhongshuai Pei updated SPARK-7437: -- Summary: Fold literal in (item1, item2, ..., literal, ...) into true or false directly if all

[jira] [Assigned] (SPARK-7199) Add date and timestamp support to UnsafeRow

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7199: --- Assignee: (was: Apache Spark) Add date and timestamp support to UnsafeRow

[jira] [Commented] (SPARK-7199) Add date and timestamp support to UnsafeRow

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533068#comment-14533068 ] Apache Spark commented on SPARK-7199: - User 'viirya' has created a pull request for

[jira] [Updated] (SPARK-7443) MLlib 1.4 QA plan

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7443: - Description: TODO: create JIRAs for each task and assign them accordingly. h2. API * Check API

[jira] [Updated] (SPARK-7443) MLlib 1.4 QA plan

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7443: - Description: TODO: create JIRAs for each task and assign them accordingly. h2. API * Check API

[jira] [Updated] (SPARK-7443) MLlib 1.4 QA plan

2015-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7443: - Description: TODO: create JIRAs for each task and assign them accordingly. h2. API *

[jira] [Commented] (SPARK-7427) Make sharedParams match in Scala, Python

2015-05-07 Thread Glenn Weidner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533239#comment-14533239 ] Glenn Weidner commented on SPARK-7427: -- Also checking

[jira] [Commented] (SPARK-5754) Spark AM not launching on Windows

2015-05-07 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533315#comment-14533315 ] Steve Loughran commented on SPARK-5754: --- I had a look at what we did for slider via

[jira] [Updated] (SPARK-6093) Add RegressionMetrics in PySpark/MLlib

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6093: - Assignee: Yanbo Liang Add RegressionMetrics in PySpark/MLlib

[jira] [Commented] (SPARK-3928) Support wildcard matches on Parquet files

2015-05-07 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533244#comment-14533244 ] Michael Armbrust commented on SPARK-3928: - Sorry for the confusion here. There

[jira] [Created] (SPARK-7446) Inverse transform for StringIndexer

2015-05-07 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7446: Summary: Inverse transform for StringIndexer Key: SPARK-7446 URL: https://issues.apache.org/jira/browse/SPARK-7446 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-3928) Support wildcard matches on Parquet files

2015-05-07 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-3928: Assignee: Cheng Lian Support wildcard matches on Parquet files

[jira] [Commented] (SPARK-3928) Support wildcard matches on Parquet files

2015-05-07 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533277#comment-14533277 ] Nicholas Chammas commented on SPARK-3928: - {quote} Comma separated lists: were

[jira] [Commented] (SPARK-7449) createPhysicalRDD should use RDD output as schema instead of relation.schema

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533298#comment-14533298 ] Apache Spark commented on SPARK-7449: - User 'zhzhan' has created a pull request for

[jira] [Resolved] (SPARK-7118) Add coalesce Spark SQL function to PySpark API

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7118. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Olivier Girardot Add coalesce

[jira] [Updated] (SPARK-7118) Add coalesce Spark SQL function to PySpark API

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-7118: --- Issue Type: Sub-task (was: Improvement) Parent: SPARK-6116 Add coalesce Spark SQL function

[jira] [Updated] (SPARK-6411) PySpark DataFrames can't be created if any datetimes have timezones

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6411: - Assignee: (was: Xiangrui Meng) PySpark DataFrames can't be created if any datetimes have

[jira] [Updated] (SPARK-6411) PySpark DataFrames can't be created if any datetimes have timezones

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-6411: - Assignee: Davies Liu PySpark DataFrames can't be created if any datetimes have timezones

[jira] [Updated] (SPARK-7435) Make DataFrame.show() consistent with that of Scala and pySpark

2015-05-07 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-7435: - Target Version/s: 1.4.0 Make DataFrame.show() consistent with that of Scala and

[jira] [Commented] (SPARK-7427) Make sharedParams match in Scala, Python

2015-05-07 Thread Glenn Weidner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533207#comment-14533207 ] Glenn Weidner commented on SPARK-7427: -- I would like to start working on this since I

[jira] [Assigned] (SPARK-7199) Add date and timestamp support to UnsafeRow

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7199: --- Assignee: Apache Spark Add date and timestamp support to UnsafeRow

[jira] [Updated] (SPARK-7443) MLlib 1.4 QA plan

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7443: - Description: TODO: create JIRAs for each task and assign them accordingly. h2. API * Check API

[jira] [Updated] (SPARK-7443) MLlib 1.4 QA plan

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7443: - Assignee: Joseph K. Bradley (was: Xiangrui Meng) MLlib 1.4 QA plan -

[jira] [Resolved] (SPARK-7116) Intermediate RDD cached but never unpersisted

2015-05-07 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7116. - Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5973

[jira] [Commented] (SPARK-6948) VectorAssembler should choose dense/sparse for output based on number of zeros

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533226#comment-14533226 ] Apache Spark commented on SPARK-6948: - User 'mengxr' has created a pull request for

[jira] [Updated] (SPARK-7378) HistoryServer does not handle deep link when lazy loading app

2015-05-07 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7378: - Assignee: Marcelo Vanzin HistoryServer does not handle deep link when lazy loading app

[jira] [Updated] (SPARK-7378) HistoryServer does not handle deep link when lazy loading app

2015-05-07 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7378: - Priority: Blocker (was: Major) HistoryServer does not handle deep link when lazy loading app

[jira] [Commented] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533097#comment-14533097 ] Reynold Xin commented on SPARK-6980: I will let [~zsxwing] chime in. Akka timeout

[jira] [Commented] (SPARK-2496) Compression streams should write its codec info to the stream

2015-05-07 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533084#comment-14533084 ] Josh Rosen commented on SPARK-2496: --- One potential concern here is the ability to

[jira] [Updated] (SPARK-7378) HistoryServer does not handle deep link when lazy loading app

2015-05-07 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-7378: - Priority: Major (was: Minor) HistoryServer does not handle deep link when lazy loading app

[jira] [Created] (SPARK-7447) Large Job submission lag when using Parquet w/ Schema Merging

2015-05-07 Thread Brad Willard (JIRA)
Brad Willard created SPARK-7447: --- Summary: Large Job submission lag when using Parquet w/ Schema Merging Key: SPARK-7447 URL: https://issues.apache.org/jira/browse/SPARK-7447 Project: Spark

[jira] [Created] (SPARK-7448) Implement custom byte array serializer for use in PySpark shuffle

2015-05-07 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-7448: - Summary: Implement custom byte array serializer for use in PySpark shuffle Key: SPARK-7448 URL: https://issues.apache.org/jira/browse/SPARK-7448 Project: Spark

[jira] [Resolved] (SPARK-6093) Add RegressionMetrics in PySpark/MLlib

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-6093. -- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5941

[jira] [Updated] (SPARK-7404) Add RegressionEvaluator to spark.ml

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7404?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7404: - Assignee: (was: Xiangrui Meng) Add RegressionEvaluator to spark.ml

[jira] [Created] (SPARK-7445) StringIndexer should handle binary labels properly

2015-05-07 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7445: Summary: StringIndexer should handle binary labels properly Key: SPARK-7445 URL: https://issues.apache.org/jira/browse/SPARK-7445 Project: Spark Issue Type:

[jira] [Commented] (SPARK-7427) Make sharedParams match in Scala, Python

2015-05-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533229#comment-14533229 ] Joseph K. Bradley commented on SPARK-7427: -- Please go ahead! (This one is

[jira] [Commented] (SPARK-7378) HistoryServer does not handle deep link when lazy loading app

2015-05-07 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14533273#comment-14533273 ] Andrew Or commented on SPARK-7378: -- Bumping to blocker because it's a regression from

[jira] [Commented] (SPARK-5034) Spark on Yarn launch failure on HDInsight on Windows

2015-05-07 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14533349#comment-14533349 ] Steve Loughran commented on SPARK-5034: --- looks related to SPARK-5754 ; linking

[jira] [Updated] (SPARK-7447) Large Job submission lag when using Parquet w/ Schema Merging

2015-05-07 Thread Brad Willard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brad Willard updated SPARK-7447: Environment: Spark 1.3.1, aws, persistent hdfs version 2 with ebs storage, pyspark, 8 x c3.8xlarge

[jira] [Comment Edited] (SPARK-7435) Make DataFrame.show() consistent with that of Scala and pySpark

2015-05-07 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14533478#comment-14533478 ] Reynold Xin edited comment on SPARK-7435 at 5/7/15 10:52 PM: -

[jira] [Created] (SPARK-7455) Perf test for LDA (EM/online)

2015-05-07 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7455: Summary: Perf test for LDA (EM/online) Key: SPARK-7455 URL: https://issues.apache.org/jira/browse/SPARK-7455 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-7443) MLlib 1.4 QA plan

2015-05-07 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-7443: - Description: TODO: create JIRAs for each task and assign them accordingly. h2. API * Check API

[jira] [Created] (SPARK-7457) Perf test for ALS.recommendAll

2015-05-07 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7457: Summary: Perf test for ALS.recommendAll Key: SPARK-7457 URL: https://issues.apache.org/jira/browse/SPARK-7457 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-7456) Perf test for linear regression with elastic-net

2015-05-07 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7456: Summary: Perf test for linear regression with elastic-net Key: SPARK-7456 URL: https://issues.apache.org/jira/browse/SPARK-7456 Project: Spark Issue Type:

[jira] [Commented] (SPARK-2356) Exception: Could not locate executable null\bin\winutils.exe in the Hadoop

2015-05-07 Thread zhengbing li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14533596#comment-14533596 ] zhengbing li commented on SPARK-2356: - winutils.exe from

[jira] [Assigned] (SPARK-7137) Add checkInputColumn back to Params and print more info

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7137: --- Assignee: (was: Apache Spark) Add checkInputColumn back to Params and print more info

[jira] [Assigned] (SPARK-7137) Add checkInputColumn back to Params and print more info

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7137: --- Assignee: Apache Spark Add checkInputColumn back to Params and print more info

[jira] [Commented] (SPARK-7137) Add checkInputColumn back to Params and print more info

2015-05-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14533605#comment-14533605 ] Apache Spark commented on SPARK-7137: - User 'rekhajoshm' has created a pull request

[jira] [Resolved] (SPARK-7450) Use UNSAFE.getLong() to speed up BitSetMethods#anySet()

2015-05-07 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen resolved SPARK-7450. --- Resolution: Fixed Fix Version/s: 1.4.0 Issue resolved by pull request 5897

[jira] [Updated] (SPARK-7450) Use UNSAFE.getLong() to speed up BitSetMethods#anySet()

2015-05-07 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7450: -- Priority: Minor (was: Major) Use UNSAFE.getLong() to speed up BitSetMethods#anySet()
