[jira] [Resolved] (SPARK-8698) partitionBy in Python DataFrame reader/writer interface should not default to empty tuple

2015-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-8698. Resolution: Fixed Fix Version/s: 1.5.0 partitionBy in Python DataFrame reader/writer

[jira] [Commented] (SPARK-8337) KafkaUtils.createDirectStream for python is lacking API/feature parity with the Scala/Java version

2015-06-29 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605241#comment-14605241 ] Saisai Shao commented on SPARK-8337: Hi [~juanrh], I think the best choice is to keep

[jira] [Created] (SPARK-8700) Disable feature scaling in Logistic Regression

2015-06-29 Thread DB Tsai (JIRA)
DB Tsai created SPARK-8700: -- Summary: Disable feature scaling in Logistic Regression Key: SPARK-8700 URL: https://issues.apache.org/jira/browse/SPARK-8700 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-8700) Disable feature scaling in Logistic Regression

2015-06-29 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai reassigned SPARK-8700: -- Assignee: DB Tsai Disable feature scaling in Logistic Regression

[jira] [Commented] (SPARK-8701) Add input metadata to InputInfo and display it in the batch page

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605265#comment-14605265 ] Apache Spark commented on SPARK-8701: - User 'zsxwing' has created a pull request for

[jira] [Commented] (SPARK-8559) Support association rule generation in FPGrowth

2015-06-29 Thread Guangwen Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605332#comment-14605332 ] Guangwen Liu commented on SPARK-8559: - Thanks, Feyman. That is exactly what I mean and

[jira] [Updated] (SPARK-8668) expr function to convert SQL expression into a Column

2015-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin updated SPARK-8668: --- Description: selectExpr uses the expression parser to parse string expressions. It would be great to

[jira] [Created] (SPARK-8701) Add input metadata to InputInfo and display it in the batch page

2015-06-29 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-8701: --- Summary: Add input metadata to InputInfo and display it in the batch page Key: SPARK-8701 URL: https://issues.apache.org/jira/browse/SPARK-8701 Project: Spark

[jira] [Commented] (SPARK-8661) Update comments that contain R statements in ml.LinearRegressionSuite

2015-06-29 Thread somil deshmukh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605256#comment-14605256 ] somil deshmukh commented on SPARK-8661: --- I would like to solve this issue Update

[jira] [Commented] (SPARK-8660) Update comments that contain R statements in ml.logisticRegressionSuite

2015-06-29 Thread somil deshmukh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605257#comment-14605257 ] somil deshmukh commented on SPARK-8660: --- I would like to solve this issue Update

[jira] [Commented] (SPARK-8592) CoarseGrainedExecutorBackend: Cannot register with driver => NPE

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605177#comment-14605177 ] Apache Spark commented on SPARK-8592: - User 'xuchenCN' has created a pull request for

[jira] [Assigned] (SPARK-8592) CoarseGrainedExecutorBackend: Cannot register with driver => NPE

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8592: --- Assignee: (was: Apache Spark) CoarseGrainedExecutorBackend: Cannot register with driver

[jira] [Assigned] (SPARK-8592) CoarseGrainedExecutorBackend: Cannot register with driver => NPE

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8592: --- Assignee: Apache Spark CoarseGrainedExecutorBackend: Cannot register with driver => NPE

[jira] [Updated] (SPARK-8426) Add blacklist mechanism for YARN container allocation

2015-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8426: - Priority: Minor (was: Major) Add blacklist mechanism for YARN container allocation

[jira] [Updated] (SPARK-8425) Add blacklist mechanism for task scheduling

2015-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8425: - Priority: Minor (was: Major) Add blacklist mechanism for task scheduling

[jira] [Updated] (SPARK-8426) Add blacklist mechanism for YARN container allocation

2015-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8426: - Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-8424) Add blacklist mechanism

[jira] [Updated] (SPARK-8425) Add blacklist mechanism for task scheduling

2015-06-29 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8425?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-8425: - Issue Type: Improvement (was: Sub-task) Parent: (was: SPARK-8424) Add blacklist mechanism

[jira] [Updated] (SPARK-5562) LDA should handle empty documents

2015-06-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5562: - Shepherd: Joseph K. Bradley LDA should handle empty documents

[jira] [Assigned] (SPARK-8355) Python DataFrameReader/Writer should mirror scala

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8355: --- Assignee: (was: Apache Spark) Python DataFrameReader/Writer should mirror scala

[jira] [Commented] (SPARK-8355) Python DataFrameReader/Writer should mirror scala

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605185#comment-14605185 ] Apache Spark commented on SPARK-8355: - User 'piaozhexiu' has created a pull request

[jira] [Assigned] (SPARK-8355) Python DataFrameReader/Writer should mirror scala

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8355: --- Assignee: Apache Spark Python DataFrameReader/Writer should mirror scala

[jira] [Commented] (SPARK-8700) Disable feature scaling in Logistic Regression

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605251#comment-14605251 ] Apache Spark commented on SPARK-8700: - User 'dbtsai' has created a pull request for

[jira] [Assigned] (SPARK-8700) Disable feature scaling in Logistic Regression

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8700: --- Assignee: Apache Spark (was: DB Tsai) Disable feature scaling in Logistic Regression

[jira] [Commented] (SPARK-8621) crosstab exception when one of the value is empty

2015-06-29 Thread Animesh Baranawal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605273#comment-14605273 ] Animesh Baranawal commented on SPARK-8621: -- cc [~marmbrus] [~rxin] I think

[jira] [Commented] (SPARK-8716) Remove executor shared cache feature

2015-06-29 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606627#comment-14606627 ] Marcelo Vanzin commented on SPARK-8716: --- bq. (1) It doesn't even work. Recently,

[jira] [Commented] (SPARK-8716) Remove executor shared cache feature

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606657#comment-14606657 ] Andrew Or commented on SPARK-8716: -- [~vanzin] doesn't each executor container get its own

[jira] [Resolved] (SPARK-8589) cleanup DateTimeUtils

2015-06-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-8589. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6980

[jira] [Updated] (SPARK-8717) Update mllib-data-types docs to include missing matrix Python examples

2015-06-29 Thread Rosstin Murphy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rosstin Murphy updated SPARK-8717: -- Description: Currently, the documentation for MLLib Data Types (docs/mllib-data-types.md in

[jira] [Updated] (SPARK-8716) Write tests for executor shared cache feature

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8716: - Description: More specifically, this is the feature that is currently flagged by

[jira] [Closed] (SPARK-8019) [SparkR] Create worker R processes with a command other then Rscript

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-8019. Resolution: Fixed Fix Version/s: 1.5.0 Target Version/s: 1.5.0 [SparkR] Create worker R

[jira] [Resolved] (SPARK-8715) ArrayOutOfBoundsException for DataFrameStatSuite.crosstab

2015-06-29 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-8715. - Resolution: Fixed Fix Version/s: 1.5.0 1.4.2 Issue resolved by pull request

[jira] [Updated] (SPARK-8715) ArrayOutOfBoundsException for DataFrameStatSuite.crosstab

2015-06-29 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-8715: Assignee: Burak Yavuz ArrayOutOfBoundsException for DataFrameStatSuite.crosstab

[jira] [Resolved] (SPARK-8669) Parquet 1.7 files that store binary enums crash when inferring schema

2015-06-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-8669. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 7048

[jira] [Resolved] (SPARK-7667) MLlib Python API consistency check

2015-06-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-7667. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6856

[jira] [Updated] (SPARK-7674) R-like stats for ML models

2015-06-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7674: - Shepherd: Joseph K. Bradley R-like stats for ML models

[jira] [Assigned] (SPARK-8721) Rename ExpectsInputTypes => AutoCastInputTypes

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8721: --- Assignee: Apache Spark (was: Reynold Xin) Rename ExpectsInputTypes => AutoCastInputTypes

[jira] [Assigned] (SPARK-8721) Rename ExpectsInputTypes => AutoCastInputTypes

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8721: --- Assignee: Reynold Xin (was: Apache Spark) Rename ExpectsInputTypes => AutoCastInputTypes

[jira] [Commented] (SPARK-8721) Rename ExpectsInputTypes => AutoCastInputTypes

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606901#comment-14606901 ] Apache Spark commented on SPARK-8721: - User 'rxin' has created a pull request for this

[jira] [Resolved] (SPARK-8661) Update comments that contain R statements in ml.LinearRegressionSuite

2015-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-8661. Resolution: Fixed Assignee: somil deshmukh Fix Version/s: 1.5.0 Update comments

[jira] [Created] (SPARK-8716) Remove executor shared cache feature

2015-06-29 Thread Andrew Or (JIRA)
Andrew Or created SPARK-8716: Summary: Remove executor shared cache feature Key: SPARK-8716 URL: https://issues.apache.org/jira/browse/SPARK-8716 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-8716) Remove executor shared cache feature

2015-06-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606646#comment-14606646 ] Josh Rosen commented on SPARK-8716: --- Looking a bit more closely, I guess they might be

[jira] [Updated] (SPARK-8407) complex type constructors: struct and named_struct

2015-06-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8407: Assignee: Yijie Shen complex type constructors: struct and named_struct

[jira] [Resolved] (SPARK-8710) ScalaReflection.mirror should be a def

2015-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-8710. Resolution: Fixed Fix Version/s: 1.4.2 1.5.0 ScalaReflection.mirror

[jira] [Assigned] (SPARK-8588) Could not use concat with UDF in where clause

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8588: --- Assignee: Wenchen Fan (was: Apache Spark) Could not use concat with UDF in where clause

[jira] [Assigned] (SPARK-8588) Could not use concat with UDF in where clause

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8588: --- Assignee: Apache Spark (was: Wenchen Fan) Could not use concat with UDF in where clause

[jira] [Updated] (SPARK-8717) Update mllib-data-types docs to include missing matrix Python examples

2015-06-29 Thread Rosstin Murphy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rosstin Murphy updated SPARK-8717: -- Description: Currently, the documentation for MLLib Data Types (docs/mllib-data-types.md in

[jira] [Updated] (SPARK-8717) Update mllib-data-types docs to include missing matrix Python examples

2015-06-29 Thread Rosstin Murphy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rosstin Murphy updated SPARK-8717: -- Description: Currently, the documentation for MLLib Data Types (docs/mllib-data-types.md in

[jira] [Closed] (SPARK-8130) spark.files.useFetchCache should be off by default

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-8130. Resolution: Won't Fix See discussion on https://github.com/apache/spark/pull/7051

[jira] [Updated] (SPARK-8657) Fail to upload conf archive to viewfs

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8657: - Target Version/s: 1.5.0, 1.4.2 (was: 1.4.2) Fail to upload conf archive to viewfs

[jira] [Updated] (SPARK-8119) Spark will set total executor when some executors fail.

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8119: - Description: DynamicAllocation will set the total number of executors to a small number when it wants to kill some

[jira] [Commented] (SPARK-8450) PySpark write.parquet raises Unsupported datatype DecimalType()

2015-06-29 Thread Yuri Saito (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606812#comment-14606812 ] Yuri Saito commented on SPARK-8450: --- When {{createDataFrame}} is called (via *PySpark*),

[jira] [Updated] (SPARK-8119) HeartbeatReceiver should not adjust application executor resources

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8119: - Component/s: (was: Scheduler) HeartbeatReceiver should not adjust application executor resources

[jira] [Reopened] (SPARK-8031) Version number written to Hive metastore is 0.13.1aa instead of 0.13.1a

2015-06-29 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi reopened SPARK-8031: Version number written to Hive metastore is 0.13.1aa instead of 0.13.1a

[jira] [Closed] (SPARK-8031) Version number written to Hive metastore is 0.13.1aa instead of 0.13.1a

2015-06-29 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi closed SPARK-8031. -- Version number written to Hive metastore is 0.13.1aa instead of 0.13.1a

[jira] [Commented] (SPARK-8653) Add constraint for Children expression for data type

2015-06-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606904#comment-14606904 ] Reynold Xin commented on SPARK-8653: I think a better way to do this is to have a

[jira] [Updated] (SPARK-8657) Fail to upload conf archive to viewfs

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8657: - Affects Version/s: (was: 1.4.2) (was: 1.4.1) Fail to upload conf archive

[jira] [Updated] (SPARK-8657) Fail to upload conf archive to viewfs

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8657: - Fix Version/s: (was: 1.4.2) Fail to upload conf archive to viewfs

[jira] [Updated] (SPARK-8657) Fail to upload conf archive to viewfs

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8657: - Target Version/s: 1.5.0 (was: 1.5.0, 1.4.2) Fail to upload conf archive to viewfs

[jira] [Updated] (SPARK-8457) Documentation for N-Gram feature transformer

2015-06-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8457: - Assignee: Feynman Liang Target Version/s: 1.5.0 Documentation for N-Gram

[jira] [Updated] (SPARK-8456) Python API for N-Gram Feature Transformer

2015-06-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8456: - Assignee: Feynman Liang Python API for N-Gram Feature Transformer

[jira] [Resolved] (SPARK-8456) Python API for N-Gram Feature Transformer

2015-06-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-8456. -- Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6960

[jira] [Commented] (SPARK-8716) Remove executor shared cache feature

2015-06-29 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606660#comment-14606660 ] Marcelo Vanzin commented on SPARK-8716: --- Yes, but there is also a shared app

[jira] [Commented] (SPARK-8716) Remove executor shared cache feature

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606682#comment-14606682 ] Andrew Or commented on SPARK-8716: -- Hm, I think you're right. Otherwise there's no way

[jira] [Created] (SPARK-8718) Improve EdgePartition2D for non perfect square number of partitions

2015-06-29 Thread Andrew Ray (JIRA)
Andrew Ray created SPARK-8718: - Summary: Improve EdgePartition2D for non perfect square number of partitions Key: SPARK-8718 URL: https://issues.apache.org/jira/browse/SPARK-8718 Project: Spark

[jira] [Closed] (SPARK-8410) Hive VersionsSuite RuntimeException

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-8410. Resolution: Fixed Fix Version/s: 1.4.2 1.5.0 Target Version/s: 1.5.0,

[jira] [Closed] (SPARK-8475) SparkSubmit with Ivy jars is very slow to load with no internet access

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-8475. Resolution: Fixed Fix Version/s: 1.4.2 1.5.0 SparkSubmit with Ivy jars is very

[jira] [Updated] (SPARK-8475) SparkSubmit with Ivy jars is very slow to load with no internet access

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8475: - Assignee: Burak Yavuz SparkSubmit with Ivy jars is very slow to load with no internet access

[jira] [Created] (SPARK-8719) Adding Python support for 1-sample, 2-sided Kolmogorov Smirnov Test

2015-06-29 Thread Jose Cambronero (JIRA)
Jose Cambronero created SPARK-8719: -- Summary: Adding Python support for 1-sample, 2-sided Kolmogorov Smirnov Test Key: SPARK-8719 URL: https://issues.apache.org/jira/browse/SPARK-8719 Project: Spark

[jira] [Commented] (SPARK-8119) HeartbeatReceiver should not adjust application executor resources

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606853#comment-14606853 ] Apache Spark commented on SPARK-8119: - User 'andrewor14' has created a pull request

[jira] [Commented] (SPARK-5571) LDA should handle text as well

2015-06-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606869#comment-14606869 ] Joseph K. Bradley commented on SPARK-5571: -- Thanks for your interest! The API

[jira] [Resolved] (SPARK-8031) Version number written to Hive metastore is 0.13.1aa instead of 0.13.1a

2015-06-29 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rekha Joshi resolved SPARK-8031. Resolution: Fixed Fixed in 1.5.0 Version number written to Hive metastore is 0.13.1aa instead of

[jira] [Commented] (SPARK-8703) Add CountVectorizer as a ml transformer to convert document to words count vector

2015-06-29 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606909#comment-14606909 ] yuhao yang commented on SPARK-8703: --- Appreciate the suggestion. I've sent an update to

[jira] [Created] (SPARK-8722) PR merge script should warn when merging a PR that has failed tests

2015-06-29 Thread Josh Rosen (JIRA)
Josh Rosen created SPARK-8722: - Summary: PR merge script should warn when merging a PR that has failed tests Key: SPARK-8722 URL: https://issues.apache.org/jira/browse/SPARK-8722 Project: Spark

[jira] [Closed] (SPARK-8437) Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or closed SPARK-8437. Resolution: Fixed Assignee: Sean Owen Fix Version/s: 1.4.2

[jira] [Updated] (SPARK-8715) ArrayOutOfBoundsException for DataFrameStatSuite.crosstab

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8715: - Labels: flaky-test (was: ) ArrayOutOfBoundsException for DataFrameStatSuite.crosstab

[jira] [Created] (SPARK-8720) PR #7036 breaks branch-1.4 because of a malformed comment

2015-06-29 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-8720: - Summary: PR #7036 breaks branch-1.4 because of a malformed comment Key: SPARK-8720 URL: https://issues.apache.org/jira/browse/SPARK-8720 Project: Spark Issue

[jira] [Commented] (SPARK-8450) PySpark write.parquet raises Unsupported datatype DecimalType()

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606809#comment-14606809 ] Apache Spark commented on SPARK-8450: - User 'x1-' has created a pull request for this

[jira] [Assigned] (SPARK-8450) PySpark write.parquet raises Unsupported datatype DecimalType()

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8450: --- Assignee: Apache Spark PySpark write.parquet raises Unsupported datatype DecimalType()

[jira] [Assigned] (SPARK-8450) PySpark write.parquet raises Unsupported datatype DecimalType()

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8450: --- Assignee: (was: Apache Spark) PySpark write.parquet raises Unsupported datatype

[jira] [Updated] (SPARK-8119) HeartbeatReceiver should not adjust application executor resources

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8119: - Component/s: Spark Core HeartbeatReceiver should not adjust application executor resources

[jira] [Updated] (SPARK-8687) Spark on yarn-client mode can't send `spark.yarn.credentials.file` to executor.

2015-06-29 Thread SaintBacchus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SaintBacchus updated SPARK-8687: Summary: Spark on yarn-client mode can't send `spark.yarn.credentials.file` to executor. (was:

[jira] [Updated] (SPARK-8407) complex type constructors: struct and named_struct

2015-06-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8407: Shepherd: Michael Armbrust complex type constructors: struct and named_struct

[jira] [Created] (SPARK-8717) Update mllib-data-types docs to include missing matrix Python examples

2015-06-29 Thread Rosstin Murphy (JIRA)
Rosstin Murphy created SPARK-8717: - Summary: Update mllib-data-types docs to include missing matrix Python examples Key: SPARK-8717 URL: https://issues.apache.org/jira/browse/SPARK-8717 Project:

[jira] [Updated] (SPARK-8628) Race condition in AbstractSparkSQLParser.parse

2015-06-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8628: Shepherd: Michael Armbrust Race condition in AbstractSparkSQLParser.parse

[jira] [Updated] (SPARK-8628) Race condition in AbstractSparkSQLParser.parse

2015-06-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8628: Target Version/s: 1.3.2, 1.4.1, 1.5.0 (was: 1.3.2, 1.4.1) Race condition in

[jira] [Updated] (SPARK-8716) Write tests for executor shared cache feature

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8716: - Summary: Write tests for executor shared cache feature (was: Remove executor shared cache feature)

[jira] [Updated] (SPARK-8716) Write tests for executor shared cache feature

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8716: - Assignee: (was: Josh Rosen) Write tests for executor shared cache feature

[jira] [Commented] (SPARK-8716) Write tests for executor shared cache feature

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606693#comment-14606693 ] Andrew Or commented on SPARK-8716: -- I've updated the description. Thanks for your input

[jira] [Commented] (SPARK-8437) Using directory path without wildcard for filename slow for large number of files with wholeTextFiles and binaryFiles

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606721#comment-14606721 ] Andrew Or commented on SPARK-8437: -- The merged PR involves only documentation changes. I

[jira] [Updated] (SPARK-8653) Add constraint for Children expression for data type

2015-06-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8653: Shepherd: Reynold Xin Add constraint for Children expression for data type

[jira] [Updated] (SPARK-8119) HeartbeatReceiver should not adjust application executor resources

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8119: - Summary: HeartbeatReceiver should not adjust application executor resources (was: HeartbeatReceiver

[jira] [Updated] (SPARK-8308) add missing save load for python doc example and tune down MatrixFactorization iterations

2015-06-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-8308: - Shepherd: Joseph K. Bradley add missing save load for python doc example and tune down

[jira] [Commented] (SPARK-6129) Add a section in user guide for model evaluation

2015-06-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606871#comment-14606871 ] Joseph K. Bradley commented on SPARK-6129: -- Please do, thanks! Add a section in

[jira] [Commented] (SPARK-8236) misc function: crc32

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606890#comment-14606890 ] Apache Spark commented on SPARK-8236: - User 'qiansl127' has created a pull request for

[jira] [Commented] (SPARK-8592) CoarseGrainedExecutorBackend: Cannot register with driver => NPE

2015-06-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606927#comment-14606927 ] Apache Spark commented on SPARK-8592: - User 'xuchenCN' has created a pull request for

[jira] [Commented] (SPARK-8722) PR merge script should warn when merging a PR that has failed tests

2015-06-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606928#comment-14606928 ] Josh Rosen commented on SPARK-8722: --- Also, this feature needs to block on being able to

[jira] [Commented] (SPARK-8722) PR merge script should warn when merging a PR that has failed tests

2015-06-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606925#comment-14606925 ] Josh Rosen commented on SPARK-8722: --- If we wanted to get _really_ fancy, we could warn

[jira] [Updated] (SPARK-8690) Add a setting to disable SparkSQL parquet schema merge by using datasource API

2015-06-29 Thread Cheng Lian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Lian updated SPARK-8690: -- Shepherd: Cheng Lian Add a setting to disable SparkSQL parquet schema merge by using datasource API

[jira] [Comment Edited] (SPARK-8130) spark.files.useFetchCache should be off by default

2015-06-29 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606698#comment-14606698 ] Andrew Or edited comment on SPARK-8130 at 6/30/15 12:07 AM:

[jira] [Commented] (SPARK-8703) Add CountVectorizer as a ml transformer to convert document to words count vector

2015-06-29 Thread Feynman Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606700#comment-14606700 ] Feynman Liang commented on SPARK-8703: -- This seems to extend HashingTF by adding * a
