[jira] [Updated] (SPARK-7690) MulticlassClassificationEvaluator for tuning Multiclass Classifiers

2015-06-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7690: -- Target Version/s: 1.5.0 (was: 1.4.1) MulticlassClassificationEvaluator for tuning Multiclass

[jira] [Updated] (SPARK-7689) Deprecate spark.cleaner.ttl

2015-06-24 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-7689: -- Target Version/s: 1.5.0 (was: 1.4.1) Deprecate spark.cleaner.ttl ---

[jira] [Resolved] (SPARK-7088) [REGRESSION] Spark 1.3.1 breaks analysis of third-party logical plans

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-7088. - Resolution: Fixed Fix Version/s: 1.5.0 Issue resolved by pull request 6853

[jira] [Created] (SPARK-8598) Implementation of 1-sample, two-sided, Kolmogorov Smirnov Test for RDDs

2015-06-24 Thread Jose Cambronero (JIRA)
Jose Cambronero created SPARK-8598: -- Summary: Implementation of 1-sample, two-sided, Kolmogorov Smirnov Test for RDDs Key: SPARK-8598 URL: https://issues.apache.org/jira/browse/SPARK-8598 Project:

[jira] [Commented] (SPARK-8167) Tasks that fail due to YARN preemption can cause job failure

2015-06-24 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600012#comment-14600012 ] Matt Cheah commented on SPARK-8167: --- What's curious here as I'm trying to design this is

[jira] [Commented] (SPARK-8337) KafkaUtils.createDirectStream for python is lacking API/feature parity with the Scala/Java version

2015-06-24 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-8337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600010#comment-14600010 ] Juan Rodríguez Hortalá commented on SPARK-8337: --- Hi, As I said above, I

[jira] [Updated] (SPARK-8410) Hive VersionsSuite RuntimeException

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8410: Assignee: Burak Yavuz Hive VersionsSuite RuntimeException

[jira] [Updated] (SPARK-8586) SQL add jar command does not work well with Scala REPL

2015-06-24 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai updated SPARK-8586: Target Version/s: 1.5.0, 1.4.2 (was: 1.4.1, 1.5.0, 1.4.2) SQL add jar command does not work well with

[jira] [Updated] (SPARK-8506) SparkR does not provide an easy way to depend on Spark Packages when performing init from inside of R

2015-06-24 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman updated SPARK-8506: - Assignee: holdenk SparkR does not provide an easy way to depend on Spark

[jira] [Resolved] (SPARK-8399) Overlap between histograms and axis' name in Spark Streaming UI

2015-06-24 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-8399. -- Resolution: Fixed Assignee: Benjamin Fradet Fix Version/s: 1.4.2

[jira] [Resolved] (SPARK-8127) KafkaRDD optimize count() take() isEmpty()

2015-06-24 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-8127. -- Resolution: Fixed KafkaRDD optimize count() take() isEmpty()

[jira] [Updated] (SPARK-8597) DataFrame partitionBy memory pressure scales extremely poorly

2015-06-24 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-8597: -- Summary: DataFrame partitionBy memory pressure scales extremely poorly (was: DataFrame partitionBy

[jira] [Created] (SPARK-8597) DataFrame partitionBy scales extremely poorly

2015-06-24 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-8597: - Summary: DataFrame partitionBy scales extremely poorly Key: SPARK-8597 URL: https://issues.apache.org/jira/browse/SPARK-8597 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-8597) DataFrame partitionBy memory pressure scales extremely poorly

2015-06-24 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-8597: -- Attachment: table.csv DataFrame partitionBy memory pressure scales extremely poorly

[jira] [Commented] (SPARK-8596) Install and configure RStudio server on Spark EC2

2015-06-24 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14599940#comment-14599940 ] Shivaram Venkataraman commented on SPARK-8596: -- I think it should technically

[jira] [Commented] (SPARK-8586) SQL add jar command does not work well with Scala REPL

2015-06-24 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14599943#comment-14599943 ] Yin Huai commented on SPARK-8586: - The workaround is to use {{--jars}} to add the jar when

[jira] [Resolved] (SPARK-8506) SparkR does not provide an easy way to depend on Spark Packages when performing init from inside of R

2015-06-24 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-8506. -- Resolution: Fixed Fix Version/s: 1.5.0 1.4.1 Issue

[jira] [Updated] (SPARK-8483) Remove commons-lang3 depedency from flume-sink

2015-06-24 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-8483: - Fix Version/s: (was: 1.4.2) 1.4.1 Remove commons-lang3 depedency from

[jira] [Commented] (SPARK-8597) DataFrame partitionBy memory pressure scales extremely poorly

2015-06-24 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14599977#comment-14599977 ] Matt Cheah commented on SPARK-8597: --- I've attached the CSV file used in the test.

[jira] [Updated] (SPARK-8597) DataFrame partitionBy memory pressure scales extremely poorly

2015-06-24 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah updated SPARK-8597: -- Description: I'm running into a strange memory scaling issue when using the partitionBy feature of

[jira] [Commented] (SPARK-8598) Implementation of 1-sample, two-sided, Kolmogorov Smirnov Test for RDDs

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600043#comment-14600043 ] Apache Spark commented on SPARK-8598: - User 'josepablocam' has created a pull request

[jira] [Assigned] (SPARK-8598) Implementation of 1-sample, two-sided, Kolmogorov Smirnov Test for RDDs

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8598: --- Assignee: Apache Spark Implementation of 1-sample, two-sided, Kolmogorov Smirnov Test for

[jira] [Assigned] (SPARK-8598) Implementation of 1-sample, two-sided, Kolmogorov Smirnov Test for RDDs

2015-06-24 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-8598: --- Assignee: (was: Apache Spark) Implementation of 1-sample, two-sided, Kolmogorov Smirnov

[jira] [Updated] (SPARK-8588) Could not use concat with UDF in where clause

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8588: Assignee: Wenchen Fan Could not use concat with UDF in where clause

[jira] [Updated] (SPARK-8588) Could not use concat with UDF in where clause

2015-06-24 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-8588: Target Version/s: 1.4.1, 1.5.0 Could not use concat with UDF in where clause

[jira] [Commented] (SPARK-1503) Implement Nesterov's accelerated first-order method

2015-06-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600056#comment-14600056 ] Joseph K. Bradley commented on SPARK-1503: -- Switching to absolute tolerance

[jira] [Commented] (SPARK-3382) GradientDescent convergence tolerance

2015-06-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600058#comment-14600058 ] Joseph K. Bradley commented on SPARK-3382: -- [~lewuathe] I'm sorry for the

[jira] [Commented] (SPARK-8510) NumPy arrays and matrices as values in sequence files

2015-06-24 Thread Peter Aberline (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600068#comment-14600068 ] Peter Aberline commented on SPARK-8510: --- See PR at

[jira] [Commented] (SPARK-8540) KMeans-based outlier detection

2015-06-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600076#comment-14600076 ] Joseph K. Bradley commented on SPARK-8540: -- That's correct: For (b), the user

[jira] [Commented] (SPARK-8587) Return cost and cluster index KMeansModel.predict

2015-06-24 Thread Rakesh Chalasani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600079#comment-14600079 ] Rakesh Chalasani commented on SPARK-8587: - +1 for this. But we can't do what you

[jira] [Commented] (SPARK-6791) Model export/import for spark.ml: meta-algorithms

2015-06-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14600080#comment-14600080 ] Joseph K. Bradley commented on SPARK-6791: -- Can you pick from the Pipeline issues

<    1   2   3   4