[jira] [Updated] (SPARK-9043) Serialize key, value and combiner classes in ShuffleDependency

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-9043: - Component/s: (was: Spark Core) Shuffle > Serialize key, value and combiner classes

[jira] [Commented] (SPARK-10296) add preservesParitioning parameter to RDD.map

2015-09-01 Thread Esteban Donato (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725878#comment-14725878 ] Esteban Donato commented on SPARK-10296: any further thought on this issue? Do you think it

[jira] [Updated] (SPARK-9043) Serialize key, value and combiner classes in ShuffleDependency

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-9043: - Assignee: Matt Massie > Serialize key, value and combiner classes in ShuffleDependency >

[jira] [Updated] (SPARK-9043) Serialize key, value and combiner classes in ShuffleDependency

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-9043: - Target Version/s: 1.6.0 > Serialize key, value and combiner classes in ShuffleDependency >

[jira] [Commented] (SPARK-8518) Log-linear models for survival analysis

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725732#comment-14725732 ] Xiangrui Meng commented on SPARK-8518: -- The value in this column would be 0/1. Then we need to see

[jira] [Updated] (SPARK-8518) Log-linear models for survival analysis

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-8518: - Priority: Critical (was: Major) > Log-linear models for survival analysis >

[jira] [Commented] (SPARK-8518) Log-linear models for survival analysis

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725749#comment-14725749 ] Xiangrui Meng commented on SPARK-8518: -- [~yanboliang] Thanks for working on design doc! I think we

[jira] [Updated] (SPARK-10372) Tests for entire scheduler

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10372: -- Issue Type: Sub-task (was: Test) Parent: SPARK-8987 > Tests for entire scheduler >

[jira] [Commented] (SPARK-10288) Add a rest client for Spark on Yarn

2015-09-01 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725913#comment-14725913 ] Steve Loughran commented on SPARK-10288: Long Haul job submission. You can't currently submit

[jira] [Updated] (SPARK-10324) MLlib 1.6 Roadmap

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10324: -- Description: Following SPARK-8445, we created this master list for MLlib features we plan to

[jira] [Commented] (SPARK-8518) Log-linear models for survival analysis

2015-09-01 Thread Meihua Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725935#comment-14725935 ] Meihua Wu commented on SPARK-8518: -- For the reference implementations, recommend we consider this R

[jira] [Commented] (SPARK-10375) Setting the driver memory with SparkConf().set("spark.driver.memory","1g") does not work

2015-09-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725911#comment-14725911 ] Sean Owen commented on SPARK-10375: --- I don't think this is a problem in the sense that you would not be

[jira] [Commented] (SPARK-10320) Kafka Support new topic subscriptions without requiring restart of the streaming context

2015-09-01 Thread Sudarshan Kadambi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725687#comment-14725687 ] Sudarshan Kadambi commented on SPARK-10320: --- "That's not the way checkpoints work. ..." Sure,

[jira] [Assigned] (SPARK-10372) Tests for entire scheduler

2015-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10372: Assignee: Apache Spark (was: Imran Rashid) > Tests for entire scheduler >

[jira] [Resolved] (SPARK-10377) Cassandra connector affected by backport change

2015-09-01 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-10377. -- Resolution: Fixed Fix Version/s: 1.4.2 Issue resolved by pull request 8545

[jira] [Created] (SPARK-10402) Add scaladoc for default values of params in ML

2015-09-01 Thread holdenk (JIRA)
holdenk created SPARK-10402: --- Summary: Add scaladoc for default values of params in ML Key: SPARK-10402 URL: https://issues.apache.org/jira/browse/SPARK-10402 Project: Spark Issue Type:

[jira] [Created] (SPARK-10403) UnsafeRowSerializer can't work with UnsafeShuffleManager (tungsten-sort)

2015-09-01 Thread Davies Liu (JIRA)
Davies Liu created SPARK-10403: -- Summary: UnsafeRowSerializer can't work with UnsafeShuffleManager (tungsten-sort) Key: SPARK-10403 URL: https://issues.apache.org/jira/browse/SPARK-10403 Project: Spark

[jira] [Commented] (SPARK-10320) Kafka Support new topic subscriptions without requiring restart of the streaming context

2015-09-01 Thread Sudarshan Kadambi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725951#comment-14725951 ] Sudarshan Kadambi commented on SPARK-10320: --- "it's almost certainly not the same thread". Yes,

[jira] [Updated] (SPARK-10192) Test for fetch failure in a shared dependency for "skipped" stages

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10192: -- Component/s: Tests > Test for fetch failure in a shared dependency for "skipped" stages >

[jira] [Updated] (SPARK-10372) Add end-to-end tests for the scheduling code

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10372: -- Summary: Add end-to-end tests for the scheduling code (was: Add end-to-end tests for the scheduler)

[jira] [Updated] (SPARK-10372) Add end-to-end tests for the scheduler

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10372: -- Summary: Add end-to-end tests for the scheduler (was: Tests for entire scheduler) > Add end-to-end

[jira] [Updated] (SPARK-10192) Test for fetch failure in a shared dependency for "skipped" stages

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10192: -- Issue Type: Sub-task (was: Test) Parent: SPARK-8987 > Test for fetch failure in a shared

[jira] [Updated] (SPARK-10370) After a stages map outputs are registered, all running attempts should be marked as zombies

2015-09-01 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-10370: - Component/s: (was: Spark Core) Scheduler > After a stages map outputs are

[jira] [Commented] (SPARK-10375) Setting the driver memory with SparkConf().set("spark.driver.memory","1g") does not work

2015-09-01 Thread Alex Rovner (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725909#comment-14725909 ] Alex Rovner commented on SPARK-10375: - [~srowen] Shall we re-open? > Setting the driver memory with

[jira] [Commented] (SPARK-10288) Add a rest client for Spark on Yarn

2015-09-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725747#comment-14725747 ] Marcelo Vanzin commented on SPARK-10288: Hi Saisai, I have a question: why do you need this? Why

[jira] [Resolved] (SPARK-9679) Add python interface for ml.feature.StopWordsRemover

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-9679. -- Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8118

[jira] [Updated] (SPARK-9679) Add python interface for ml.feature.StopWordsRemover

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-9679: - Assignee: holdenk > Add python interface for ml.feature.StopWordsRemover >

[jira] [Commented] (SPARK-10288) Add a rest client for Spark on Yarn

2015-09-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725955#comment-14725955 ] Marcelo Vanzin commented on SPARK-10288: So can that instead be used as the reasoning in the

[jira] [Commented] (SPARK-10320) Kafka Support new topic subscriptions without requiring restart of the streaming context

2015-09-01 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726076#comment-14726076 ] Cody Koeninger commented on SPARK-10320: You would supply a function, similar to the way

[jira] [Updated] (SPARK-10372) Add end-to-end tests for the scheduling code

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10372: -- Component/s: (was: Spark Core) Tests Scheduler > Add end-to-end

[jira] [Updated] (SPARK-10372) Add end-to-end tests for the scheduling code

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10372: -- Target Version/s: 1.6.0 > Add end-to-end tests for the scheduling code >

[jira] [Updated] (SPARK-10192) Test for fetch failure in a shared dependency for "skipped" stages

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10192: -- Component/s: (was: Spark Core) Scheduler > Test for fetch failure in a shared

[jira] [Updated] (SPARK-10372) Add end-to-end tests for the scheduling code

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10372: -- Fix Version/s: (was: 1.6.0) > Add end-to-end tests for the scheduling code >

[jira] [Resolved] (SPARK-10398) Migrate Spark download page to use new lua mirroring scripts

2015-09-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-10398. --- Resolution: Fixed Fix Version/s: (was: 1.5.0) 1.5.1

[jira] [Updated] (SPARK-10394) Make GBTParams use shared "stepSize"

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10394: -- Assignee: Yanbo Liang Affects Version/s: 1.5.0 Target Version/s: 1.6.0 >

[jira] [Updated] (SPARK-10379) UnsafeShuffleExternalSorter should preserve first page

2015-09-01 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-10379: --- Target Version/s: 1.6.0, 1.5.1 (was: 1.5.0) > UnsafeShuffleExternalSorter should preserve first

[jira] [Commented] (SPARK-10372) Tests for entire scheduler

2015-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725708#comment-14725708 ] Apache Spark commented on SPARK-10372: -- User 'squito' has created a pull request for this issue:

[jira] [Assigned] (SPARK-10372) Tests for entire scheduler

2015-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10372: Assignee: Imran Rashid (was: Apache Spark) > Tests for entire scheduler >

[jira] [Commented] (SPARK-10320) Kafka Support new topic subscriptions without requiring restart of the streaming context

2015-09-01 Thread Cody Koeninger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725719#comment-14725719 ] Cody Koeninger commented on SPARK-10320: So if you're changing topics in an event handler, it's

[jira] [Commented] (SPARK-10288) Add a rest client for Spark on Yarn

2015-09-01 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14725750#comment-14725750 ] Marcelo Vanzin commented on SPARK-10288: /cc [~tgraves] who also might have questions / insights.

[jira] [Updated] (SPARK-8987) Increase test coverage of DAGScheduler

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-8987: - Issue Type: Umbrella (was: Bug) > Increase test coverage of DAGScheduler >

[jira] [Comment Edited] (SPARK-10199) Avoid using reflections for parquet model save

2015-09-01 Thread Vinod KC (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14724459#comment-14724459 ] Vinod KC edited comment on SPARK-10199 at 9/1/15 6:57 AM: -- [~mengxr] 1) I

[jira] [Created] (SPARK-10386) Model import/export for PrefixSpan

2015-09-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10386: - Summary: Model import/export for PrefixSpan Key: SPARK-10386 URL: https://issues.apache.org/jira/browse/SPARK-10386 Project: Spark Issue Type: Sub-task

[jira] [Comment Edited] (SPARK-8467) Add LDAModel.describeTopics() in Python

2015-09-01 Thread Hrishikesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14706257#comment-14706257 ] Hrishikesh edited comment on SPARK-8467 at 9/1/15 6:49 AM: ---

[jira] [Updated] (SPARK-10386) Model import/export for PrefixSpan

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10386: -- Description: Support save/load for PrefixSpanModel. Should be similar to save/load for

[jira] [Created] (SPARK-10389) support order by non-attribute grouping expression on Aggregate

2015-09-01 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-10389: --- Summary: support order by non-attribute grouping expression on Aggregate Key: SPARK-10389 URL: https://issues.apache.org/jira/browse/SPARK-10389 Project: Spark

[jira] [Comment Edited] (SPARK-7132) Add fit with validation set to spark.ml GBT

2015-09-01 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14724889#comment-14724889 ] Yanbo Liang edited comment on SPARK-7132 at 9/1/15 7:03 AM: I will work on

[jira] [Updated] (SPARK-10324) MLlib 1.6 Roadmap

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10324: -- Description: Following SPARK-8445, we created this master list for MLlib features we plan to

[jira] [Updated] (SPARK-10324) MLlib 1.6 Roadmap

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10324: -- Description: Following SPARK-8445, we created this master list for MLlib features we plan to

[jira] [Created] (SPARK-10384) Univariate statistics as UDAFs

2015-09-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10384: - Summary: Univariate statistics as UDAFs Key: SPARK-10384 URL: https://issues.apache.org/jira/browse/SPARK-10384 Project: Spark Issue Type: Umbrella

[jira] [Created] (SPARK-10385) Bivariate statistics as UDAFs

2015-09-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10385: - Summary: Bivariate statistics as UDAFs Key: SPARK-10385 URL: https://issues.apache.org/jira/browse/SPARK-10385 Project: Spark Issue Type: Umbrella

[jira] [Comment Edited] (SPARK-8467) Add LDAModel.describeTopics() in Python

2015-09-01 Thread Hrishikesh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14706257#comment-14706257 ] Hrishikesh edited comment on SPARK-8467 at 9/1/15 6:50 AM: ---

[jira] [Comment Edited] (SPARK-7132) Add fit with validation set to spark.ml GBT

2015-09-01 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14724889#comment-14724889 ] Yanbo Liang edited comment on SPARK-7132 at 9/1/15 7:02 AM: I will work on

[jira] [Created] (SPARK-10387) Code generation for decision tree

2015-09-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10387: - Summary: Code generation for decision tree Key: SPARK-10387 URL: https://issues.apache.org/jira/browse/SPARK-10387 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-7132) Add fit with validation set to spark.ml GBT

2015-09-01 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14724889#comment-14724889 ] Yanbo Liang commented on SPARK-7132: I will work on this issue. [~josephkb] I propose another way to

[jira] [Updated] (SPARK-10324) MLlib 1.6 Roadmap

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10324: -- Description: Following SPARK-8445, we created this master list for MLlib features we plan to

[jira] [Commented] (SPARK-10324) MLlib 1.6 Roadmap

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14724899#comment-14724899 ] Xiangrui Meng commented on SPARK-10324: --- Changed priority to blocker to make this list more

[jira] [Updated] (SPARK-10324) MLlib 1.6 Roadmap

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10324: -- Priority: Blocker (was: Critical) > MLlib 1.6 Roadmap > - > >

[jira] [Created] (SPARK-10388) Public dataset loader interface

2015-09-01 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-10388: - Summary: Public dataset loader interface Key: SPARK-10388 URL: https://issues.apache.org/jira/browse/SPARK-10388 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-10324) MLlib 1.6 Roadmap

2015-09-01 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-10324: -- Description: Following SPARK-8445, we created this master list for MLlib features we plan to

[jira] [Assigned] (SPARK-10389) support order by non-attribute grouping expression on Aggregate

2015-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-10389: Assignee: Apache Spark > support order by non-attribute grouping expression on Aggregate

[jira] [Commented] (SPARK-10389) support order by non-attribute grouping expression on Aggregate

2015-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14724905#comment-14724905 ] Apache Spark commented on SPARK-10389: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Commented] (SPARK-10408) Autoencoder

2015-09-01 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726423#comment-14726423 ] Alexander Ulanov commented on SPARK-10408: -- Added implementation for (1) that is basic deep

[jira] [Created] (SPARK-10409) Multilayer perceptron regression

2015-09-01 Thread Alexander Ulanov (JIRA)
Alexander Ulanov created SPARK-10409: Summary: Multilayer perceptron regression Key: SPARK-10409 URL: https://issues.apache.org/jira/browse/SPARK-10409 Project: Spark Issue Type:

[jira] [Commented] (SPARK-10409) Multilayer perceptron regression

2015-09-01 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726435#comment-14726435 ] Alexander Ulanov commented on SPARK-10409: -- Basic implementation with the current ML api can be

[jira] [Updated] (SPARK-10408) Autoencoder

2015-09-01 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Ulanov updated SPARK-10408: - Description: Goal: Implement various types of autoencoders Requirements: 1)Basic (deep)

[jira] [Commented] (SPARK-10314) [CORE]RDD persist to OFF_HEAP tachyon got block rdd_x_x not found exception when parallelism is big than data split size

2015-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726551#comment-14726551 ] Apache Spark commented on SPARK-10314: -- User 'romansew' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-10408) Autoencoder

2015-09-01 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726423#comment-14726423 ] Alexander Ulanov edited comment on SPARK-10408 at 9/1/15 11:55 PM: ---

[jira] [Comment Edited] (SPARK-10408) Autoencoder

2015-09-01 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726423#comment-14726423 ] Alexander Ulanov edited comment on SPARK-10408 at 9/1/15 11:55 PM: ---

[jira] [Comment Edited] (SPARK-10408) Autoencoder

2015-09-01 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726423#comment-14726423 ] Alexander Ulanov edited comment on SPARK-10408 at 9/1/15 11:54 PM: ---

[jira] [Updated] (SPARK-5269) BlockManager.dataDeserialize always creates a new serializer instance

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-5269: - Target Version/s: 1.6.0 > BlockManager.dataDeserialize always creates a new serializer instance >

[jira] [Comment Edited] (SPARK-10405) Support takeOrdered and topK values per key

2015-09-01 Thread ashish shenoy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726331#comment-14726331 ] ashish shenoy edited comment on SPARK-10405 at 9/1/15 10:39 PM: [~srowen]

[jira] [Created] (SPARK-10410) spark 1.4.1 kill command does not work with streaming job.

2015-09-01 Thread Bryce Ageno (JIRA)
Bryce Ageno created SPARK-10410: --- Summary: spark 1.4.1 kill command does not work with streaming job. Key: SPARK-10410 URL: https://issues.apache.org/jira/browse/SPARK-10410 Project: Spark

[jira] [Updated] (SPARK-4122) Add library to write data back to Kafka

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-4122: - Target Version/s: 1.6.0 > Add library to write data back to Kafka >

[jira] [Commented] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort)

2015-09-01 Thread Nick Xie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726658#comment-14726658 ] Nick Xie commented on SPARK-3655: - Thanks for the quick changes to rid of Ordering dependency. Since I am

[jira] [Created] (SPARK-10412) In SQL tab, show execution memory per physical operator

2015-09-01 Thread Andrew Or (JIRA)
Andrew Or created SPARK-10412: - Summary: In SQL tab, show execution memory per physical operator Key: SPARK-10412 URL: https://issues.apache.org/jira/browse/SPARK-10412 Project: Spark Issue

[jira] [Updated] (SPARK-10410) spark 1.4.1 kill command does not work with streaming job.

2015-09-01 Thread Bryce Ageno (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryce Ageno updated SPARK-10410: Shepherd: (was: Bryce Ageno) > spark 1.4.1 kill command does not work with streaming job. >

[jira] [Commented] (SPARK-3655) Support sorting of values in addition to keys (i.e. secondary sort)

2015-09-01 Thread Koert Kuipers (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726664#comment-14726664 ] Koert Kuipers commented on SPARK-3655: -- Did you build a version that does not use Optional for java

[jira] [Updated] (SPARK-10408) Autoencoder

2015-09-01 Thread Alexander Ulanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Ulanov updated SPARK-10408: - Issue Type: Umbrella (was: Improvement) > Autoencoder > --- > >

[jira] [Created] (SPARK-10408) Autoencoder

2015-09-01 Thread Alexander Ulanov (JIRA)
Alexander Ulanov created SPARK-10408: Summary: Autoencoder Key: SPARK-10408 URL: https://issues.apache.org/jira/browse/SPARK-10408 Project: Spark Issue Type: Improvement

[jira] [Comment Edited] (SPARK-10314) [CORE]RDD persist to OFF_HEAP tachyon got block rdd_x_x not found exception when parallelism is big than data split size

2015-09-01 Thread Xiaoyu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726557#comment-14726557 ] Xiaoyu Wang edited comment on SPARK-10314 at 9/2/15 1:33 AM: - I resubmit the

[jira] [Commented] (SPARK-10314) [CORE]RDD persist to OFF_HEAP tachyon got block rdd_x_x not found exception when parallelism is big than data split size

2015-09-01 Thread Xiaoyu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726557#comment-14726557 ] Xiaoyu Wang commented on SPARK-10314: - I resubmit the pull request on the master branch > [CORE]RDD

[jira] [Created] (SPARK-10411) In SQL tab move visualization above explain output

2015-09-01 Thread Andrew Or (JIRA)
Andrew Or created SPARK-10411: - Summary: In SQL tab move visualization above explain output Key: SPARK-10411 URL: https://issues.apache.org/jira/browse/SPARK-10411 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-3586) Support nested directories in Spark Streaming

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-3586: - Labels: (was: patch) > Support nested directories in Spark Streaming >

[jira] [Updated] (SPARK-3586) Support nested directories in Spark Streaming

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-3586: - Target Version/s: 1.6.0 > Support nested directories in Spark Streaming >

[jira] [Updated] (SPARK-10034) add regression test for sort on

2015-09-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-10034: Summary: add regression test for sort on (was: Can't analyze Sort on Aggregate with aggregation

[jira] [Updated] (SPARK-10034) add regression test for Sort on Aggregate

2015-09-01 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-10034: Summary: add regression test for Sort on Aggregate (was: add regression test for sort on ) > add

[jira] [Updated] (SPARK-10081) Skip re-computing getMissingParentStages in DAGScheduler

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10081: -- Target Version/s: 1.6.0 > Skip re-computing getMissingParentStages in DAGScheduler >

[jira] [Updated] (SPARK-10081) Skip re-computing getMissingParentStages in DAGScheduler

2015-09-01 Thread Andrew Or (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Or updated SPARK-10081: -- Issue Type: Improvement (was: Bug) > Skip re-computing getMissingParentStages in DAGScheduler >

[jira] [Assigned] (SPARK-9769) Add Python API for ml.feature.CountVectorizerModel

2015-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9769: --- Assignee: (was: Apache Spark) > Add Python API for ml.feature.CountVectorizerModel >

[jira] [Commented] (SPARK-9769) Add Python API for ml.feature.CountVectorizerModel

2015-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726265#comment-14726265 ] Apache Spark commented on SPARK-9769: - User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-9769) Add Python API for ml.feature.CountVectorizerModel

2015-09-01 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-9769: --- Assignee: Apache Spark > Add Python API for ml.feature.CountVectorizerModel >

[jira] [Resolved] (SPARK-10392) Pyspark - Wrong DateType support on JDBC connection

2015-09-01 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10392. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8556

[jira] [Comment Edited] (SPARK-10405) Support takeOrdered and topK values per key

2015-09-01 Thread ashish shenoy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726331#comment-14726331 ] ashish shenoy edited comment on SPARK-10405 at 9/1/15 10:38 PM: [~srowen]

[jira] [Created] (SPARK-10407) Possible Stack-overflow using InheritableThreadLocal nested-properties for SparkContext.localProperties

2015-09-01 Thread Matt Cheah (JIRA)
Matt Cheah created SPARK-10407: -- Summary: Possible Stack-overflow using InheritableThreadLocal nested-properties for SparkContext.localProperties Key: SPARK-10407 URL:

[jira] [Resolved] (SPARK-10162) PySpark filters with datetimes mess up when datetimes have timezones.

2015-09-01 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu resolved SPARK-10162. Resolution: Fixed Fix Version/s: 1.6.0 Issue resolved by pull request 8555

[jira] [Created] (SPARK-10406) Document spark on yarn distributed cache symlink functionality

2015-09-01 Thread Thomas Graves (JIRA)
Thomas Graves created SPARK-10406: - Summary: Document spark on yarn distributed cache symlink functionality Key: SPARK-10406 URL: https://issues.apache.org/jira/browse/SPARK-10406 Project: Spark

[jira] [Commented] (SPARK-10405) Support takeOrdered and topK values per key

2015-09-01 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14726296#comment-14726296 ] Sean Owen commented on SPARK-10405: --- This is fairly easy already with foldByKey and a priority queue --

[jira] [Resolved] (SPARK-4223) Support * (meaning all users) as part of the acls

2015-09-01 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-4223. Resolution: Fixed Fix Version/s: 1.6.0 > Support * (meaning all users) as part of the acls >

[jira] [Created] (SPARK-10404) Worker should terminate previous executor before launch new one

2015-09-01 Thread Davies Liu (JIRA)
Davies Liu created SPARK-10404: -- Summary: Worker should terminate previous executor before launch new one Key: SPARK-10404 URL: https://issues.apache.org/jira/browse/SPARK-10404 Project: Spark
