[jira] [Updated] (SPARK-22229) SPIP: RDMA Accelerated Shuffle Engine

2019-01-25 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-9: -- Affects Version/s: 3.0.0 2.4.0 > SPIP: RDMA Accelerated Shuffle Engine

[jira] [Commented] (SPARK-22229) SPIP: RDMA Accelerated Shuffle Engine

2019-01-25 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16752282#comment-16752282 ] Peter Rudenko commented on SPARK-9: --- [~S71955] - there's no yet PR to include SparkRDMA to

[jira] [Commented] (SPARK-7131) Move tree,forest implementation from spark.mllib to spark.ml

2015-12-09 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15049368#comment-15049368 ] Peter Rudenko commented on SPARK-7131: -- Please remove final classes from RF and GBM models in ml

[jira] [Updated] (SPARK-10870) Criteo Display Advertising Challenge dataset

2015-09-29 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-10870: -- Description: Very useful dataset to test pipeline because of: # "Big data" dataset - original

[jira] [Created] (SPARK-10870) Criteo Display Advertising Challenge dataset

2015-09-29 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-10870: - Summary: Criteo Display Advertising Challenge dataset Key: SPARK-10870 URL: https://issues.apache.org/jira/browse/SPARK-10870 Project: Spark Issue Type:

[jira] [Updated] (SPARK-10870) Criteo Display Advertising Challenge

2015-09-29 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-10870: -- Summary: Criteo Display Advertising Challenge (was: Criteo Display Advertising Challenge

[jira] [Created] (SPARK-9170) ORC data source creates a schema with lowercase table names

2015-07-18 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-9170: Summary: ORC data source creates a schema with lowercase table names Key: SPARK-9170 URL: https://issues.apache.org/jira/browse/SPARK-9170 Project: Spark

[jira] [Updated] (SPARK-9170) ORC data source creates a schema with lowercase table names

2015-07-18 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-9170: - Description: Steps to reproduce: {code} sqlContext.range(0, 10).select('id as

[jira] [Created] (SPARK-8480) Add setName for Dataframe

2015-06-19 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-8480: Summary: Add setName for Dataframe Key: SPARK-8480 URL: https://issues.apache.org/jira/browse/SPARK-8480 Project: Spark Issue Type: Wish

[jira] [Updated] (SPARK-8480) Add setName for Dataframe

2015-06-19 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-8480: - Description: Rdd has a method setName, so in spark UI, it's more easily to understand what's this

[jira] [Created] (SPARK-8442) FitTransform method for pipeline

2015-06-18 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-8442: Summary: FitTransform method for pipeline Key: SPARK-8442 URL: https://issues.apache.org/jira/browse/SPARK-8442 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-6901) ParamGridBuilder.build with no grids should return an empty array

2015-04-14 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-6901: - Description: ParamGridBuilder.build with no grids returns array with an empty param map. {code}

[jira] [Created] (SPARK-6901) ParamGridBuilder.build with no grids should return an empty array

2015-04-14 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-6901: Summary: ParamGridBuilder.build with no grids should return an empty array Key: SPARK-6901 URL: https://issues.apache.org/jira/browse/SPARK-6901 Project: Spark

[jira] [Comment Edited] (SPARK-5114) Should Evaluator be a PipelineStage

2015-04-08 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483335#comment-14483335 ] Peter Rudenko edited comment on SPARK-5114 at 4/8/15 2:14 PM: --

[jira] [Commented] (SPARK-5114) Should Evaluator be a PipelineStage

2015-04-07 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14483335#comment-14483335 ] Peter Rudenko commented on SPARK-5114: -- +1 for should. For my use case (create

[jira] [Commented] (SPARK-3702) Standardize MLlib classes for learners, models

2015-04-06 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14481342#comment-14481342 ] Peter Rudenko commented on SPARK-3702: -- For trees based algorithms curious whether

[jira] [Comment Edited] (SPARK-3702) Standardize MLlib classes for learners, models

2015-04-06 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14481342#comment-14481342 ] Peter Rudenko edited comment on SPARK-3702 at 4/6/15 4:06 PM: --

[jira] [Commented] (SPARK-2243) Support multiple SparkContexts in the same JVM

2015-03-11 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14357591#comment-14357591 ] Peter Rudenko commented on SPARK-2243: -- Unfortunatelly it doesn't work in

[jira] [Commented] (SPARK-3477) Clean up code in Yarn Client / ClientBase

2015-03-09 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14353784#comment-14353784 ] Peter Rudenko commented on SPARK-3477: -- +1 to return these classes to public. There's

[jira] [Commented] (SPARK-5844) Optimize Pipeline.fit for ParamGrid

2015-02-16 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14323253#comment-14323253 ] Peter Rudenko commented on SPARK-5844: -- Here's a solution i came up with. Maybe would

[jira] [Commented] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14320038#comment-14320038 ] Peter Rudenko commented on SPARK-4766: -- Very important feature that could make pretty

[jira] [Created] (SPARK-5804) Explicitly manage cache in Crossvalidation k-fold loop

2015-02-13 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-5804: Summary: Explicitly manage cache in Crossvalidation k-fold loop Key: SPARK-5804 URL: https://issues.apache.org/jira/browse/SPARK-5804 Project: Spark Issue

[jira] [Updated] (SPARK-5807) Parallel grid search

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5807: - Description: Right now in CrossValidator for each fold combination and ParamGrid hyperparameter

[jira] [Created] (SPARK-5807) Parallel grid search

2015-02-13 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-5807: Summary: Parallel grid search Key: SPARK-5807 URL: https://issues.apache.org/jira/browse/SPARK-5807 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-5807) Parallel grid search

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5807: - Description: Right now in CrossValidator for each fold combination and ParamGrid hyperparameter

[jira] [Updated] (SPARK-5796) Do not transform data on last estimator in Pipeline

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5796: - Priority: Minor (was: Major) Do not transform data on last estimator in Pipeline

[jira] [Created] (SPARK-5796) Do not transform data on last estimator in Pipeline

2015-02-13 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-5796: Summary: Do not transform data on last estimator in Pipeline Key: SPARK-5796 URL: https://issues.apache.org/jira/browse/SPARK-5796 Project: Spark Issue

[jira] [Updated] (SPARK-5796) Do not transform data on last estimator in Pipeline

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5796: - Description: If it's a last stage in Pipeline there's no need to transform data, since there's no

[jira] [Updated] (SPARK-5796) Do not transform data on a last stage in Pipeline

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5796: - Summary: Do not transform data on a last stage in Pipeline (was: Do not transform data on last

[jira] [Updated] (SPARK-5796) Do not transform data on a last estimator in Pipeline

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5796: - Summary: Do not transform data on a last estimator in Pipeline (was: Do not transform data on a

[jira] [Updated] (SPARK-5796) Do not transform data on a last estimator in Pipeline

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5796: - Description: If it's a last estimator in Pipeline there's no need to transform data, since

[jira] [Updated] (SPARK-5796) Do not transform data on a last estimator in Pipeline

2015-02-13 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-5796: - Affects Version/s: (was: 1.2.1) 1.3.0 Do not transform data on a last

[jira] [Commented] (SPARK-5288) Stabilize Spark SQL data type API followup

2015-01-28 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14295128#comment-14295128 ] Peter Rudenko commented on SPARK-5288: -- NumericType should be public. Here's a use

[jira] [Comment Edited] (SPARK-5288) Stabilize Spark SQL data type API followup

2015-01-28 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14295128#comment-14295128 ] Peter Rudenko edited comment on SPARK-5288 at 1/28/15 1:25 PM:

[jira] [Created] (SPARK-5455) Add MultipleTransformer abstract class

2015-01-28 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-5455: Summary: Add MultipleTransformer abstract class Key: SPARK-5455 URL: https://issues.apache.org/jira/browse/SPARK-5455 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-4766) ML Estimator Params should subclass Transformer Params

2015-01-16 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14280571#comment-14280571 ] Peter Rudenko commented on SPARK-4766: -- Also make a traits that extends Params

[jira] [Updated] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-4101: - Description: 1) Would be nice to be able to retrieve underlying model map, to be able to work

[jira] [Updated] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-4101: - Description: 1) Would be nice to be able to retrieve underlying model map, to be able to work

[jira] [Commented] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229921#comment-14229921 ] Peter Rudenko commented on SPARK-4101: -- Here's an interactive example:

[jira] [Commented] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14229943#comment-14229943 ] Peter Rudenko commented on SPARK-4101: -- But i want to be able to extend it further

[jira] [Closed] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-12-01 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko closed SPARK-4101. Resolution: Fixed The main feature fixed by SPARK-4582, other functionality is not critical.

[jira] [Created] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-10-27 Thread Peter Rudenko (JIRA)
Peter Rudenko created SPARK-4101: Summary: [MLLIB] Improve API in Word2Vec model Key: SPARK-4101 URL: https://issues.apache.org/jira/browse/SPARK-4101 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-10-27 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-4101: - Description: 1) Would be nice to be able to retrieve underlying model map (make the model field

[jira] [Updated] (SPARK-4101) [MLLIB] Improve API in Word2Vec model

2014-10-27 Thread Peter Rudenko (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Rudenko updated SPARK-4101: - Description: 1) Would be nice to be able to retrieve underlying model map, to be able to work