[jira] [Created] (SPARK-24114) improve instrumentation for spark.ml.recommendation

2018-04-27 Thread yogesh garg (JIRA)
yogesh garg created SPARK-24114: --- Summary: improve instrumentation for spark.ml.recommendation Key: SPARK-24114 URL: https://issues.apache.org/jira/browse/SPARK-24114 Project: Spark Issue

[jira] [Commented] (SPARK-24114) improve instrumentation for spark.ml.recommendation

2018-04-27 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16457110#comment-16457110 ] yogesh garg commented on SPARK-24114: - I would like to work on this. > improve instrumentation for

[jira] [Commented] (SPARK-24115) improve instrumentation for spark.ml.tuning

2018-04-27 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16457113#comment-16457113 ] yogesh garg commented on SPARK-24115: - I would like to work on this. > improve instrumentation for

[jira] [Created] (SPARK-24115) improve instrumentation for spark.ml.tuning

2018-04-27 Thread yogesh garg (JIRA)
yogesh garg created SPARK-24115: --- Summary: improve instrumentation for spark.ml.tuning Key: SPARK-24115 URL: https://issues.apache.org/jira/browse/SPARK-24115 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-23562) RFormula handleInvalid should handle invalid values in non-string columns.

2018-03-07 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390434#comment-16390434 ] yogesh garg edited comment on SPARK-23562 at 3/7/18 11:33 PM: -- Error in

[jira] [Comment Edited] (SPARK-23562) RFormula handleInvalid should handle invalid values in non-string columns.

2018-03-07 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390434#comment-16390434 ] yogesh garg edited comment on SPARK-23562 at 3/7/18 11:30 PM: -- Error in

[jira] [Commented] (SPARK-23562) RFormula handleInvalid should handle invalid values in non-string columns.

2018-03-07 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390434#comment-16390434 ] yogesh garg commented on SPARK-23562: - Error in question can be reproduced with the following code in

[jira] [Commented] (SPARK-22915) ML test for StructuredStreaming: spark.ml.feature, N-Z

2018-02-27 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16378978#comment-16378978 ] yogesh garg commented on SPARK-22915: - I have started working on this and can raise a PR soon. Thanks

[jira] [Commented] (SPARK-22915) ML test for StructuredStreaming: spark.ml.feature, N-Z

2018-02-27 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16379035#comment-16379035 ] yogesh garg commented on SPARK-22915: - Ah, doesn't make sense for me to take it then. Thanks! Please

[jira] [Commented] (SPARK-18630) PySpark ML memory leak

2018-02-28 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16381244#comment-16381244 ] yogesh garg commented on SPARK-18630: - I would like to take this. If I understand correctly, moving

[jira] [Created] (SPARK-23690) VectorAssembler should have handleInvalid to handle columns with null values

2018-03-14 Thread yogesh garg (JIRA)
yogesh garg created SPARK-23690: --- Summary: VectorAssembler should have handleInvalid to handle columns with null values Key: SPARK-23690 URL: https://issues.apache.org/jira/browse/SPARK-23690 Project:

[jira] [Created] (SPARK-23871) add python api for VectorAssembler handleInvalid

2018-04-04 Thread yogesh garg (JIRA)
yogesh garg created SPARK-23871: --- Summary: add python api for VectorAssembler handleInvalid Key: SPARK-23871 URL: https://issues.apache.org/jira/browse/SPARK-23871 Project: Spark Issue Type:

[jira] [Created] (SPARK-23870) Forward RFormula handleInvalid Param to VectorAssembler

2018-04-04 Thread yogesh garg (JIRA)
yogesh garg created SPARK-23870: --- Summary: Forward RFormula handleInvalid Param to VectorAssembler Key: SPARK-23870 URL: https://issues.apache.org/jira/browse/SPARK-23870 Project: Spark Issue

[jira] [Commented] (SPARK-23871) add python api for VectorAssembler handleInvalid

2018-04-05 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16427543#comment-16427543 ] yogesh garg commented on SPARK-23871: - I hadn't started working on this yet. Feel free to take it. >

[jira] [Comment Edited] (SPARK-23690) VectorAssembler should have handleInvalid to handle columns with null values

2018-03-19 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16405174#comment-16405174 ] yogesh garg edited comment on SPARK-23690 at 3/19/18 7:04 PM: -- In an offline

[jira] [Comment Edited] (SPARK-23690) VectorAssembler should have handleInvalid to handle columns with null values

2018-03-19 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16405174#comment-16405174 ] yogesh garg edited comment on SPARK-23690 at 3/19/18 7:03 PM: -- In an offline

[jira] [Commented] (SPARK-23690) VectorAssembler should have handleInvalid to handle columns with null values

2018-03-19 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16405174#comment-16405174 ] yogesh garg commented on SPARK-23690: - In an offline discussion with [~mrbago], we discussed the

[jira] [Commented] (SPARK-18630) PySpark ML memory leak

2018-03-01 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16382886#comment-16382886 ] yogesh garg commented on SPARK-18630: - After some discussion, I think it makes sense to move just the

[jira] [Updated] (SPARK-25901) Barrier mode spawns a bunch of threads that get collected on gc

2018-10-31 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yogesh garg updated SPARK-25901: Attachment: Screen Shot 2018-10-31 at 11.57.25 AM.png Screen Shot 2018-10-31 at

[jira] [Created] (SPARK-25901) Barrier mode spawns a bunch of threads that get collected on gc

2018-10-31 Thread yogesh garg (JIRA)
yogesh garg created SPARK-25901: --- Summary: Barrier mode spawns a bunch of threads that get collected on gc Key: SPARK-25901 URL: https://issues.apache.org/jira/browse/SPARK-25901 Project: Spark

[jira] [Commented] (SPARK-25901) Barrier mode spawns a bunch of threads that get collected on gc

2018-10-31 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16670573#comment-16670573 ] yogesh garg commented on SPARK-25901: - I am working on this task. > Barrier mode spawns a bunch of

[jira] [Comment Edited] (SPARK-25901) Barrier mode spawns a bunch of threads that get collected on gc

2018-10-31 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16670573#comment-16670573 ] yogesh garg edited comment on SPARK-25901 at 10/31/18 7:06 PM: --- I am

[jira] [Updated] (SPARK-25901) Barrier mode spawns a bunch of threads that get collected on gc

2018-10-31 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yogesh garg updated SPARK-25901: Description: After a barrier job is terminated (successfully or interrupted), the accompanying

[jira] [Commented] (SPARK-25901) Barrier mode spawns a bunch of threads that get collected on gc

2018-11-01 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16672202#comment-16672202 ] yogesh garg commented on SPARK-25901: - [~jiangxb1987] thanks for approving the PR, can we assign