[jira] [Commented] (SPARK-13030) Change OneHotEncoder to Estimator

2016-03-09 Thread Wojciech Jurczyk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15186722#comment-15186722 ] Wojciech Jurczyk commented on SPARK-13030: -- I am not sure if I get you correctly. Are you

[jira] [Commented] (SPARK-12874) ML StringIndexer does not protect itself from column name duplication

2016-02-02 Thread Wojciech Jurczyk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15127934#comment-15127934 ] Wojciech Jurczyk commented on SPARK-12874: -- Thank you for the feedback and willingness to help,

[jira] [Created] (SPARK-13030) Change OneHotEncoder to Estimator

2016-01-26 Thread Wojciech Jurczyk (JIRA)
Wojciech Jurczyk created SPARK-13030: Summary: Change OneHotEncoder to Estimator Key: SPARK-13030 URL: https://issues.apache.org/jira/browse/SPARK-13030 Project: Spark Issue Type:

[jira] [Created] (SPARK-12877) TrainValidationSplit is missing in pyspark.ml.tuning

2016-01-18 Thread Wojciech Jurczyk (JIRA)
Wojciech Jurczyk created SPARK-12877: Summary: TrainValidationSplit is missing in pyspark.ml.tuning Key: SPARK-12877 URL: https://issues.apache.org/jira/browse/SPARK-12877 Project: Spark

[jira] [Created] (SPARK-12874) ML StringIndexer does not protect itself from column name duplication

2016-01-17 Thread Wojciech Jurczyk (JIRA)
Wojciech Jurczyk created SPARK-12874: Summary: ML StringIndexer does not protect itself from column name duplication Key: SPARK-12874 URL: https://issues.apache.org/jira/browse/SPARK-12874

[jira] [Commented] (SPARK-7146) Should ML sharedParams be a public API?

2016-01-12 Thread Wojciech Jurczyk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15093932#comment-15093932 ] Wojciech Jurczyk commented on SPARK-7146: - {quote}Cons: Users have to be careful since parameters

[jira] [Updated] (SPARK-12751) Traits generated by SharedParamsCodeGen should not be private

2016-01-11 Thread Wojciech Jurczyk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wojciech Jurczyk updated SPARK-12751: - Description: Many Estimators and Transformers mix in traits generated by

[jira] [Created] (SPARK-12751) Traits generated by SharedParamsCodeGen should not be private

2016-01-11 Thread Wojciech Jurczyk (JIRA)
Wojciech Jurczyk created SPARK-12751: Summary: Traits generated by SharedParamsCodeGen should not be private Key: SPARK-12751 URL: https://issues.apache.org/jira/browse/SPARK-12751 Project: Spark

[jira] [Updated] (SPARK-12751) Traits generated by SharedParamsCodeGen should not be private

2016-01-11 Thread Wojciech Jurczyk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wojciech Jurczyk updated SPARK-12751: - Priority: Minor (was: Major) > Traits generated by SharedParamsCodeGen should not be

[jira] [Commented] (SPARK-12711) ML StopWordsRemover does not protect itself from column name duplication

2016-01-09 Thread Wojciech Jurczyk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15090535#comment-15090535 ] Wojciech Jurczyk commented on SPARK-12711: -- [~josephkb] Is there any particular reason why

[jira] [Commented] (SPARK-11478) ML StringIndexer return inconsistent schema

2015-12-16 Thread Wojciech Jurczyk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15060210#comment-15060210 ] Wojciech Jurczyk commented on SPARK-11478: -- Any progress on this, [~yanboliang]? I faced the