[jira] [Commented] (SPARK-22586) Feature selection

2017-11-22 Thread Jorge Gonzalez Lopez (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-22586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16263498#comment-16263498
 ] 

Jorge Gonzalez Lopez commented on SPARK-22586:
--

Sorry, for posting on the wrong section. I'll forward the question to the 
mailing list.

> Feature selection 
> --
>
> Key: SPARK-22586
> URL: https://issues.apache.org/jira/browse/SPARK-22586
> Project: Spark
>  Issue Type: Improvement
>  Components: ML
>Affects Versions: 2.2.0
>Reporter: Jorge Gonzalez Lopez
>Priority: Minor
>
> Hello everyone, 
> I would like to know if there are plans to add different score functions to 
> perform feature selection under the same interface. I saw two previous issues 
> related to the topic:
> https://issues.apache.org/jira/browse/SPARK-6531
> https://issues.apache.org/jira/browse/SPARK-1473
> However, it seems nothing was added at the end. I would like to know if there 
> was some problem then, because I wouldn't mind taking a closer look to it in 
> case people would be interested. 
> Additionally, I think it would be interested to include a score metric 
> between continuous attributes (for regression), and between continuous and 
> discrete (for classification). This has already been done successfully on 
> http://scikit-learn.org/stable/modules/feature_selection.html#univariate-feature-selection



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-22586) Feature selection

2017-11-22 Thread Jorge Gonzalez Lopez (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-22586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jorge Gonzalez Lopez updated SPARK-22586:
-
Environment: (was: Hello everyone, 

I would like to know if there are plans to add different score functions to 
perform feature selection under the same interface. I saw two previous issues 
related to the topic:

https://issues.apache.org/jira/browse/SPARK-6531
https://issues.apache.org/jira/browse/SPARK-1473

However, it seems nothing was added at the end. I would like to know if there 
was some problem then, because I wouldn't mind taking a closer look to it in 
case people would be interested. 

Additionally, I think it would be interested to include a score metric between 
continuous attributes (for regression), and between continuous and discrete 
(for classification). This has already been done successfully on 
http://scikit-learn.org/stable/modules/feature_selection.html#univariate-feature-selection)

> Feature selection 
> --
>
> Key: SPARK-22586
> URL: https://issues.apache.org/jira/browse/SPARK-22586
> Project: Spark
>  Issue Type: Improvement
>  Components: ML
>Affects Versions: 2.2.0
>Reporter: Jorge Gonzalez Lopez
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-22586) Feature selection

2017-11-22 Thread Jorge Gonzalez Lopez (JIRA)
Jorge Gonzalez Lopez created SPARK-22586:


 Summary: Feature selection 
 Key: SPARK-22586
 URL: https://issues.apache.org/jira/browse/SPARK-22586
 Project: Spark
  Issue Type: Improvement
  Components: ML
Affects Versions: 2.2.0
 Environment: Hello everyone, 

I would like to know if there are plans to add different score functions to 
perform feature selection under the same interface. I saw two previous issues 
related to the topic:

https://issues.apache.org/jira/browse/SPARK-6531
https://issues.apache.org/jira/browse/SPARK-1473

However, it seems nothing was added at the end. I would like to know if there 
was some problem then, because I wouldn't mind taking a closer look to it in 
case people would be interested. 

Additionally, I think it would be interested to include a score metric between 
continuous attributes (for regression), and between continuous and discrete 
(for classification). This has already been done successfully on 
http://scikit-learn.org/stable/modules/feature_selection.html#univariate-feature-selection
Reporter: Jorge Gonzalez Lopez
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-22586) Feature selection

2017-11-22 Thread Jorge Gonzalez Lopez (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-22586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jorge Gonzalez Lopez updated SPARK-22586:
-
Description: 
Hello everyone, 

I would like to know if there are plans to add different score functions to 
perform feature selection under the same interface. I saw two previous issues 
related to the topic:

https://issues.apache.org/jira/browse/SPARK-6531
https://issues.apache.org/jira/browse/SPARK-1473

However, it seems nothing was added at the end. I would like to know if there 
was some problem then, because I wouldn't mind taking a closer look to it in 
case people would be interested. 

Additionally, I think it would be interested to include a score metric between 
continuous attributes (for regression), and between continuous and discrete 
(for classification). This has already been done successfully on 
http://scikit-learn.org/stable/modules/feature_selection.html#univariate-feature-selection

> Feature selection 
> --
>
> Key: SPARK-22586
> URL: https://issues.apache.org/jira/browse/SPARK-22586
> Project: Spark
>  Issue Type: Improvement
>  Components: ML
>Affects Versions: 2.2.0
>Reporter: Jorge Gonzalez Lopez
>Priority: Minor
>
> Hello everyone, 
> I would like to know if there are plans to add different score functions to 
> perform feature selection under the same interface. I saw two previous issues 
> related to the topic:
> https://issues.apache.org/jira/browse/SPARK-6531
> https://issues.apache.org/jira/browse/SPARK-1473
> However, it seems nothing was added at the end. I would like to know if there 
> was some problem then, because I wouldn't mind taking a closer look to it in 
> case people would be interested. 
> Additionally, I think it would be interested to include a score metric 
> between continuous attributes (for regression), and between continuous and 
> discrete (for classification). This has already been done successfully on 
> http://scikit-learn.org/stable/modules/feature_selection.html#univariate-feature-selection



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org