[ 
https://issues.apache.org/jira/browse/SPARK-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen resolved SPARK-1303.
------------------------------
    Resolution: Won't Fix

Sounds like this should start outside MLlib: 
https://github.com/apache/spark/pull/216

> Added discretization capability to MLlib.
> -----------------------------------------
>
>                 Key: SPARK-1303
>                 URL: https://issues.apache.org/jira/browse/SPARK-1303
>             Project: Spark
>          Issue Type: New Feature
>          Components: MLlib
>            Reporter: LIDIAgroup
>
> Some time ago, we discussed with Ameet Talwalkar the possibility of 
> including both Feature Selection and Discretization algorithms in MLlib.
> In this patch we've implemented Entropy Minimization Discretization following 
> the algorithm described in the paper "Multi-interval discretization of 
> continuous-valued attributes for classification learning" by Fayyad and Irani 
> (1993). This is one of the most widely used discretizers and is already 
> included in most libraries, such as Weka. It can be used as a base for FS 
> algorithms and for the NaiveBayes implementation already included in MLlib.
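
For readers unfamiliar with the method named in the description, the following is a minimal single-machine sketch of entropy-minimization discretization with the MDL stopping criterion from Fayyad and Irani (1993). It is not the code from the patch and does not use Spark; the function names (entropy, mdlp_cut_points, _split) are hypothetical and chosen only for illustration. For simplicity it evaluates every position between distinct values rather than only class-boundary points.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def mdlp_cut_points(values, labels):
    """Return sorted cut points for one continuous attribute.

    Illustrative sketch only: recursive binary splitting that stops when
    the MDL criterion rejects the best candidate cut.
    """
    pairs = sorted(zip(values, labels))
    xs = [v for v, _ in pairs]
    ys = [y for _, y in pairs]
    cuts = []
    _split(xs, ys, cuts)
    return sorted(cuts)


def _split(xs, ys, cuts):
    n = len(ys)
    if n < 2:
        return
    ent_s = entropy(ys)

    # Find the cut that minimizes the weighted class entropy of the two parts.
    best = None
    for i in range(1, n):
        if xs[i] == xs[i - 1]:
            continue  # cannot cut between equal attribute values
        e = (i / n) * entropy(ys[:i]) + ((n - i) / n) * entropy(ys[i:])
        if best is None or e < best[0]:
            best = (e, i)
    if best is None:
        return

    e, i = best
    left, right = ys[:i], ys[i:]
    gain = ent_s - e

    # MDL acceptance test: accept the cut only if the information gain
    # exceeds (log2(n - 1) + delta) / n.
    k, k1, k2 = len(set(ys)), len(set(left)), len(set(right))
    delta = math.log2(3 ** k - 2) - (
        k * ent_s - k1 * entropy(left) - k2 * entropy(right)
    )
    if gain > (math.log2(n - 1) + delta) / n:
        cuts.append((xs[i - 1] + xs[i]) / 2)  # midpoint between neighbours
        _split(xs[:i], left, cuts)
        _split(xs[i:], right, cuts)
```

Calling mdlp_cut_points with the attribute values and their class labels returns the interval boundaries; an empty list means the criterion found no cut worth making, so the attribute stays in a single interval.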




