[
https://issues.apache.org/jira/browse/SPARK-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14545027#comment-14545027
]
Alex commented on SPARK-2344:
-----------------------------
Hi,
How are you? I have a couple of questions:
1) When are you planning to submit the FCM to the main Spark branch? (I'm
interested in working on top of it for Feature Weight FCM improvements.)
2) Is there a way for Spark to distribute an RDD by input data columns
rather than by rows?
Thanks,
Alex
> Add Fuzzy C-Means algorithm to MLlib
> ------------------------------------
>
> Key: SPARK-2344
> URL: https://issues.apache.org/jira/browse/SPARK-2344
> Project: Spark
> Issue Type: New Feature
> Components: MLlib
> Reporter: Alex
> Priority: Minor
> Labels: clustering
> Original Estimate: 1m
> Remaining Estimate: 1m
>
> I would like to add an FCM (Fuzzy C-Means) algorithm to MLlib.
> FCM is very similar to K-Means, which is already implemented. They
> differ only in the degree of relationship each point has with each cluster:
> in FCM the relationship lies in the range [0, 1], whereas in K-Means it is either 0 or 1.
> As part of the implementation I would like to:
> - create a base class for K-Means and FCM
> - implement the relationship differently for each algorithm (in its own class)
> I'd like this to be assigned to me.
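
To illustrate the difference described above, here is a minimal sketch in plain Python (not Spark or MLlib code) of how the per-point relationship to each cluster might be computed in the two algorithms. It assumes the standard FCM membership formula with fuzzifier m; the function names and the default m = 2 are hypothetical and not taken from the proposed patch.

```python
import math


def sq_dist(point, center):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(point, center))


def kmeans_membership(point, centers):
    """Hard assignment: 1.0 for the nearest center, 0.0 for all others."""
    dists = [sq_dist(point, c) for c in centers]
    nearest = dists.index(min(dists))
    return [1.0 if i == nearest else 0.0 for i in range(len(centers))]


def fcm_membership(point, centers, m=2.0):
    """Soft assignment, assuming the standard FCM formula:
    u_j = 1 / sum_k (d_j / d_k) ** (2 / (m - 1)), with m > 1.
    """
    dists = [math.sqrt(sq_dist(point, c)) for c in centers]
    # If the point coincides with one or more centers, split the full
    # membership among them to avoid dividing by zero.
    zeros = [i for i, d in enumerate(dists) if d == 0.0]
    if zeros:
        return [1.0 / len(zeros) if i in zeros else 0.0
                for i in range(len(centers))]
    exponent = 2.0 / (m - 1.0)
    return [1.0 / sum((dj / dk) ** exponent for dk in dists)
            for dj in dists]


if __name__ == "__main__":
    point = (1.0, 0.0)
    centers = [(0.0, 0.0), (4.0, 0.0)]
    print(kmeans_membership(point, centers))
    print(fcm_membership(point, centers))
```

Both functions return one value per cluster, and the values sum to 1 for each point. That shared shape is what would let a common base class hold the iteration logic while each subclass supplies its own relationship function.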