[ https://issues.apache.org/jira/browse/SPARK-4001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jacky Li updated SPARK-4001:
----------------------------
Attachment: Distributed frequent item mining algorithm based on Spark.pptx
[~mengxr] please check the attached file. We have tested it using an open data
set from http://fimi.ua.ac.be/data/
Our test cluster is currently small (4 nodes); we will test it on a larger
cluster later, if required.
> Add Apriori algorithm to Spark MLlib
> ------------------------------------
>
> Key: SPARK-4001
> URL: https://issues.apache.org/jira/browse/SPARK-4001
> Project: Spark
> Issue Type: New Feature
> Components: MLlib
> Reporter: Jacky Li
> Assignee: Jacky Li
> Attachments: Distributed frequent item mining algorithm based on
> Spark.pptx
>
>
> Apriori is the classic algorithm for frequent item set mining in a
> transactional data set. It would be useful to add the Apriori algorithm to
> MLlib in Spark.
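
For readers unfamiliar with the algorithm named in the issue, here is a minimal single-machine sketch of Apriori in plain Python. It is not the distributed implementation described in the attachment and does not use Spark. The function name `apriori`, the `min_support` parameter (an absolute transaction count) and the sample transactions are assumptions made for illustration only.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {frozenset(itemset): count} for every itemset that occurs
    in at least `min_support` transactions (hypothetical helper)."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count single items and keep the frequent ones.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        prev = set(current)
        # Join step: combine frequent (k-1)-itemsets into k-itemsets.
        candidates = set()
        for a in prev:
            for b in prev:
                union = a | b
                if len(union) != k:
                    continue
                # Prune step: every (k-1)-subset must itself be frequent.
                if all(frozenset(sub) in prev
                       for sub in combinations(union, k - 1)):
                    candidates.add(union)

        # Count step: one pass over the data for the surviving candidates.
        counts = {c: 0 for c in candidates}
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1

        current = {s: c for s, c in counts.items() if c >= min_support}
        frequent.update(current)
        k += 1

    return frequent


# Made-up sample data, not taken from the FIMI data sets.
sample = [
    ["bread", "milk"],
    ["bread", "butter"],
    ["bread", "milk", "butter"],
    ["milk"],
]
result = apriori(sample, min_support=2)
```

The prune step is what distinguishes Apriori from brute-force counting: a k-itemset is only counted if all of its (k-1)-subsets were already found frequent, so the candidate set shrinks at each level.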
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)