Zhang JiaJin created SPARK-6487:
-----------------------------------
Summary: Add sequential pattern mining algorithm to Spark MLlib
Key: SPARK-6487
URL: https://issues.apache.org/jira/browse/SPARK-6487
Project: Spark
Issue Type: New Feature
Components: MLlib
Reporter: Zhang JiaJin
Sequential pattern mining is an important branch of pattern mining. In our
past work, we have used sequence mining (mainly the PrefixSpan algorithm) to
find patterns in telecommunication signaling sequences, with good results.
However, once the data becomes too large, the running time grows too long and
can no longer meet our service requirements. We plan to implement the
PrefixSpan algorithm in Spark and apply it in our subsequent work.
Related paper: "Distributed PrefixSpan algorithm based on MapReduce".
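
For readers unfamiliar with PrefixSpan, here is a minimal single-machine sketch of the core idea: grow a frequent prefix one item at a time and recurse on the projected suffixes. It is only an illustration, not the distributed algorithm from the paper and not a proposed MLlib API. It assumes each sequence element is a single item (real PrefixSpan also handles itemsets), and the function name `prefix_span` and its parameters are hypothetical.

```python
from collections import defaultdict


def prefix_span(sequences, min_support):
    """Return {pattern tuple: support} for every frequent sequential pattern.

    Simplifying assumption: each sequence is a list of single items,
    not a list of itemsets.
    """
    results = {}

    def mine(prefix, projected):
        # projected holds (sequence index, start offset) pairs: the suffixes
        # that remain after matching `prefix` at its earliest position.
        first_pos = defaultdict(dict)
        for seq_id, start in projected:
            seq = sequences[seq_id]
            for pos in range(start, len(seq)):
                # Keep only the first occurrence of each item per sequence.
                first_pos[seq[pos]].setdefault(seq_id, pos)

        for item in sorted(first_pos):
            occurrences = first_pos[item]
            support = len(occurrences)  # number of distinct sequences
            if support < min_support:
                continue
            pattern = prefix + (item,)
            results[pattern] = support
            # Project on the extended prefix and keep growing it.
            mine(pattern, [(sid, pos + 1) for sid, pos in occurrences.items()])

    mine((), [(i, 0) for i in range(len(sequences))])
    return results
```

The recursion on projected suffixes is the part that becomes expensive on large data, which is why a distributed version is of interest.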