[ https://issues.apache.org/jira/browse/SPARK-2613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Xiangrui Meng closed SPARK-2613.
--------------------------------
Assignee: Xiangrui Meng (was: Liquan Pei)
> CLONE - word2vec: Distributed Representation of Words
> -----------------------------------------------------
>
> Key: SPARK-2613
> URL: https://issues.apache.org/jira/browse/SPARK-2613
> Project: Spark
> Issue Type: New Feature
> Components: MLlib
> Reporter: Yifan Yang
> Assignee: Xiangrui Meng
> Original Estimate: 672h
> Remaining Estimate: 672h
>
> We would like to add a parallel implementation of word2vec to MLlib.
> word2vec learns distributed representations of words by training on large
> data sets. The Spark programming model fits word2vec well because the
> training algorithm is embarrassingly parallel. Our initial implementation
> will focus on the skip-gram model with negative sampling.
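
For readers unfamiliar with the terms, the following is a minimal single-machine sketch of skip-gram training with negative sampling, in plain Python. It is not the MLlib implementation and does not show the distributed part. All names in it (skipgram_pairs, sgns_step, w_in, w_out) are hypothetical and chosen only for illustration, and the negatives are passed in by the caller rather than drawn from a unigram distribution.

```python
import math


def skipgram_pairs(tokens, window):
    """Return (center, context) pairs within `window` positions of each token."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def sgns_step(w_in, w_out, center, context, negatives, lr):
    """Apply one SGD update for a (center, context) pair and its negatives.

    w_in and w_out are lists of vectors indexed by word id (assumed layout).
    Returns the negative-sampling loss measured before the update.
    """
    v = w_in[center]
    grad_v = [0.0] * len(v)
    loss = 0.0
    # The true context word has label 1; each sampled negative has label 0.
    for word, label in [(context, 1.0)] + [(n, 0.0) for n in negatives]:
        u = w_out[word]
        p = sigmoid(dot(v, u))
        loss -= math.log(p if label == 1.0 else 1.0 - p)
        g = p - label  # gradient of the logistic loss w.r.t. the score
        for k in range(len(v)):
            grad_v[k] += g * u[k]     # accumulate gradient for the input vector
            u[k] -= lr * g * v[k]     # update the output vector in place
    for k in range(len(v)):
        v[k] -= lr * grad_v[k]
    return loss
```

Each update touches only the vectors of one center word, one context word, and a few negatives, which is why the work can be split across partitions of the corpus; how the partial updates are combined is a design decision this sketch leaves open.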