GitHub user Ishiihara opened a pull request:
https://github.com/apache/spark/pull/2356
[SPARK-3486][MLlib][PySpark] PySpark support for Word2Vec
@mengxr
Added PySpark support for Word2Vec
Change list
(1) PySpark support for Word2Vec
(2) SerDe support of string sequence both on python side and JVM side
(3) Test for SerDe of string sequence on JVM side
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/Ishiihara/spark Word2Vec-python
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/2356.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2356
----
commit c867fdfdf623c2e9905a376d35987dbe2914e329
Author: Liquan Pei <[email protected]>
Date: 2014-09-10T08:51:44Z
add Word2Vec to pyspark
commit 0ad3ac1efed6258607a79c0d45345d70a17dee47
Author: Liquan Pei <[email protected]>
Date: 2014-09-10T10:02:56Z
minor fix
commit 48d5e721a58924f33ebef31b9e67454f45480d5c
Author: Liquan Pei <[email protected]>
Date: 2014-09-11T09:50:30Z
Functionality improvement
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]