Joseph K. Bradley created SPARK-7045:
----------------------------------------
Summary: Word2Vec: avoid intermediate representation when creating model
Key: SPARK-7045
URL: https://issues.apache.org/jira/browse/SPARK-7045
Project: Spark
Issue Type: Improvement
Components: MLlib
Affects Versions: 1.4.0
Reporter: Joseph K. Bradley
Priority: Minor
Word2VecModel now stores its word vectors as a single, flat array, and Word2Vec
does as well. However, when Word2Vec creates the model, it still builds an
intermediate representation first. We should skip that step and construct the
model directly from the flat array.
That said, it would be nice to add a public constructor for Word2VecModel that
takes the intermediate representation (a Map from String words to their
Vectors), since that form is user-friendly.
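
To illustrate the two representations discussed above, here is a minimal sketch in plain Python (not Spark's Scala, and not the actual MLlib code). The function names flatten_word_vectors and lookup are hypothetical, and the row-major layout (one vector after another, indexed by word position) is an assumption about what "a single, flat array" means here.

```python
from typing import Dict, List, Tuple


def flatten_word_vectors(
    model: Dict[str, List[float]],
) -> Tuple[Dict[str, int], List[float], int]:
    """Pack a user-friendly word -> vector map into a flat layout.

    Returns (word_index, flat_vectors, vector_size), where the vector for
    a word occupies flat_vectors[i * vector_size : (i + 1) * vector_size]
    and i = word_index[word]. Hypothetical helper, for illustration only.
    """
    if not model:
        raise ValueError("model must contain at least one word")

    # All vectors must share one dimension for the flat layout to work.
    vector_size = len(next(iter(model.values())))

    word_index: Dict[str, int] = {}
    flat_vectors: List[float] = []
    for i, (word, vector) in enumerate(model.items()):
        if len(vector) != vector_size:
            raise ValueError(f"vector for {word!r} has the wrong size")
        word_index[word] = i
        flat_vectors.extend(vector)
    return word_index, flat_vectors, vector_size


def lookup(
    word: str,
    word_index: Dict[str, int],
    flat_vectors: List[float],
    vector_size: int,
) -> List[float]:
    """Slice one word's vector back out of the flat array."""
    start = word_index[word] * vector_size
    return flat_vectors[start:start + vector_size]


# Usage: a Map-style input, as a public constructor might accept it.
word_index, flat_vectors, vector_size = flatten_word_vectors(
    {"king": [1.0, 2.0], "queen": [3.0, 4.0]}
)
queen_vector = lookup("queen", word_index, flat_vectors, vector_size)
```

In this sketch the Map form is only an input convenience: a constructor that accepts it would do the flattening once, while the trainer could hand over its flat array without ever materializing the Map.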