Github user kalmanchapman commented on the issue:
https://github.com/apache/flink/pull/2735
@kateri1 - I agree that seeking a solution with Flink's data structures is
valuable.
I also think that Flink-ML is in a unique position to implement
streaming-first, iterative implementations of this algorithm. They are fairly
novel on the web, but in theory have been implemented in [Gensim's
word2vec](https://github.com/RaRe-Technologies/gensim/pull/900).
Having an initial, offline implementation of word2vec in flink could be
considered as a foundation for an online word2vec that Flink would be in a
unique position to implement and be of great use to the community looking for a
scaling solution to this class of problem
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---