Shubham Chopra created SPARK-20903:
--------------------------------------

             Summary: Word2Vec Skip-Gram + Negative Sampling
                 Key: SPARK-20903
                 URL: https://issues.apache.org/jira/browse/SPARK-20903
             Project: Spark
          Issue Type: Sub-task
          Components: ML, MLlib
    Affects Versions: 2.1.1
            Reporter: Shubham Chopra


SkipGram + Negative Sampling is shown to be comparative or out-performing the 
hierarchical softmax based approach currently implemented with Spark. Since 
word2vec is largely a pre-processing step, the performance often can depend on 
the application it is being used for, and the corpus it is estimated on. These 
implementation give users the choice of picking one that works best for their 
use-case.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to