[
https://issues.apache.org/jira/browse/HIVEMALL-74?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15965234#comment-15965234
]
Takuya Kitazawa edited comment on HIVEMALL-74 at 4/12/17 2:22 AM:
------------------------------------------------------------------
There are several incremental variants of pLSA.
One is called *fold-in* which re-runs EM steps for new documents *by fixing
P(w|z)*. This strategy is described in the original papers, but we have no way
to update *P(w|z)* for newly observed documents. So, it does not fit to our
purpose.
Incremental methods which update P(w|z) are described in: "Incremental
Probabilistic Latent Semantic Analysis for Automatic Question Recommendation"
https://pdfs.semanticscholar.org/b66e/c7faf2e4888503f7ad1537d284f350fb3e58.pdf
(Table 1 in the paper compares different algorithms); we will implement one of
the three incremental variants, *IPLSA*, *MAP-PLSA* and *Our method*, in
Hivemall.
was (Author: takuti):
Online PLSA
http://poseidon.csd.auth.gr/papers/PUBLISHED/JOURNAL/pdf/2014/D48.pdf
> Implement pLSA algorithm
> ------------------------
>
> Key: HIVEMALL-74
> URL: https://issues.apache.org/jira/browse/HIVEMALL-74
> Project: Hivemall
> Issue Type: Wish
> Reporter: Makoto Yui
> Priority: Minor
> Labels: Algorithm
>
> Support pLSA [1].
> [1] Thomas Hofmann, "Collaborative Filtering via Gauusian Probabilistic
> Latent Semantic Analysis", Proc. SIGIR, 2003.
> http://dl.acm.org/citation.cfm?id=860483
> http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.7.2476&rep=rep1&type=pdf
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)