Nobody's talked to me about it either. I'm happy to review your code when you try this out, however. Do you have a good data set you're planning on using for training? Ideally you want a supervised label set in which training data has multiple labels per document.
On Thu, Sep 5, 2013 at 9:44 AM, Ted Dunning <[email protected]> wrote: > I haven't seen any discussion of this other than what you reference. > > > On Thu, Sep 5, 2013 at 7:59 AM, Henry Lee <[email protected]> wrote: > > > I am about to implement Jake Mannix's suggestion out of Twitter fork. > > > > Has anyone already implemented "true" L-LDA out of Mahout? > > > > http://markmail.org/message/cm2a6rnxblj5azuh > > > > over this fork? > > > > > > > https://github.com/twitter/mahout/blob/master/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0PriorMapper.java > > > > Thanks, > > Henry Lee > > > -- -jake
