In Mahout lsa pipeline is possible with seqdirectory, seq2sparse and ssvd commands. Nuances are understanding dictionary format and llr anaylysis of n-grams and perhaps use a slightly better lemmatizer than the default one.
With indexing part you are on your own at this point. On Jan 1, 2012 2:28 PM, "Peyman Mohajerian" <[email protected]> wrote: > Hi Guys, > > I'm interested in this work: > > http://www.ccri.com/blog/2010/4/2/latent-semantic-analysis-in-solr-using-clojure.html > > I looked at some of the comments and notices that there was interest > in incorporating it into Mahout, back in 2010. I'm also having issues > running this code due to dependencies on older version of Mahout. > > I was wondering if LSA is now directly available in Mahout? Also if I > upgrade to the latest Mahout would this Clojure code work? > > Thanks > Peyman >
