Hi Osman,

Mahout has all the building blocks you need to create an LSA pipeline:
You have to vectorize your documents using seqdirectory and seq2sparse to get the term-document matrix. After that, you can use one of our two SVD implementations [1,2] to compute the decomposition necessary for LSA.

sebastian

[1] https://cwiki.apache.org/MAHOUT/stochastic-singular-value-decomposition.html
[2] https://cwiki.apache.org/MAHOUT/dimensional-reduction.html

On 26.12.2012 19:57, Osman Başkaya wrote:
> Greetings everyone,
>
> I want to use Latent Semantic Analysis in Mahout. Is there any
> implementation for this algorithm. I checked but I couldn't find. LSA is
> very similar to SVD, so I thought the reason why there is no concrete LSA
> implementation. Could you clarify this for me, please?
>
> Thank you so much.
>
> Cordially,
> Osman baskaya
>
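
P.S. To make the two steps in my reply concrete (vectorize, then decompose), here is a toy sketch in plain Python. It is not Mahout code: all function names are hypothetical, the whitespace tokenizer and raw counts are simplifying assumptions, and the SVD is done with simple power iteration plus deflation rather than with the implementations linked in [1,2]. It only shows what LSA computes on a small term-document matrix.

```python
import math
import random
from collections import Counter


def term_document_matrix(docs):
    """Rows are terms, columns are documents, entries are raw term counts."""
    counts = [Counter(doc.lower().split()) for doc in docs]
    vocab = sorted(set().union(*counts))
    matrix = [[float(c[term]) for c in counts] for term in vocab]
    return vocab, matrix


def _norm(vec):
    return math.sqrt(sum(x * x for x in vec))


def top_singular_triplet(a, iterations=300, seed=0):
    """Power iteration on A^T A; returns (sigma, u, v) for the largest singular value."""
    rows, cols = len(a), len(a[0])
    rng = random.Random(seed)
    v = [rng.random() + 0.1 for _ in range(cols)]
    start_norm = _norm(v)
    v = [x / start_norm for x in v]
    for _ in range(iterations):
        av = [sum(a[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        w = [sum(a[i][j] * av[i] for i in range(rows)) for j in range(cols)]
        w_norm = _norm(w)
        if w_norm < 1e-12:  # nothing left to extract from this matrix
            break
        v = [x / w_norm for x in w]
    av = [sum(a[i][j] * v[j] for j in range(cols)) for i in range(rows)]
    sigma = _norm(av)
    u = [x / sigma for x in av] if sigma > 0 else [0.0] * rows
    return sigma, u, v


def truncated_svd(a, k):
    """Top-k singular triplets, found one at a time by deflating the matrix."""
    residual = [row[:] for row in a]
    triplets = []
    for _ in range(k):
        sigma, u, v = top_singular_triplet(residual)
        triplets.append((sigma, u, v))
        # Subtract the rank-1 component that was just found.
        for i in range(len(residual)):
            for j in range(len(residual[0])):
                residual[i][j] -= sigma * u[i] * v[j]
    return triplets


def document_coordinates(triplets):
    """Coordinates of each document in the k-dimensional latent space (sigma_i * v_i)."""
    n_docs = len(triplets[0][2])
    return [[s * v[j] for s, _, v in triplets] for j in range(n_docs)]
```

Calling `term_document_matrix(docs)`, then `truncated_svd(matrix, k)`, then `document_coordinates(triplets)` gives one k-dimensional vector per document. Documents that share vocabulary end up close together in that space, which is the part of LSA you would get from running the SVD on the seq2sparse output.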
