Hi all, I noticed the development of the Spark co-occurrence implementation in MAHOUT-1464, and I wondered whether I could get similar results, albeit with less scalability, by using MultithreadedBatchItemSimilarities with LLRSimilarity.
I want to use a co-occurrence recommender on a smallish dataset of a few GB, which does not warrant the use of a Spark cluster. Is the Spark implementation mostly a more scalable version of the same approach, or is it an improved implementation that gives different or better results?

Cheers, Frank
