Hi Pat, Thanks for your comment, I found it quite helpful. I'm also trying to build a content-based recommender. One question though: How can I use RunSimilarityJob for online data? I mean, I have a dataset and the approach you describe works pretty well to precompute the similarity matrix. However, when I get new content in my dataset (it is a dataset of news), I can I compute the similarity of only that new item against the rest without computing the whole matrix again?
Many thanks.
