That looks basically sound. You probably want to wrap the PearsonCorrelationSimilarity in a CachingItemSimilarity.
You may also simply wish to try a different algorithm. What's the data like? if it has lots of items, this is not the best choice. Next step here would be to profile to see where the time is spent. You might just debug, and pause the processing periodically to see where the thread is. That may show where time is spent. 2010/7/14 Young <[email protected]>: > Thanks, Sean. Below is my code. >
