Hi, I have a regular process for calculating recommendations. It is in production and has been running for more than a week. At the start of each day the process:
1. consumes data for the last two months (current_day - 2 months)
2. prepares the data for Mahout (converts source ids to Mahout ids)
3. feeds the prepared data to ItemSimilarityJob
4. remaps the result from Mahout ids back to source ids
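To make steps 2 and 4 concrete, the id mapping can be thought of as something like the sketch below. This is a simplified illustration and not the production code: the `IdMapper` class and its method names are hypothetical, and it assumes source ids are strings and that the mapping fits in an in-memory dictionary.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper (not part of Mahout): assigns each source id a dense
// long id for Mahout and translates Mahout ids back to source ids.
class IdMapper {
    private final Map<String, Long> sourceToMahout = new HashMap<>();
    private final List<String> mahoutToSource = new ArrayList<>();

    // Step 2: return the Mahout id for a source id, assigning a new one if needed.
    long toMahoutId(String sourceId) {
        Long existing = sourceToMahout.get(sourceId);
        if (existing != null) {
            return existing;
        }
        long id = mahoutToSource.size();
        sourceToMahout.put(sourceId, id);
        mahoutToSource.add(sourceId);
        return id;
    }

    // Step 4: translate a Mahout id from the job output back to the source id.
    String toSourceId(long mahoutId) {
        if (mahoutId < 0 || mahoutId >= mahoutToSource.size()) {
            throw new IllegalArgumentException("Unknown Mahout id: " + mahoutId);
        }
        return mahoutToSource.get((int) mahoutId);
    }
}
```

With a mapping like this, every id in the output should translate back to the source id it was created from. If the dictionary used in step 4 differs from the one built in step 2, the remapped recommendations would point at the wrong items.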
I've started to get strange results for extremely popular items. For example, the iPhone gets covers and iPhone-related tools as recommendations, but absolutely unrelated items also appear among the top recommendations. What is the right way to debug such situations? What can I read? There were no changes to any system.
