RowSimilarityJob countObservations

Wei Li Tue, 23 Sep 2014 23:07:46 -0700

Hi All:

    In RowSimilarityJob, the main purpose of the countObservations job
seems to be to aggregate the number of users for each item. Is that right?
If so, why not compute the counts directly, as in the WordCount logic? The
current implementation initializes a RandomAccessSparseVector, which may
cause an OutOfMemory error when the number of users is large. Am I
understanding this correctly? Thanks.
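
To make the WordCount idea concrete, here is a minimal sketch in plain Java. It is not the actual Mahout or Hadoop code: the class name ObservationCountSketch and the method countObservations are hypothetical, and each row is assumed to be given as the array of its non-zero column indices. Every non-zero entry contributes a (column, 1) pair, and the pairs are summed per column.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not part of Mahout's RowSimilarityJob.
class ObservationCountSketch {

    // Each row is represented by the indices of its non-zero columns.
    // For every index we add 1 to that column's running total, which is
    // the same per-key summation that WordCount performs on words.
    static Map<Integer, Long> countObservations(List<int[]> rows) {
        Map<Integer, Long> counts = new HashMap<>();
        for (int[] nonZeroColumns : rows) {
            for (int column : nonZeroColumns) {
                counts.merge(column, 1L, Long::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        List<int[]> rows = List.of(
                new int[] {0, 2},
                new int[] {2, 3},
                new int[] {0, 2, 5});
        System.out.println(countObservations(rows));
    }
}
```

In a real MapReduce job the mapper would emit the (column, 1) pairs and a combiner/reducer would do the summing, so no single task would need to hold all counts in memory at once. Whether that is actually cheaper than the current approach is exactly what I am asking about.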



Best
Wei
