> KNN does not have a stochastic element. I think you would get the same > results on one platform, unless I'm missing something.
These also have a stochastic element, as the Hadoop-based recommenders randomly down-sample to the interaction histories of power-users. However, this should only have a small impact on the result and can also me made deterministic by fixing the seed of the RNG. > On Sun, Mar 17, 2013 at 1:43 PM, Koobas <[email protected]> wrote: > >> I am asking the basic reproducibility question. >> If I run twice on the same dataset, with the same hardware setup, will I >> always get the same resuts? >> Or is there any chance that on two different runs, the same user will get >> slightly different suggestions? >> I am mostly revolving in the space of numerical libraries, where >> reproducibility is, sort of, a big deal. >> Maybe it's not much of a concern in machine learning. >> I am just curious. >> >> >> On Sun, Mar 17, 2013 at 8:46 AM, Sean Owen <[email protected]> wrote: >> >>> What's your question? ALS has a random starting point which changes the >>> results a bit. Not sure about KNN though. >>> >>> >> >>> On Sun, Mar 17, 2013 at 3:03 AM, Koobas <[email protected]> wrote: >>> >>>> Can anybody shed any light on the issue of reproducibility in Mahout, >>>> with and without Hadoop, specifically in the context of kNN and ALS >>>> recommenders? >>>> >>> >> >
