Re: reproducibility

Sebastian Schelter Mon, 18 Mar 2013 00:11:28 -0700

> KNN does not have a stochastic element. I think you would get the same
> results on one platform, unless I'm missing something.


The Hadoop-based kNN recommenders also have a stochastic element: they
randomly down-sample the interaction histories of power users.
However, this should only have a small impact on the result, and it can
be made deterministic by fixing the seed of the RNG.
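
To illustrate the idea, here is a minimal sketch in Python. It is not
Mahout's actual code. It caps each user's interaction history by sampling
with a seeded RNG, so two runs with the same seed pick the same
interactions. The function name downsample_histories, the cap max_prefs,
and the seed value are assumptions made up for this example.

```python
import random


def downsample_histories(histories, max_prefs, seed):
    """Cap each user's history at max_prefs items using a seeded RNG.

    histories: dict mapping a user id to a list of item ids (hypothetical layout).
    """
    rng = random.Random(seed)  # private RNG, independent of global random state
    sampled = {}
    # Visit users in sorted order so RNG draws are consumed in a fixed order.
    for user in sorted(histories):
        items = sorted(histories[user])  # input order must not affect the sample
        if len(items) > max_prefs:
            sampled[user] = sorted(rng.sample(items, max_prefs))
        else:
            sampled[user] = items
    return sampled


histories = {
    "power_user": list(range(1000)),  # long history, gets down-sampled
    "casual_user": [3, 7, 42],        # short history, kept as-is
}

run_a = downsample_histories(histories, max_prefs=10, seed=1234)
run_b = downsample_histories(histories, max_prefs=10, seed=1234)
print(run_a == run_b)
```

With the same seed, both runs keep the same 10 interactions for the power
user. On a cluster, the order in which users are processed is generally not
fixed, so a single shared RNG may not be enough there; deriving the seed
per user would be one way around that.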

> On Sun, Mar 17, 2013 at 1:43 PM, Koobas <[email protected]> wrote:
> 
>> I am asking the basic reproducibility question.
>> If I run twice on the same dataset, with the same hardware setup, will I
>> always get the same results?
>> Or is there any chance that on two different runs, the same user will get
>> slightly different suggestions?
>> I am mostly revolving in the space of numerical libraries, where
>> reproducibility is, sort of, a big deal.
>> Maybe it's not much of a concern in machine learning.
>> I am just curious.
>>
>>
>> On Sun, Mar 17, 2013 at 8:46 AM, Sean Owen <[email protected]> wrote:
>>
>>> What's your question? ALS has a random starting point which changes the
>>> results a bit. Not sure about KNN though.
>>>
>>>
>>
>>> On Sun, Mar 17, 2013 at 3:03 AM, Koobas <[email protected]> wrote:
>>>
>>>> Can anybody shed any light on the issue of reproducibility in Mahout,
>>>> with and without Hadoop, specifically in the context of kNN and ALS
>>>> recommenders?
>>>>
>>>
>>
>
