For the training I am using the code in the MovieLensALS example with 
trainImplicit set to false 
and for the training RMSE I use the

val rmseTr = computeRmse(model, training, params.implicitPrefs).

The computeRmse() method is provided in the MovieLensALS class.


Thanks a lot, 
Kostas


> On Nov 26, 2014, at 2:41 PM, Sean Owen <so...@cloudera.com> wrote:
> 
> How are you computing RMSE?
> and how are you training the model -- not with trainImplicit right?
> I wonder if you are somehow optimizing something besides RMSE.
> 
> On Wed, Nov 26, 2014 at 2:36 PM, Kostas Kloudas <kklou...@gmail.com> wrote:
>> Once again, the error even with the training dataset increases. The results
>> are:
>> 
>> Running 1 iterations
>> For 1 iter.: Test RMSE  = 1.2447121194304893  Training RMSE =
>> 1.2394166987104076 (34.751317636 s).
>> Running 5 iterations
>> For 5 iter.: Test RMSE  = 1.3253957117600659  Training RMSE =
>> 1.3206317416138509 (37.693118023000004 s).
>> Running 9 iterations
>> For 9 iter.: Test RMSE  = 1.3255293380139364  Training RMSE =
>> 1.3207661218210436 (41.046175661 s).
>> Running 13 iterations
>> For 13 iter.: Test RMSE  = 1.3255295352665748  Training RMSE =
>> 1.3207663201865092 (47.763619515 s).
>> Running 17 iterations
>> For 17 iter.: Test RMSE  = 1.32552953555787  Training RMSE =
>> 1.3207663204794406 (59.682361103000005 s).
>> Running 21 iterations
>> For 21 iter.: Test RMSE  = 1.3255295355583026  Training RMSE =
>> 1.3207663204798756 (57.210578232 s).
>> Running 25 iterations
>> For 25 iter.: Test RMSE  = 1.325529535558303  Training RMSE =
>> 1.3207663204798765 (65.785485882 s).
>> 
>> Thanks a lot,
>> Kostas
>> 
>> On Nov 26, 2014, at 12:04 PM, Nick Pentreath <nick.pentre...@gmail.com>
>> wrote:
>> 
>> copying user group - I keep replying directly vs reply all :)
>> 
>> On Wed, Nov 26, 2014 at 2:03 PM, Nick Pentreath <nick.pentre...@gmail.com>
>> wrote:
>>> 
>>> ALS will be guaranteed to decrease the squared error (therefore RMSE) in
>>> each iteration, on the training set.
>>> 
>>> This does not hold for the test set / cross validation. You would expect
>>> the test set RMSE to stabilise as iterations increase, since the algorithm
>>> converges - but not necessarily to decrease.
>>> 
>>> On Wed, Nov 26, 2014 at 1:57 PM, Kostas Kloudas <kklou...@gmail.com>
>>> wrote:
>>>> 
>>>> Hi all,
>>>> 
>>>> I am getting familiarized with Mllib and a thing I noticed is that
>>>> running the MovieLensALS
>>>> example on the movieLens dataset for increasing number of iterations does
>>>> not decrease the
>>>> rmse.
>>>> 
>>>> The results for 0.6% training set and 0.4% test are below. For training
>>>> set to 0.8%, the results
>>>> are almost identical. Shouldn’t it be normal to see a decreasing error?
>>>> Especially going from 1 to 5 iterations.
>>>> 
>>>> Running 1 iterations
>>>> Test RMSE for 1 iter. = 1.2452964343277886 (52.757125927000004 s).
>>>> Running 5 iterations
>>>> Test RMSE for 5 iter. = 1.3258973764470259 (61.183927666 s).
>>>> Running 9 iterations
>>>> Test RMSE for 9 iter. = 1.3260308117704385 (61.84948875800001 s).
>>>> Running 13 iterations
>>>> Test RMSE for 13 iter. = 1.3260310099809915 (73.799510125 s).
>>>> Running 17 iterations
>>>> Test RMSE for 17 iter. = 1.3260310102735398 (77.56512185300001 s).
>>>> Running 21 iterations
>>>> Test RMSE for 21 iter. = 1.3260310102739703 (79.607495074 s).
>>>> Running 25 iterations
>>>> Test RMSE for 25 iter. = 1.326031010273971 (88.631776301 s).
>>>> Running 29 iterations
>>>> Test RMSE for 29 iter. = 1.3260310102739712 (101.178383079 s).
>>>> 
>>>> Thanks  a lot,
>>>> Kostas
>>>> ---------------------------------------------------------------------
>>>> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
>>>> For additional commands, e-mail: user-h...@spark.apache.org
>>>> 
>>> 
>> 
>> 


---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to