On Sat, Mar 11, 2017 at 02:31:04AM +0530, Lakshya Agrawal wrote:
> Hello everyone,
> I was going through the research paper "
> http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.129.4662&rep=rep1&type=pdf";
> (Alternatives to neighborhood based collaborative filtering) .I have
> understood the paper but was not able to comprehend "Normalizing by
> removing global effects(Methods)" properly.
> 
> I have understood that we estimate one "effect" at a time,in sequence . Is
> it that initially the values "X(ui)" are all set to one for all "i"  ? Also
> what about "Theta(u)"  for the first effect is it that  initially
> "n"(number of movies rated my user) random values are assigned to it from a
> Normal Distribution and on later stages it is calculated by using
> "Thetacap(u)"?Also does for
> "X(ui)" depend upon the effect ?
> 
> It would be of great help if  I could be provided more information/paper
> relating to this.

Hi Lakshya,

x_ui corresponds to the user rating for user u and item i (the
"explanatory variable"), so that is what your data is.  But the authors
appear to abuse notation a little, and use different x_ui for different
global effects.  So x_ui does not _depend_ on the global or user effects
but different x_ui may be used depending on the type of effect being
modeled.  I hope that makes sense; personally I think the paper could
have been written better to be more clear about this.  But maybe there
were space limits they had to conform to?

So anyway I think, your total model in the end is

r_ui = \theta_u1 x_ui1 + \theta_u2 x_ui2 + ... + error

where \theta_u1 and x_ui1 correspond to the 'user main effects' (where
x_ui1 = 1 for all u, i) and \theta_u2 and x_ui2 correspond to other
global effects, and so forth.

\theta_u can be estimated by using the estimator they give at the end of
section 3.1.  You can calculate \hat{\theta}_u easily with the last
equation on page 9, and then you can "shrink" this with the last
equation of Section 3.1 to obtain a good estimator for \theta_u.

I hope this clarifies things.  You might also consider looking around to
see if any follow-up work that cites this paper has a better or more
usable perspective.

Thanks,

Ryan

-- 
Ryan Curtin    | "Indeed!"
[email protected] |   - David Lo Pan
_______________________________________________
mlpack mailing list
[email protected]
http://knife.lugatgt.org/cgi-bin/mailman/listinfo/mlpack

Reply via email to