This is all good stuff, Ted. Thank you.
For the task at hand, I am focusing on what is available in Taste as
an expression of some level of capability for doing CF.
Two things that aren't clear to me just yet from the Taste APIs are:
1. Given a new user with no ratings, recommend items. I see the
recommenders have an estimatePreference() method; maybe that helps. I
suppose the other option is to assume the user rates all items as
average and go from there.
2. As a related approach, given a user visiting an item, recommend
other items. For this case, I imagine that if I transpose the model
to go from items->users, I can get a set of recommended users; then,
from those users (reverting back to the original model), I can get
recommended items. (See the sketch after this list.)
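To make 2 concrete, here is the rough shape of what I have in mind
using Taste's item-based pieces. I am going from memory on the
org.apache.mahout.cf.taste class names and assuming the long-ID
flavor of the API, so treat this as a sketch of my reading of the
javadocs rather than tested code:

    import java.io.File;
    import java.util.List;

    import org.apache.mahout.cf.taste.impl.model.file.FileDataModel;
    import org.apache.mahout.cf.taste.impl.recommender.GenericItemBasedRecommender;
    import org.apache.mahout.cf.taste.impl.similarity.LogLikelihoodSimilarity;
    import org.apache.mahout.cf.taste.model.DataModel;
    import org.apache.mahout.cf.taste.recommender.RecommendedItem;
    import org.apache.mahout.cf.taste.similarity.ItemSimilarity;

    public class ItemToItemSketch {
      public static void main(String[] args) throws Exception {
        // ratings.csv is a stand-in: one userID,itemID,rating triple per line
        DataModel model = new FileDataModel(new File("ratings.csv"));

        // Item-item similarity is computed over users-per-item vectors,
        // i.e. the transposed view of the model described in 2 above
        ItemSimilarity similarity = new LogLikelihoodSimilarity(model);
        GenericItemBasedRecommender recommender =
            new GenericItemBasedRecommender(model, similarity);

        // Case 2: a (possibly brand-new) user is looking at item 42;
        // recommend other items without needing any ratings from them
        List<RecommendedItem> similarItems = recommender.mostSimilarItems(42L, 10);
        for (RecommendedItem item : similarItems) {
          System.out.println(item.getItemID() + " : " + item.getValue());
        }
      }
    }

If mostSimilarItems() does what I think it does, it would sidestep the
manual transpose-and-revert dance in 2, and since it never consults the
visiting user's ratings it may cover the new-user situation in 1 as well.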
On Jun 18, 2009, at 4:43 PM, Ted Dunning wrote:
Grant,
The data you described is pretty simple and should produce good
results at all levels of overlap. That it does not is definitely a
problem. In fact, I would recommend making the data harder to deal
with by giving non-Lincoln items highly variable popularities and then
making the groundlings rate items according to their popularity. The
inclusion of any number of non-Lincoln fans will then show an apparent
pattern of liking popular items. The correct inference should,
however, be that any neighbor group with a large number of Lincoln
fans seems to like popular items less than expected.
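To be concrete about the construction (everything here is illustrative
and has nothing to do with Taste; the Zipf-like popularity curve and
all the constants are arbitrary choices of mine):

    import java.util.Random;

    // Harder synthetic data: non-Lincoln items get highly variable
    // (roughly Zipf-distributed) popularities, and groundling users
    // sample items to rate in proportion to that popularity.
    public class SyntheticRatings {
      public static void main(String[] args) {
        Random rnd = new Random(42);
        int numItems = 1000;
        double[] popularity = new double[numItems];
        double total = 0;
        for (int i = 0; i < numItems; i++) {
          popularity[i] = 1.0 / (i + 1);   // item 0 is far more popular
          total += popularity[i];
        }
        // Each of 500 groundlings rates 20 popularity-weighted items
        for (int user = 0; user < 500; user++) {
          for (int r = 0; r < 20; r++) {
            double u = rnd.nextDouble() * total;
            int item = 0;
            while (item < numItems - 1 && u > popularity[item]) {
              u -= popularity[item];
              item++;
            }
            System.out.println(user + "," + item + "," + (1 + rnd.nextInt(5)));
          }
        }
      }
    }

Mixing a block of Lincoln fans (who rate Lincoln items regardless of
global popularity) into output like this gives you the adversarial
case: popularity explains most of the variance unless the measure
corrects for it.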
For problems like this, I have had good luck with using measures that
were robust in the face of noise (aka small counts) and in the face of
large external trends (aka the top-40 problem). The simplest one that
I know of is the generalized multinomial log-likelihood ratio
<http://tdunning.blogspot.com/2008/03/surprise-and-coincidence.html>
that you hear me nattering about so often. LSI does a decent job of
dealing with the top-40 problem, but has trouble with small counts.
LDA and related probabilistic methods should work somewhat better than
the log-likelihood ratio, but are considerably more complex to
implement.
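For reference, the 2x2 special case of that score is only a few lines
of code. This follows the entropy formulation in the post linked
above; the class and method names are my own:

    // Log-likelihood ratio test for a 2x2 contingency table:
    //   k11 = users who liked both A and B
    //   k12 = users who liked A but not B
    //   k21 = users who liked B but not A
    //   k22 = users who liked neither
    public class Llr {

      static double xLogX(long x) {
        return x == 0 ? 0.0 : x * Math.log(x);
      }

      // Unnormalized Shannon entropy of a set of counts
      static double entropy(long... counts) {
        long sum = 0;
        double sumXLogX = 0.0;
        for (long c : counts) {
          sum += c;
          sumXLogX += xLogX(c);
        }
        return xLogX(sum) - sumXLogX;
      }

      static double logLikelihoodRatio(long k11, long k12, long k21, long k22) {
        double rowEntropy = entropy(k11 + k12, k21 + k22);
        double colEntropy = entropy(k11 + k21, k12 + k22);
        double matEntropy = entropy(k11, k12, k21, k22);
        // G^2 = 2 * (H(rows) + H(cols) - H(cells)); a large value means
        // the cooccurrence is surprising given the marginal frequencies
        return 2.0 * (rowEntropy + colEntropy - matEntropy);
      }

      public static void main(String[] args) {
        // A rare pair that always cooccurs scores high despite small counts
        System.out.println(logLikelihoodRatio(10, 0, 0, 10000));
        // A pair whose overlap is exactly what the marginals predict
        // (the pure top-40 case) scores 0
        System.out.println(logLikelihoodRatio(1000, 1000, 1000, 1000));
      }
    }

Large counts are not required for a large score; what matters is how
far the cell counts depart from what the margins predict.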
The key here is to compare counts within the local neighborhood to
counts outside the neighborhood. Things that are significantly
different about the neighborhood relative to the rest of the world are
candidates for recommendation. Things to avoid when looking for
interesting differences include:
- correlation measures such as Pearson's R (based on a normal
distribution approximation and unscaled, so it suffers from both
small-count and top-40 problems)
- anomaly measures based simply on frequency ratios (very sensitive to
small-count problems, and does not account for top-40 at all)
What I would recommend for a nearest neighbor approach is to continue
with the current neighbor retrieval, but switch to a log-likelihood
ratio for generating recommendations.
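In Taste terms, I believe that switch looks roughly like the
following. I am going from memory on the class names
(LogLikelihoodSimilarity and friends in org.apache.mahout.cf.taste),
so check the signatures against the actual release:

    import java.io.File;
    import java.util.List;

    import org.apache.mahout.cf.taste.impl.model.file.FileDataModel;
    import org.apache.mahout.cf.taste.impl.neighborhood.NearestNUserNeighborhood;
    import org.apache.mahout.cf.taste.impl.recommender.GenericUserBasedRecommender;
    import org.apache.mahout.cf.taste.impl.similarity.LogLikelihoodSimilarity;
    import org.apache.mahout.cf.taste.model.DataModel;
    import org.apache.mahout.cf.taste.neighborhood.UserNeighborhood;
    import org.apache.mahout.cf.taste.recommender.RecommendedItem;
    import org.apache.mahout.cf.taste.recommender.Recommender;
    import org.apache.mahout.cf.taste.similarity.UserSimilarity;

    public class LlrUserBasedSketch {
      public static void main(String[] args) throws Exception {
        DataModel model = new FileDataModel(new File("ratings.csv"));

        // Keep the existing nearest-neighbor retrieval, but rank with
        // LLR instead of a correlation measure
        UserSimilarity similarity = new LogLikelihoodSimilarity(model);
        UserNeighborhood neighborhood =
            new NearestNUserNeighborhood(50, similarity, model);
        Recommender recommender =
            new GenericUserBasedRecommender(model, neighborhood, similarity);

        List<RecommendedItem> recs = recommender.recommend(123L, 10);
        for (RecommendedItem rec : recs) {
          System.out.println(rec.getItemID() + " : " + rec.getValue());
        }
      }
    }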
What I would recommend for a production system would be to scrap the
nearest neighbor approach entirely and go to a cooccurrence-matrix
based approach. This costs much less to compute at recommendation time
and is very robust against both small counts and top-40 issues.
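A bare-bones illustration of the cooccurrence idea, independent of
Taste (the data structures and toy data are mine; in a real system the
counts would be filtered with the LLR test above before use):

    import java.util.HashMap;
    import java.util.List;
    import java.util.Map;

    // Toy cooccurrence recommender: count offline how often items appear
    // together in user histories, then score candidates for a user by
    // summing cooccurrence counts with the items in that user's history.
    public class CooccurrenceSketch {

      // itemA -> (itemB -> number of users who have both)
      static Map<Long, Map<Long, Integer>> buildCooccurrence(
          Map<Long, List<Long>> userHistories) {
        Map<Long, Map<Long, Integer>> cooc = new HashMap<>();
        for (List<Long> items : userHistories.values()) {
          for (long a : items) {
            for (long b : items) {
              if (a != b) {
                cooc.computeIfAbsent(a, k -> new HashMap<>())
                    .merge(b, 1, Integer::sum);
              }
            }
          }
        }
        return cooc;
      }

      // No neighborhood search at recommendation time: scoring is just
      // one sparse row lookup per item in the user's history.
      static Map<Long, Integer> score(Map<Long, Map<Long, Integer>> cooc,
                                      List<Long> history) {
        Map<Long, Integer> scores = new HashMap<>();
        for (long item : history) {
          Map<Long, Integer> row = cooc.get(item);
          if (row != null) {
            row.forEach((other, count) -> {
              if (!history.contains(other)) {
                scores.merge(other, count, Integer::sum);
              }
            });
          }
        }
        return scores;
      }

      public static void main(String[] args) {
        Map<Long, List<Long>> histories = new HashMap<>();
        histories.put(1L, List.of(10L, 20L, 30L));
        histories.put(2L, List.of(10L, 20L));
        histories.put(3L, List.of(20L, 30L));
        Map<Long, Map<Long, Integer>> cooc = buildCooccurrence(histories);
        System.out.println(score(cooc, List.of(10L)));  // items 20 and 30
      }
    }

The expensive part (building the matrix) happens offline and only
where counts are nonzero; that is where the savings at recommendation
time come from.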
On Thu, Jun 18, 2009 at 9:37 AM, Sean Owen <[email protected]> wrote:
Still seems a little funny. I am away from the code, otherwise I would
check; I forget whether I ever implemented weighting in the standard
correlation similarity metrics. A pure correlation does not account
for whether you overlapped in 10 or 1000 items. This sort of weighting
exists elsewhere, but I forget about here. It might help.
--
Ted Dunning, CTO
DeepDyve