I think Ted's suggestion is you'll find Lucene will be _a lot faster_
for this task as you don't need all the other trappings of a DB.
On Jul 13, 2009, at 4:36 AM, Sean Owen wrote:
How does Lucene go from item-item links to recommendations? I'm
missing where the notion of user ratings, or even users, come into
play, or the strength of the association.
If the issue is really just storing the item-item links efficiently in
a way that isn't in memory, how about I cook up a JDBC-based
implementation? Seem more direct.
On Fri, Jul 10, 2009 at 11:56 PM, Ted Dunning<[email protected]>
wrote:
Yes.
One gotcha is that you generally have to limit document size a bit
to get
good performance. This is not a big deal because document
normalization
makes it hard for these documents to be retrieved in any case.
Also, these
are typically not good second order recommendations. First order
recommendations are the top-40 kinds of things and make poor
recommendations
for a bunch of reasons. Second order recommendations are those
that are
based on your history. They make much better recommendations.
On Fri, Jul 10, 2009 at 3:50 PM, Jason Rutherglen <
[email protected]> wrote:
Interesting. So we're creating the item-item matrix using one of
the Mahout
algorithms (like Taste?), then dumping it into Lucene.
--------------------------
Grant Ingersoll
http://www.lucidimagination.com/
Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)
using Solr/Lucene:
http://www.lucidimagination.com/search