RE: Using Lucene to match document sets to each other

2011-12-19 Thread Paul Allan Hill
I'm not sure I understand what your field arrangement would be when you say "[T]he items I'm pulling in from the web contain large bodies of text (descriptions) whereas the products in my catalog consist of shorter fields such as product name, manufacturer, product code, etc. So using the smaller

Re: Using Lucene to match document sets to each other

2011-12-16 Thread Erick Erickson
Have you looked at Lucene's "MoreLikeThis"? I confess I haven't worked with this enough to recommend *how* to use it, but it seems like it's in the general area you're talking about. http://lucene.apache.org/java/3_5_0/api/contrib-queries/org/apache/lucene/search/similar/MoreLikeThis.html Best Er

Re: Using Lucene to match document sets to each other

2011-12-16 Thread Josh Stone
Thanks for the response Donna. That would make more sense, but the items I'm pulling in from the web contain large bodies of text (descriptions) whereas the products in my catalog consist of shorter fields such as product name, manufacturer, product code, etc. So using the smaller fields from my ca

Re: Using Lucene to match document sets to each other

2011-12-16 Thread Donna L Gresh
Maybe I'm misunderstanding what you're trying to do, but why not do it the other way around? That is, index the items in your catalog and use the items on the web as the query into the catalog. I have an analogous process (though in a completely different application area) and I index the stuff th
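
For readers skimming the thread, here is a minimal sketch of the "index the catalog, query with the web item" idea in plain Python. It is not Lucene code and not anyone's actual implementation from this thread: the `CatalogIndex` class, the field names, the product data, and the simple IDF-weighted scoring are all assumptions made for illustration. It only shows how a long description can be used as a query against short catalog fields, with rare terms (such as a product code) counting for more than common ones.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


class CatalogIndex:
    """Hypothetical toy inverted index over short catalog fields."""

    def __init__(self):
        self.postings = defaultdict(set)  # term -> set of product ids
        self.products = {}                # product id -> field dict

    def add(self, product_id, fields):
        """Index every term of every field value for one product."""
        self.products[product_id] = fields
        for value in fields.values():
            for term in tokenize(value):
                self.postings[term].add(product_id)

    def idf(self, term):
        """Rare terms get a higher weight; unknown terms weigh nothing."""
        df = len(self.postings.get(term, ()))
        return math.log(1 + len(self.products) / df) if df else 0.0

    def match(self, description, top_n=3):
        """Use a long web description as the query into the catalog."""
        scores = Counter()
        for term in set(tokenize(description)):
            weight = self.idf(term)
            for product_id in self.postings.get(term, ()):
                scores[product_id] += weight
        return scores.most_common(top_n)


# Made-up catalog entries, for illustration only.
index = CatalogIndex()
index.add("p1", {"name": "PowerShot A1200", "manufacturer": "Canon", "code": "A1200"})
index.add("p2", {"name": "Coolpix S3100", "manufacturer": "Nikon", "code": "S3100"})
index.add("p3", {"name": "PowerShot SX230", "manufacturer": "Canon", "code": "SX230"})

# Made-up web item with a longer body of text.
description = (
    "Great deal on the Canon PowerShot A1200, "
    "a compact digital camera with 4x zoom"
)
best = index.match(description)
for product_id, score in best:
    print(product_id, round(score, 3))
```

Words in the description that never occur in the catalog contribute nothing to the score, so the length of the web text matters less than which catalog terms it happens to contain. A real Lucene setup would use its own analyzers and scoring rather than this hand-rolled weighting.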