I'm not sure I understand what your field arrangement would be when you say
"[T]he items I'm pulling in from the web contain large bodies of text
(descriptions) whereas the products in my catalog consist of shorter fields
such as product name, manufacturer, product code, etc. So using the smaller
Have you looked at Lucene's "MoreLikeThis"? I confess I haven't
worked with this enough to recommend *how* to use it, but it seems
like it's in the general area you're talking about.
http://lucene.apache.org/java/3_5_0/api/contrib-queries/org/apache/lucene/search/similar/MoreLikeThis.html
Best
Er
Thanks for the response Donna. That would make more sense, but the items
I'm pulling in from the web contain large bodies of text (descriptions)
whereas the products in my catalog consist of shorter fields such as
product name, manufacturer, product code, etc. So using the smaller fields
from my ca
Maybe I'm misunderstanding what you're trying to do, but why not do it the
other
way around; that is, index the items in your catalog, and use the items on
the web
as the query into the catalog. I have an analogous process (though
completely
different application area) and I index the stuff th