Rather than start a new thread, I'd like to follow up on this. I'm going to
oversimplify, but the basic question should be straightforward.
I currently have one very large SOLR index, and 5 small ones which contain
filtered subsets out of the big one and are used for faceting in one area of
our
Hi,
I'm looking through the list archives and the documentation on boost
queries, and I don't see anything that matches this case.
I have an index of documents, some of which are very similar but not
identical. Therefore the scores are very close and the ordering is affected
by somewhat
That is correct. We see similar, if not longer, query times when we run a
regular query through the admin tool using the same terms that MLT is selecting. I
was testing MLT to see if this was an unavoidable consequence of having
terms that occur in a large number of documents, or whether it was some
Is there any way of doing this with the SOLR handler? I was looking for a
param something like mlt.maxdf that could be applied in order to enforce
this type of condition, but there doesn't seem to be one (in 1.3 at least).
Walter Underwood wrote:
Common terms are not that useful for More
After rebuilding my index over the weekend with termVectors enabled for the
relevant fields, I've run some basic testing against the MoreLikeThis
handler with these settings from the SOLR Wiki {boost=true, mindf=1,
mintf=1}.
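
For anyone wanting to reproduce this kind of test, here is a minimal sketch of how a request with those settings might be assembled. The parameter names (mlt.fl, mlt.mindf, mlt.mintf, mlt.boost) are the standard MoreLikeThis ones; the host, the /mlt handler path, the document id and the field names are assumptions for illustration only and will depend on your solrconfig.xml and schema.

```python
from urllib.parse import urlencode


def build_mlt_url(base_url, doc_query, fields, mindf=1, mintf=1, boost=True):
    """Build a MoreLikeThis request URL (sketch only, nothing is sent)."""
    params = {
        "q": doc_query,                    # query selecting the source document
        "mlt.fl": ",".join(fields),        # fields to mine for interesting terms
        "mlt.mindf": mindf,                # minimum document frequency for a term
        "mlt.mintf": mintf,                # minimum term frequency in the source doc
        "mlt.boost": "true" if boost else "false",
    }
    return base_url + "?" + urlencode(params)


# Hypothetical host, handler path, id and field names.
url = build_mlt_url(
    "http://localhost:8983/solr/mlt",
    "id:12345",
    ["title", "body"],
)
print(url)
```

With mindf=1 and mintf=1 almost no terms are filtered out, so very common terms stay eligible for selection. Raising either value narrows the candidate terms.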
My index contains around 20M documents, averaging under 1K of content
the updated documents
will get the vectors added and the others won't get them but will continue
to work, allowing me to rebuild in the background while not breaking
anything in my existing incremental update/release cycle.
I appreciate your help.
Eric Kilby