: It's got one difference from yours, in that the terms are allowed to : occur in any order in the sub-phrases (so phrase "C B" from your : original example is scored like "B C").
there's a much bigger differnece, in that your technique won't reqard documents where B and C are "near" eachother, but A is farther away in the document then the proximity value you calculate. Daniel's goal is to make sure that documents matching any subphrase of the orriginal query get a increase in score based on the length of hte subphrase. in his specific example the orriginal query only had three words, and he wanted all of them to be mandatory, but consider the case where they are all optional. if i search for 'A B C Z', and Z is a nonexistent term, he wants documents matching the phrase "A B C" to get better scores then documents matching the phrase "A B", or "B C" which should get better score then documents that just match the individual terms with large gaps in between them. your approach will still only increase the scores of documents where *all* of the terms appear within some proximity. -Hoss --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]