On 11/8/06 10:30 AM, "Chris Hostetter" <[EMAIL PROTECTED]> wrote:
> : Also, the phonetic matches are ranked a bit high, so I'm trying a
> : sub-1.0 boost. I was expecting the lower idf to fix that automatically.
> : The metaphone will almost always have a lower idf because multiple
> : words are mapped to one metaphone, so the encoded term occurs in more
> : documents than the surface terms.
>
> That all makes sense, and yet it's not what you are observing ... which
> leads me to believe you (and I, since I want to agree with you) are missing
> something subtle ... what does the Explanation look like for two
> documents where you feel like one should score higher than the other but
> it doesn't?

That is my next step. Maybe create some test documents in my corpus
and spend some quality time with Explain and grokking DisMax. I need
to customize Similarity anyway.

wunder
--
Walter Underwood
Search Guru, Netflix
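
For reference, the idf argument in the quoted text can be sketched in a few lines. This is only an illustration under stated assumptions: `phonetic_key` is a crude, hypothetical stand-in for Metaphone (not the real algorithm), the four documents are made up, and `classic_idf` uses the formula I understand Lucene's default Similarity to apply, 1 + ln(numDocs / (docFreq + 1)). It shows why the encoded term gets the lower idf; it does not reproduce the ranking problem itself.

```python
import math
from collections import defaultdict


def phonetic_key(word):
    """Hypothetical stand-in for Metaphone: keep the first letter,
    drop later vowels (and 'y'), collapse repeated letters."""
    word = word.lower()
    out = [word[0]]
    for ch in word[1:]:
        if ch in "aeiouy":
            continue
        if ch != out[-1]:
            out.append(ch)
    return "".join(out)


def classic_idf(doc_freq, num_docs):
    """Assumed default idf: 1 + ln(numDocs / (docFreq + 1))."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


def doc_freqs(docs, analyze):
    """Count, for each analyzed term, how many documents contain it."""
    df = defaultdict(int)
    for text in docs:
        for term in {analyze(w) for w in text.split()}:
            df[term] += 1
    return df


# Made-up test corpus: three spellings that share one phonetic key.
docs = [
    "smith family drama",
    "smyth detective story",
    "smithe on the road",
    "jones comedy",
]

surface_df = doc_freqs(docs, str.lower)
phonetic_df = doc_freqs(docs, phonetic_key)

key = phonetic_key("smith")
surface_idf = classic_idf(surface_df["smith"], len(docs))
phonetic_idf = classic_idf(phonetic_df[key], len(docs))

print("surface  'smith' df=%d idf=%.3f" % (surface_df["smith"], surface_idf))
print("phonetic '%s' df=%d idf=%.3f" % (key, phonetic_df[key], phonetic_idf))
```

The phonetic term's document frequency can never be lower than that of any surface term mapped to it, so its idf is never higher. Whether that is enough to push phonetic matches down depends on the other score factors (tf, norms, boosts), which is what the Explanation output should reveal.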