And there is a close correlation between n-gram matching scores and edit distance scores.
On Tue, Sep 1, 2009 at 11:44 AM, Sean Owen <[email protected]> wrote: > Yeah that probably kills the idea doesn't it... the 'best' centroid is well > defined this way, but, searching for it may be completely unreasonable. I > see why counts doesn't have this problem. > -- Ted Dunning, CTO DeepDyve
