On Tue, 2006-06-13 at 15:14 -0400, Bob Carpenter wrote: > I refactored the org.apache.lucene.search.FuzzyTermEnum
Nice! > Does anyone have regression/performance test harnesses? I have a quite unscientific test I ran. 500 documents in corpus. One index reader. A new Index searcher for each query iteration. 100 iterations. Each iteration contains 10 queries: is.search(new FuzzyQuery(new Term(field, "hare"), d)); is.search(new FuzzyQuery(new Term(field, "nimi"), d)); is.search(new FuzzyQuery(new Term(field, "miama"), d)); is.search(new FuzzyQuery(new Term(field, "mamma"), d)); is.search(new FuzzyQuery(new Term(field, "sumatra"), d)); is.search(new FuzzyQuery(new Term(field, "buch"), d)); is.search(new FuzzyQuery(new Term(field, "busch"), d)); is.search(new FuzzyQuery(new Term(field, "hejples"), d)); is.search(new FuzzyQuery(new Term(field, "sveden"), d)); is.search(new FuzzyQuery(new Term(field, "cwedish"), d)); I do not gather the Documents from the index. 1. On my issue 550-index Old implementation: 1000 fuzzy~0.1, 633 per min. 1000 fuzzy~0.2, 784 per min. 1000 fuzzy~0.3, 1236 per min. 1000 fuzzy~0.4, 1462 per min. 1000 fuzzy~0.5, 1917 per min. 1000 fuzzy~0.6, 2574 per min. 1000 fuzzy~0.7, 2750 per min. 1000 fuzzy~0.8, 3375 per min. 1000 fuzzy~0.9, 3524 per min. With your fixes: 1000 fuzzy~0.1, 603 per min. 1000 fuzzy~0.2, 886 per min. 1000 fuzzy~0.3, 1403 per min. 1000 fuzzy~0.4, 1681 per min. 1000 fuzzy~0.5, 2165 per min. 1000 fuzzy~0.6, 2961 per min. 1000 fuzzy~0.7, 3137 per min. 1000 fuzzy~0.8, 3948 per min. 1000 fuzzy~0.9, 4594 per min. 2. Standard RAMDirectory: Old implementation: 1000 fuzzy~0.1, 121 per min. 1000 fuzzy~0.2, 190 per min. 1000 fuzzy~0.3, 342 per min. 1000 fuzzy~0.4, 456 per min. 1000 fuzzy~0.5, 578 per min. 1000 fuzzy~0.6, 632 per min. 1000 fuzzy~0.7, 645 per min. 1000 fuzzy~0.8, 679 per min. 1000 fuzzy~0.9, 696 per min. With your fixes: 1000 fuzzy~0.1, 117 per min. 1000 fuzzy~0.2, 185 per min. 1000 fuzzy~0.3, 329 per min. 1000 fuzzy~0.4, 425 per min. 1000 fuzzy~0.5, 585 per min. 1000 fuzzy~0.6, 612 per min. 1000 fuzzy~0.7, 653 per min. 1000 fuzzy~0.8, 615 per min. 1000 fuzzy~0.9, 671 per min. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]