On Tue, 2006-06-13 at 15:14 -0400, Bob Carpenter wrote:
> I refactored the org.apache.lucene.search.FuzzyTermEnum

Nice!

> Does anyone have regression/performance test harnesses?

I ran a quite unscientific test: 500 documents in the corpus, one
index reader, and a new IndexSearcher for each iteration. There are
100 iterations, and each iteration runs 10 queries:

is.search(new FuzzyQuery(new Term(field, "hare"), d));
is.search(new FuzzyQuery(new Term(field, "nimi"), d));
is.search(new FuzzyQuery(new Term(field, "miama"), d));
is.search(new FuzzyQuery(new Term(field, "mamma"), d));
is.search(new FuzzyQuery(new Term(field, "sumatra"), d));
is.search(new FuzzyQuery(new Term(field, "buch"), d));
is.search(new FuzzyQuery(new Term(field, "busch"), d));
is.search(new FuzzyQuery(new Term(field, "hejples"), d));
is.search(new FuzzyQuery(new Term(field, "sveden"), d));
is.search(new FuzzyQuery(new Term(field, "cwedish"), d));

I do not gather the Documents from the index.
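
For anyone who wants to repeat this, the timing loop is roughly the
sketch below. It is not the exact code I ran: the class and method
names (FuzzyThroughput, queriesPerMinute, perMinute) are made up for
illustration, and the Lucene search call is replaced by a plain
Runnable so the snippet compiles without Lucene on the classpath.

```java
public class FuzzyThroughput {

    /**
     * Runs `iterations` rounds of `queriesPerIteration` calls to `query`
     * and returns the measured throughput in queries per minute.
     */
    static double queriesPerMinute(int iterations, int queriesPerIteration, Runnable query) {
        long start = System.nanoTime();
        for (int i = 0; i < iterations; i++) {
            // In the real test a new IndexSearcher would be opened here.
            for (int q = 0; q < queriesPerIteration; q++) {
                query.run();
            }
        }
        // Guard against a zero interval on very fast runs.
        long elapsedNanos = Math.max(1L, System.nanoTime() - start);
        return perMinute((long) iterations * queriesPerIteration, elapsedNanos);
    }

    /** Converts a query count and an elapsed time in nanoseconds to queries per minute. */
    static double perMinute(long queries, long elapsedNanos) {
        return queries * 60.0e9 / elapsedNanos;
    }

    public static void main(String[] args) {
        // Placeholder workload; substitute the ten FuzzyQuery searches.
        Runnable placeholderQuery = () -> Math.sqrt(System.nanoTime());
        double rate = queriesPerMinute(100, 10, placeholderQuery);
        System.out.println("1000 queries, " + Math.round(rate) + " per min.");
    }
}
```

With 100 iterations of 10 queries this gives the 1000 queries per
run that the figures below refer to.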


1. On my issue 550-index

Old implementation:

1000 fuzzy~0.1, 633 per min.
1000 fuzzy~0.2, 784 per min.
1000 fuzzy~0.3, 1236 per min.
1000 fuzzy~0.4, 1462 per min.
1000 fuzzy~0.5, 1917 per min.
1000 fuzzy~0.6, 2574 per min.
1000 fuzzy~0.7, 2750 per min.
1000 fuzzy~0.8, 3375 per min.
1000 fuzzy~0.9, 3524 per min.

With your fixes:

1000 fuzzy~0.1, 603 per min.
1000 fuzzy~0.2, 886 per min.
1000 fuzzy~0.3, 1403 per min.
1000 fuzzy~0.4, 1681 per min.
1000 fuzzy~0.5, 2165 per min.
1000 fuzzy~0.6, 2961 per min.
1000 fuzzy~0.7, 3137 per min.
1000 fuzzy~0.8, 3948 per min.
1000 fuzzy~0.9, 4594 per min.

2. Standard RAMDirectory:

Old implementation:

1000 fuzzy~0.1, 121 per min.
1000 fuzzy~0.2, 190 per min.
1000 fuzzy~0.3, 342 per min.
1000 fuzzy~0.4, 456 per min.
1000 fuzzy~0.5, 578 per min.
1000 fuzzy~0.6, 632 per min.
1000 fuzzy~0.7, 645 per min.
1000 fuzzy~0.8, 679 per min.
1000 fuzzy~0.9, 696 per min.

With your fixes:

1000 fuzzy~0.1, 117 per min.
1000 fuzzy~0.2, 185 per min.
1000 fuzzy~0.3, 329 per min.
1000 fuzzy~0.4, 425 per min.
1000 fuzzy~0.5, 585 per min.
1000 fuzzy~0.6, 612 per min.
1000 fuzzy~0.7, 653 per min.
1000 fuzzy~0.8, 615 per min.
1000 fuzzy~0.9, 671 per min.
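
The two runs are easier to compare as relative changes. A small
sketch that prints them, with the per-minute figures copied from the
lists above; the class and method names (FuzzyComparison,
percentChange, report) are made up for illustration. Given how
unscientific the test is, small differences are probably noise.

```java
public class FuzzyComparison {

    /** Relative change from `before` to `after`, in percent. */
    static double percentChange(double before, double after) {
        return (after - before) / before * 100.0;
    }

    /** Prints one line per minimum similarity, 0.1 through 0.9. */
    static void report(String label, int[] before, int[] after) {
        System.out.println(label);
        for (int i = 0; i < before.length; i++) {
            double similarity = (i + 1) / 10.0;
            System.out.printf("  fuzzy~%.1f: %d -> %d per min (%+.1f%%)%n",
                    similarity, before[i], after[i],
                    percentChange(before[i], after[i]));
        }
    }

    public static void main(String[] args) {
        // Queries per minute, old implementation vs. refactored one.
        int[] indexOld = {633, 784, 1236, 1462, 1917, 2574, 2750, 3375, 3524};
        int[] indexNew = {603, 886, 1403, 1681, 2165, 2961, 3137, 3948, 4594};
        int[] ramOld = {121, 190, 342, 456, 578, 632, 645, 679, 696};
        int[] ramNew = {117, 185, 329, 425, 585, 612, 653, 615, 671};

        report("1. issue 550-index", indexOld, indexNew);
        report("2. Standard RAMDirectory", ramOld, ramNew);
    }
}
```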



