[
https://issues.apache.org/jira/browse/LUCENE-4282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13427264#comment-13427264
]
Robert Muir commented on LUCENE-4282:
-------------------------------------
thanks for reporting and looking into this!
I think the bug is just the use of floats at all in this enum.
{noformat}
- if (similarity > minSimilarity) {
+ if (ed <= maxEdits) {
boostAtt.setBoost((similarity - minSimilarity) * scale_factor);
//System.out.println(" yes");
return AcceptStatus.YES;
} else {
+ System.out.println("reject: " + term.utf8ToString());
return AcceptStatus.NO;
}
{noformat}
This seems to fix it for me. We should remove all float crap from this enum,
we dont need it, only a slower deprecated class in the sandbox needs it.
> Automaton Fuzzy Query doesn't deliver all results
> -------------------------------------------------
>
> Key: LUCENE-4282
> URL: https://issues.apache.org/jira/browse/LUCENE-4282
> Project: Lucene - Core
> Issue Type: Bug
> Components: core/search
> Affects Versions: 4.0-ALPHA
> Reporter: Johannes Christen
> Assignee: Robert Muir
> Labels: newbie
> Attachments: ModifiedFuzzyTermsEnum.java, ModifiedFuzzyTermsEnum.java
>
>
> Having a small index with n documents where each document has one of the
> following terms:
> WEBER, WEBE, WEB, WBR, WE, (and some more)
> The new FuzzyQuery (Automaton) with maxEdits=2 only delivers the expected
> terms WEBER and WEBE in the rewritten query. The expected terms WEB and WBR
> which have an edit distance of 2 as well are missing.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]