[ https://issues.apache.org/jira/browse/LUCENE-2286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Muir updated LUCENE-2286:
--------------------------------

    Attachment: LUCENE-2286.patch

Attached is a patch, with a backwards-break entry in CHANGES.

> enable DefaultSimilarity.setDiscountOverlaps by default
> -------------------------------------------------------
>
>                 Key: LUCENE-2286
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2286
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Query/Scoring
>            Reporter: Robert Muir
>         Attachments: LUCENE-2286.patch
>
>
> I think we should enable setDiscountOverlaps in DefaultSimilarity by default.
> If you are using synonyms or commongrams or a number of other
> 0-posInc-term-injecting methods, these currently screw up your length
> normalization.
> These terms have a position increment of zero, so they shouldn't count towards
> the length of the document.
> I've done relevance tests with Persian showing the difference is significant,
> and I think it's a big trap for anyone using synonyms, etc.: your relevance can
> actually get worse if you don't flip this boolean flag.
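
To make the effect described in the issue concrete, here is a minimal, self-contained sketch of length normalization with and without discounting zero-increment tokens. It does not use Lucene: the class `OverlapNormSketch` and its `lengthNorm` helper are hypothetical names for illustration, and the formula 1/sqrt(numTerms) is assumed to mirror DefaultSimilarity's length norm while ignoring field boosts and Lucene's lossy one-byte norm encoding.

```java
public class OverlapNormSketch {

    /**
     * Hypothetical helper (not a Lucene API): returns 1/sqrt(numTerms) for a
     * field whose tokens carry the given position increments.
     * When discountOverlaps is true, tokens with a position increment of 0
     * (e.g. injected synonyms) are left out of the term count.
     */
    static float lengthNorm(int[] positionIncrements, boolean discountOverlaps) {
        int length = positionIncrements.length;
        int overlaps = 0;
        for (int inc : positionIncrements) {
            if (inc == 0) {
                overlaps++; // token stacked on the previous position
            }
        }
        int numTerms = discountOverlaps ? length - overlaps : length;
        return (float) (1.0 / Math.sqrt(numTerms));
    }

    public static void main(String[] args) {
        // Four real tokens, two of which have a synonym injected at the same position.
        int[] increments = {1, 0, 1, 0, 1, 1};

        // Overlaps counted: 6 terms, norm is about 0.408.
        System.out.println("counted:    " + lengthNorm(increments, false));
        // Overlaps discounted: 4 terms, norm is 0.5.
        System.out.println("discounted: " + lengthNorm(increments, true));
    }
}
```

In this sketch, the document with injected synonyms is penalized as if it were 50% longer unless overlaps are discounted. In Lucene itself the behavior is controlled by the flag named in the issue, DefaultSimilarity.setDiscountOverlaps.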