[
https://issues.apache.org/jira/browse/LUCENE-2062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12784347#action_12784347
]
Robert Muir commented on LUCENE-2062:
-------------------------------------
Thanks for reviewing Mark.
btw there are some comments in the tests, I think this algorithm is too
conservative in some places (specifically the length constraints).
But I don't have the test collection to verify that modifying these won't
destroy relevance, so I prefer sticking with the published algorithm.
> Bulgarian Analyzer
> ------------------
>
> Key: LUCENE-2062
> URL: https://issues.apache.org/jira/browse/LUCENE-2062
> Project: Lucene - Java
> Issue Type: New Feature
> Components: contrib/analyzers
> Reporter: Robert Muir
> Assignee: Robert Muir
> Priority: Minor
> Fix For: 3.1
>
> Attachments: LUCENE-2062.patch, LUCENE-2062.patch, LUCENE-2062.patch,
> LUCENE-2062.patch, LUCENE-2062.patch
>
>
> someone asked about bulgarian analysis on solr-user today...
> http://www.lucidimagination.com/search/document/e1e7a5636edb1db2/non_english_languages
> I was surprised we did not have anything.
> This analyzer implements the algorithm specified here,
> http://members.unine.ch/jacques.savoy/Papers/BUIR.pdf
> In the measurements there, this improves MAP approx 34%
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]