[ 
https://issues.apache.org/jira/browse/LUCENE-2062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12784347#action_12784347
 ] 

Robert Muir commented on LUCENE-2062:
-------------------------------------

Thanks for reviewing Mark.

btw there are some comments in the tests, I think this algorithm is too 
conservative in some places (specifically the length constraints).
But I don't have the test collection to verify that modifying these won't 
destroy relevance, so I prefer sticking with the published algorithm.


> Bulgarian Analyzer
> ------------------
>
>                 Key: LUCENE-2062
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2062
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: contrib/analyzers
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1
>
>         Attachments: LUCENE-2062.patch, LUCENE-2062.patch, LUCENE-2062.patch, 
> LUCENE-2062.patch, LUCENE-2062.patch
>
>
> someone asked about bulgarian analysis on solr-user today... 
> http://www.lucidimagination.com/search/document/e1e7a5636edb1db2/non_english_languages
> I was surprised we did not have anything.
> This analyzer implements the algorithm specified here, 
> http://members.unine.ch/jacques.savoy/Papers/BUIR.pdf
> In the measurements there, this improves MAP approx 34%

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to