[
https://issues.apache.org/jira/browse/LUCENE-8028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16253457#comment-16253457
]
Robert Muir commented on LUCENE-8028:
-------------------------------------
Can we instead factor out this stemmer into its own stemmer file? I don't think
we should mix together two stemmers in the same file with conditionals. See for
example the German package (or many other languages) where there are multiple
stemmers.
Also lets avoid modifying the analyzer for now. The analyzer just represents
defaults but we shouldn't add conditional options. Instead as a start we should
just add the new stemmer, and make it easy for people to instantiate it e.g. in
CustomAnalyzer.
> Arabic Stemmer improvement for Better Search Accuracy
> -----------------------------------------------------
>
> Key: LUCENE-8028
> URL: https://issues.apache.org/jira/browse/LUCENE-8028
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Ayah Shamandi
> Labels: Arabic, Stemmer, improvement
>
> HI, this is Ayah - bidi developer at IBM Egypt - Globalization Team, we are
> responsible to support Arabic at IBM products and services and as we use
> lucence at many of services, we found that it needs major improvement at
> Arabic stemmer, we implement the following two papers
> https://dl.acm.org/citation.cfm?id=1921657 and
> http://waset.org/publications/10005688/arabic-light-stemmer-for-better-search-accuracy
> to improve lucene arabic stemmer function and would like to open a Pull
> request to let you integrate it as a part of lucene
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]