[
https://issues.apache.org/jira/browse/SOLR-4565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13662871#comment-13662871
]
Erlend Garåsen commented on SOLR-4565:
--------------------------------------
There are not so many differences between the stemming rules for these two
languages. The only difference is that you must skip some rules for Nynorsk if
you have configuring the stemmer to only use Bokmål.
Both Nynorsk and Bokmål have endings with "-ene", for instance many feminine
indefinite nouns in plural form such as "jentene" (same for both languages).
For these nouns, you must only exclude stemming for words ending with "-ane" if
you have configured it for Bokmål.
The same rules apply to masculine indefinite nouns in plural form for Nynorsk,
i.e. endings with "-ar". The stemmer must skip those endings as long as only
Bokmål is used.
> Extend NorwegianMinimalStemFilter to handle "nynorsk"
> -----------------------------------------------------
>
> Key: SOLR-4565
> URL: https://issues.apache.org/jira/browse/SOLR-4565
> Project: Solr
> Issue Type: Improvement
> Components: Schema and Analysis
> Reporter: Jan Høydahl
>
> Norway has two official languages, both called "Norwegian", namely Bokmål
> (nb_NO) and Nynorsk (nn_NO).
> The NorwegianMinimalStemFilter and NorwegianLightStemFilter today only works
> with the largest of the two, namely Bokmål.
> Propose to incorporate "nn" support through a new "vaiant" config option:
> * variant="nb" or not configured -> Bokmål as today
> * variant="nn" -> Nynorsk only
> * variant="no" -> Remove stems for both nb and nn
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]