[ 
https://issues.apache.org/jira/browse/SOLR-2764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13195863#comment-13195863
 ] 

Robert Muir commented on SOLR-2764:
-----------------------------------

just some general suggestions:

in a light stemmer, i would be wary of derivational endings. 
it seems in the case of dom/het because its dealing with adj/noun that its
on the edge (maybe ok here), but if possible it would be more ideal to
avoid multiple passes... this is the kind of thing that causes snowball 
problems.

Can you think of examples for dom/het where the meaning would be changed?

for example: "freedom" is used the same way in english, but stemming this 
to "free" is very lossy, since free has a variety of meanings (such as costs 
nothing), 
some of which are incompatible with "freedom". This is the danger of stripping
derivational suffixes...

                
> Create a NorwegianLightStemmer and NorwegianMinimalStemmer
> ----------------------------------------------------------
>
>                 Key: SOLR-2764
>                 URL: https://issues.apache.org/jira/browse/SOLR-2764
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: Jan Høydahl
>             Fix For: 3.6, 4.0
>
>         Attachments: SOLR-2764.patch, SOLR-2764.patch, SOLR-2764.patch, 
> SOLR-2764.patch
>
>
> We need a simple light-weight stemmer and a minimal stemmer for 
> plural/singlular only in Norwegian

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira



---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to