[
https://issues.apache.org/jira/browse/SOLR-4678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13623660#comment-13623660
]
Robert Muir commented on SOLR-4678:
-----------------------------------
Thanks Pierre: Actually this file is synchronized from
lucene/analysis/common/src/resources/org/apache/lucene/analysis/snowball/french_stop.txt
(via a ant task from solr/ 'ant sync-analyzers')
I think we should patch this file so its in the default lucene stoplist, too.
It might also be a good idea for us to send an email about this to the snowball
list ([email protected]) as thats where this file came from,
they might be interested in the improvement, too.
> Missing word "cela" in conf/lang/stopwords_fr.txt
> -------------------------------------------------
>
> Key: SOLR-4678
> URL: https://issues.apache.org/jira/browse/SOLR-4678
> Project: Solr
> Issue Type: Improvement
> Components: Schema and Analysis
> Affects Versions: 4.2
> Reporter: Pierre Kobylanski
> Priority: Trivial
> Attachments: stopwords_fr.txt.patch
>
> Original Estimate: 10m
> Remaining Estimate: 10m
>
> NB: Not sure this defect is assigned to the right component.
> In file example/solr/collection1/conf/lang/stopwords_fr.txt,
> there is the word "celà". Though incorrect in French (cf
> http://fr.wiktionary.org/wiki/cel%C3%A0), it's common, but we may also add
> the correct spelling (e.g. "cela", whitout accent) to that stopwords list.
> Another thing: I noticed that "celà" is the only word of the list followed by
> an unbreakable space. Is that wanted?
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]