[
https://issues.apache.org/jira/browse/LUCENE-8093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16287789#comment-16287789
]
Alan Woodward commented on LUCENE-8093:
---------------------------------------
I think TrimFilter would normally be used with a Tokenizer that doesn't split
things up, like KeywordTokenizer or NGramTokenizer, in which case removing
surrounding whitespace seems like a natural normalization to me, much like
lowercasing. The issue with stemmers is that they can completely change the
token, so that prefix or fuzzy queries no longer make sense, and I don't think
that applies here?
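
To make the distinction concrete, here is a rough sketch in plain Java (no
Lucene classes involved). Everything in it is an assumption for illustration:
normalizeTerm is a hypothetical stand-in for trimming plus lowercasing (using
String.trim(), which may not match TrimFilter's exact whitespace definition),
and toyStem is a deliberately crude, made-up stemmer, not any real Lucene
stemmer.

```java
import java.util.Locale;

public class NormalizeSketch {

    // Hypothetical stand-in for trim + lowercase applied to a single term.
    static String normalizeTerm(String term) {
        return term.trim().toLowerCase(Locale.ROOT);
    }

    // Made-up toy stemmer: strips a trailing "ing" or "s" from longer words.
    static String toyStem(String term) {
        if (term.endsWith("ing") && term.length() > 5) {
            return term.substring(0, term.length() - 3);
        }
        if (term.endsWith("s") && term.length() > 3) {
            return term.substring(0, term.length() - 1);
        }
        return term;
    }

    public static void main(String[] args) {
        // Trimming and lowercasing keep the token's characters in place,
        // so a normalized prefix still matches the normalized indexed term.
        String indexed = normalizeTerm("  New York  ");
        String prefix = normalizeTerm(" New Yo");
        System.out.println(indexed.startsWith(prefix)); // true

        // Stemming rewrites the indexed token, so a prefix of the original
        // word can stop matching: "running" becomes "runn".
        String stemmed = toyStem("running");
        System.out.println(stemmed.startsWith("runni")); // false
    }
}
```

The point of the sketch is only that trimming, like lowercasing, maps a term
and its prefixes consistently, while a stemmer can change the indexed token in
a way a partial term cannot follow.
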
> TrimFilterFactory should implement MultiTermAwareComponent
> ----------------------------------------------------------
>
> Key: LUCENE-8093
> URL: https://issues.apache.org/jira/browse/LUCENE-8093
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Alan Woodward
> Assignee: Alan Woodward
> Priority: Minor
>
> TrimFilter will work perfectly well in CustomAnalyzer.normalize(), so
> TrimFilterFactory should implement MultiTermAwareComponent.