[jira] [Commented] (LUCENE-8093) TrimFilterFactory should implement MultiTermAwareComponent

Alan Woodward (JIRA) Tue, 12 Dec 2017 07:59:15 -0800

    [ 
https://issues.apache.org/jira/browse/LUCENE-8093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16287789#comment-16287789
 ]


Alan Woodward commented on LUCENE-8093:
---------------------------------------

TrimFilter would I think normally be used with a Tokenizer that doesn't split 
things up, like KeywordTokenizer or NGramTokenizer, in which case removing 
surrounding whitespace seems like a natural normalization to me, much like 
lowercasing.  And the issue with stemmers is that they can completely change 
the token, such that prefixes or fuzzy queries won't make sense, which doesn't 
apply here?

> TrimFilterFactory should implement MultiTermAwareComponent
> ----------------------------------------------------------
>
>                 Key: LUCENE-8093
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8093
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Alan Woodward
>            Assignee: Alan Woodward
>            Priority: Minor
>
> TrimFilter will work perfectly well in CustomAnalyzer.normalize(), so it 
> should implement MultiTermAwareComponent



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (LUCENE-8093) TrimFilterFactory should implement MultiTermAwareComponent

Reply via email to