+1, too. I hate those modes. In my personal opinion, we should not have
analyzers at all. "new Field(name, TokenStream)" is quite enough from a
consistency standpoint!

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de


> -----Original Message-----
> From: Robert Muir [mailto:rcm...@gmail.com]
> Sent: Saturday, October 30, 2010 3:15 PM
> To: dev@lucene.apache.org
> Subject: Re: inconsistency/performance trap of empty terms
> 
> On Sat, Oct 30, 2010 at 9:00 AM, Earwin Burrfoot <ear...@gmail.com> wrote:
> > Speaking of consistency, I think NOT_ANALYZED is superfluous. Drop
> > this mode, and it can be safely reproduced by a NotAnalyzingAnalyzer
> > (insert better name here).
> >
> 
> +1. This is confusing and comes up often on the user list.
> 
> The way I think it happens is like this:
> Joe Schmoe, like a good user, just fires up StandardAnalyzer at both index and
> query time.
> Joe realizes he has a field that really shouldn't be tokenized, and sets it to
> NOT_ANALYZED.
> Joe is confused that queries don't work the way they should when he does this,
> since the query text is still analyzed by the queryparser with StandardAnalyzer.
> 
> It would be far better to force him to use PerFieldAnalyzerWrapper +
> NotAnalyzingAnalyzer or whatever, since then it would work intuitively.
> Besides, if he sets this to NOT_ANALYZED, it actually goes through 'analysis'
> anyway: SingleTokenAttributeSource buried in the indexer.
> And, in trunk, this means things like UTF-8 encoding are assumed, but really
> this should be completely outside of the indexer.
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional
> commands, e-mail: dev-h...@lucene.apache.org
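
The mismatch described in the quoted message can be sketched in plain Java.
This is a toy model under stated assumptions, not Lucene code: ToyAnalyzer,
PerFieldAnalyzer, TOKENIZING and NOT_ANALYZING are hypothetical stand-ins
invented for illustration, and the tokenizer is only a rough imitation of what
a tokenizing analyzer does. It shows why a field indexed as a single term stops
matching when the query side still tokenizes, and why routing both sides
through one per-field analyzer avoids that.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.function.Function;

// Toy model only: none of these types are Lucene classes.
class PerFieldAnalysisSketch {

    /** Hypothetical stand-in for an analyzer: turns field text into terms. */
    interface ToyAnalyzer extends Function<String, List<String>> {}

    /** Lowercases and splits on non-alphanumeric characters. */
    static final ToyAnalyzer TOKENIZING = text -> {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    };

    /** The "NotAnalyzingAnalyzer" idea: the whole value is one unchanged term. */
    static final ToyAnalyzer NOT_ANALYZING = text -> Collections.singletonList(text);

    /** Picks an analyzer by field name, falling back to a default. */
    static final class PerFieldAnalyzer {
        private final ToyAnalyzer defaultAnalyzer;
        private final Map<String, ToyAnalyzer> perField = new HashMap<>();

        PerFieldAnalyzer(ToyAnalyzer defaultAnalyzer) {
            this.defaultAnalyzer = defaultAnalyzer;
        }

        PerFieldAnalyzer add(String field, ToyAnalyzer analyzer) {
            perField.put(field, analyzer);
            return this;
        }

        List<String> analyze(String field, String text) {
            return perField.getOrDefault(field, defaultAnalyzer).apply(text);
        }
    }

    /** A query matches here if every query term is among the indexed terms. */
    static boolean matches(List<String> indexedTerms, List<String> queryTerms) {
        return indexedTerms.containsAll(queryTerms);
    }

    public static void main(String[] args) {
        String value = "AB-123";

        // The trap: the index side skips tokenization, the query side does not.
        boolean flagOnly = matches(NOT_ANALYZING.apply(value), TOKENIZING.apply(value));
        System.out.println("index-time flag only: " + flagOnly);

        // One per-field analyzer used for both indexing and querying.
        PerFieldAnalyzer shared =
                new PerFieldAnalyzer(TOKENIZING).add("id", NOT_ANALYZING);
        boolean consistent =
                matches(shared.analyze("id", value), shared.analyze("id", value));
        System.out.println("shared per-field analyzer: " + consistent);
    }
}
```

In this sketch the first check fails because the index holds the single term
"AB-123" while the query side produces "ab" and "123"; the second succeeds
because both sides consult the same per-field configuration.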


