On Oct 19, 2010, at 12:20 PM, Robert Muir wrote:

> On Tue, Oct 19, 2010 at 12:17 PM, DM Smith <dmsmith...@gmail.com> wrote:
> 
>> I'd be surprised if there are use cases for non-reuse.
>> 
>> IIRC: When we started down the reuse path, the goal was reuse only, not just 
>> reuse by default. But in order to bridge the past to the future, there was 
>> the possibility of continued non-reuse. In a sense non-reuse was deprecated, 
>> but I'm not sure that @deprecated as a mechanism was able to clearly 
>> indicate that.
>> 
> 
> Exactly: i don't think theres a clear way to detect that your
> tokenStream() method is "reuse-safe" and deprecate it: e.g. you have
> to implement reset() correctly in your tokenstreams.
> 
> But lets think about this: for non-experts, making Analyzer "reusable
> by default" by removing reusableTokenStream() and reusing
> tokenStream() would probably be the single largest indexing
> performance improvement we could make... the API is so confusing that
> I think many people probably have analyzers that aren't reusing today.
> 
> I think its worth considering a backwards break, especially since as
> Mike mentioned, for the very special (possibly even only theoretical!)
> non-reuse case, there are ways they could still index: but the "fast
> way" should be the "easy/default way".

To me, the backwards break is merely a code break. I can't see how it would 
break an index.

-- DM


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to