Re: who clears attributes?

Mark Miller Mon, 10 Aug 2009 15:28:53 -0700

Grant Ingersoll wrote:

On Aug 10, 2009, at 5:12 PM, Shai Erera wrote:
Maybe we should follow what I seem to read from Earwin and Grant -come up w/ real use cases, try to implement them w/ the current API,then if it's impossible, discuss how we can make the current API moreadaptive. If at the end of this we'll get back to the new API, thenwe'll at least feel better about it, and more convinced it is the wayto go.
Well, I have real use cases for it, but all of it is still missing thebiggest piece: search side support. It's the 900 lb. elephant in theroom. The 500 lb. elephant is the fact that all these attributes,AIUI, require you to hook in your own indexing chain, etc. in order toeven be indexed, which is all package private stuff. It's not evenclear to me what happens right now if you were to, say have a TokenStream that, say, had only one Attribute on it and none of theexisting attributes (term buffer, length, position, etc.) Pleasecorrect me if I am wrong, I still don't have a deep understanding ofit all.

Michael has always been up front that this new API is in preparation forflexible indexing. It doesn't give us the goodness - he has laid out thereasons for moving before the goodness comes more than once I think.From my understanding, Michael looked at what Mike was doing in one ofhis flexible indexing patches, wondered how some of the TokenStreamstuff was going to work well with it, and came up with this new API as asolution. Yes - it gets us nothing now. But its a big move, and there isno need to do everything at once - in fact it would probably be harderto do it all at once - the rest has always been on the table. 3.0 hasalways been convenient to push it before, as deprecations can than beremoved. Nothing forcing us to make that decision now though.

Honestly, though, it really gives you very little over the current,well functioning payloads capability other than stronger typing, theability to pick only those attributes that you want indexed (intheory) and a byte (or so) of savings per any token that has apayload, and we _HAVE_ right now, search support for payloads.

Payloads gives us nothing as developers - you can't use thatfunctionality without taking it from the users - payloads are for users.

Flexible indexing will lead to all kinds of little cool things - thelikes of which have been discussed a lot in older emails. It will likelylead to things we cannot predict as well. Everything will be moreflexible. It also could play a part in CSF, and work on allowing customfiles to plug into merging. Plus everything else thats been mentioned(pfor, etc) I've been sold on the long term benefits. I don't think youneed these API for them, but its my understanding it helps solve part ofthe equation.

A bunch of issues have come up. To my knowledge, they have beenaddressed with vigor every time. If someone is unhappy with howsomething has been addressed, and it needs to be addressed further,please speak up. Otherwise, I don't think the sky is falling - I thinkthe new API is being shaken out.


Oh, and now it seems the new QP is dependent on it all.

Dependent how?

--
- Mark

http://www.lucidimagination.com




---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Re: who clears attributes?

Reply via email to