On Mon, Aug 10, 2009 at 22:50, Grant Ingersoll<gsing...@apache.org> wrote:
>
> On Aug 10, 2009, at 2:00 PM, Earwin Burrfoot wrote:
>
>> I'll deviate from the topic somewhat.
>> What are exact benefits that new tokenstream API yields? Are we sure
>> we want it released with 2.9?
>> By now I only see various elaborate problems, but haven't seen a
>> single piece of code becoming simpler.
>
> In theory, it sets up for more indexing/searching possibilities in 3.0, but
> in the meantime, it is proving to be quite problematic due to back
> compatibility restrictions.
I'm not quite sure which exact indexing/searching possibilities does
the new API open for us.
Some new ways of handling text? Okay, I'd like each token to have one
more number in addition to posIncr, so I can have my 'true multiword
synonyms'. Maybe, just maybe, there will be a pair of other
extensions. Usecases here are really scarce. Plus, if they're
successful/useful, they will most probably be included out of the box,
so we don't need much flexibility here.
Something other than text? Numbers, with good rangequeries. Dates.
Spatial data. Your-type-here. For these, flexible text-processing
stream-oriented API is totally useless.

> I have serious doubts about releasing this new API until these performance
> issues are resolved and better proven out from a usability standpoint.
> It simply is too much to swallow for most users, as
> Analyzers/TokenStreams/etc. are easily the most common place for people to
> inject their own capabilities and there is no way we should be
> taking a 30% hit in performance for some theoretical speed up and new search
> capability 1 year from now.
I have a feeling that best idea, before more damage is done, is to
rollback this new API, store the patch, and try rolling it out once
again, when we have usecases/more code to justify it.

-- 
Kirill Zakharenko/Кирилл Захаренко (ear...@gmail.com)
Home / Mobile: +7 (495) 683-567-4 / +7 (903) 5-888-423
ICQ: 104465785

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to