On Jan 28, 2008, at 4:04 PM, <[EMAIL PROTECTED]> <[EMAIL PROTECTED]> wrote:
Also, search the archives for Term Vector, as you will find discussion
of it there.
Ah I see, I need to cast it to TermPositionVector. OK.
yep
You may also, eventually, be interested in the new TermVectorMapper
capabilities in 2.3 which should help speed up the processing of term
vectors by providing a callback mechanism to allow you to load them
into data structures that make sense for your application.
Hm.
What I need are the startOffsets of a special term.
I use
TermPositionVector#getOffsets(TermPositionVector.indexOf("foo")).
Can TermVectorMapper speed this up?
And how can I find the offsets of something like "foo bar"? I think
this will get tokenized into 2 terms and thus I have no chance to find
it, right?
I wouldn't say no chance... TermVectorMapper would be good for this,
as you can watch the terms as they are being loaded. Just keep track
of your last term and see if it is "foo" when you hit "bar".
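
A rough sketch of that idea in plain Java, with no Lucene on the classpath. PhraseOffsetCollector and its map() method are hypothetical names made up for illustration, and the sketch assumes each term arrives once, together with all of its positions and start offsets as parallel arrays. Since terms may not arrive in document order under that assumption, it compares positions rather than relying on which term was seen last:

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Lucene: remembers where two terms occur
// and reports the start offsets at which "second" directly follows "first".
class PhraseOffsetCollector {
    private final String first;
    private final String second;
    // token position -> start offset, for every occurrence of the first term
    private final Map<Integer, Integer> firstStartByPosition = new HashMap<Integer, Integer>();
    // token positions of every occurrence of the second term
    private final List<Integer> secondPositions = new ArrayList<Integer>();

    PhraseOffsetCollector(String first, String second) {
        this.first = first;
        this.second = second;
    }

    // Assumed callback shape: one call per term, carrying all of its
    // positions and start offsets (positions[i] belongs to startOffsets[i]).
    void map(String term, int[] positions, int[] startOffsets) {
        if (term.equals(first)) {
            for (int i = 0; i < positions.length; i++) {
                firstStartByPosition.put(positions[i], startOffsets[i]);
            }
        }
        // Deliberately not an else-if, so a phrase like "foo foo" also works.
        if (term.equals(second)) {
            for (int position : positions) {
                secondPositions.add(position);
            }
        }
    }

    // Start offsets of every "first second" occurrence, i.e. every place
    // where the second term sits at the position right after the first.
    List<Integer> phraseStartOffsets() {
        List<Integer> result = new ArrayList<Integer>();
        for (int position : secondPositions) {
            Integer start = firstStartByPosition.get(position - 1);
            if (start != null) {
                result.add(start);
            }
        }
        return result;
    }
}
```

For the text "foo bar baz foo qux foo bar", feeding in "bar" (positions 1 and 6), "baz", "foo" (positions 0, 3, 5 with start offsets 0, 12, 20) and "qux" leaves phraseStartOffsets() with 0 and 20. The order in which the terms are fed in does not matter, because the matching only happens once everything has been collected.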
What kind of special term are you looking for? There may be other
ways of solving your problem...
-Grant