Interesting, thanks Franklin I'll put that in my notebook to try as well.

-andy


On Wed, Aug 12, 2009 at 2:56 PM, Franklin Simmons <
[email protected]> wrote:

> Apologies, in my reply I incorrectly stated that one would need to account
> for analyzer behaviors, which is completely untrue.
>
> To clarify, at indexing time the offset information can be stored with
> little effort - see Field.TermVector.WITH_OFFSETS and
> SegmentTermPositionVector. This should be faster and less involved than
> re-analyzing the original content at "highlight" time, with the trade-off
> being a larger index and a slight increase in indexing time.
>
>
>
>
>
> -----Original Message-----
> From: Franklin Simmons [mailto:[email protected]]
> Sent: Wednesday, August 12, 2009 3:17 PM
> To: [email protected]
> Subject: RE: get text pointer from hit or possibly highlighter
>
> Andrew,
>
> If you have control over indexing, you might accomplish this with
> TermPositionVector information, however, bear in mind that analyzers often
> discard text, e.g. the StandardAnalyzer doesn't index the word 'the', which
> you would have to account for.
>
>
> -----Original Message-----
> From: Andrew Schuler [mailto:[email protected]]
> Sent: Wednesday, August 12, 2009 12:45 PM
> To: [email protected]
> Subject: get text pointer from hit or possibly highlighter
>
> I've been doing some research trying to find out about getting a text
> position pointer for hits and this list is my last hope. If I have a
> (rather
> long) text document indexed and I get a hit on said document but the search
> term shows up near the end of the doc it would be nice to be able to know
> the position of the hit inside the doc itself. In .NET I'm thinking of
> something like a TextPointer. Does anyone know of a clever way to do this
> with Lucene.Net?
>

Reply via email to