Re: Lucene search question

Grant Ingersoll Tue, 13 Nov 2007 09:10:39 -0800


On Nov 13, 2007, at 11:59 AM, Steven D. Majewski wrote:

Lucene is great at finding documents, but not quite as good at finding
things IN documents. The index contains pointers to the terms, but they
are pointers to a token in the parsed token stream, so to find a
character index into a file, you have to (I believe) run the text thru
the tokenizer again. (But the Lucene API gives you access to everything,
even if it's not simple or easy. I think there are some new features in
the latest version that can make this sort of thing easier, but I
haven't yet figured out how to use them.)

You can use Term Vectors to access the offset (and position) information for a document.
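
To make the difference between a token position and a character offset concrete, here is a minimal sketch in plain Java. It is not Lucene code: the class OffsetSketch and its methods buildVector and snippets are hypothetical names made up for illustration, and the tokenizer is a naive letter-or-digit splitter. It only mimics the kind of per-term data a term vector stored with positions and offsets carries. In Lucene 2.x itself, as far as I know, you would index the field with Field.TermVector.WITH_POSITIONS_OFFSETS and read the data back through IndexReader.getTermFreqVector(docId, field), cast to TermPositionVector; please check that against the javadoc for your version.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in for a term vector stored with positions and offsets.
// Plain Java only; none of this is Lucene API.
final class OffsetSketch {

    // Maps each lowercased term to its occurrences in the text.
    // Each occurrence is {tokenPosition, startOffset, endOffset}, end exclusive.
    static Map<String, List<int[]>> buildVector(String text) {
        Map<String, List<int[]>> vector = new LinkedHashMap<>();
        int position = 0; // index of the token in the token stream
        int i = 0;        // character index into the original text
        while (i < text.length()) {
            // Skip anything that is not a letter or digit.
            if (!Character.isLetterOrDigit(text.charAt(i))) {
                i++;
                continue;
            }
            int start = i;
            while (i < text.length() && Character.isLetterOrDigit(text.charAt(i))) {
                i++;
            }
            String term = text.substring(start, i).toLowerCase(Locale.ROOT);
            vector.computeIfAbsent(term, k -> new ArrayList<>())
                  .add(new int[] {position, start, i});
            position++;
        }
        return vector;
    }

    // Returns the original substrings a term matched, using only the stored
    // offsets, so the text does not have to be tokenized a second time.
    static List<String> snippets(String text, Map<String, List<int[]>> vector, String term) {
        List<String> out = new ArrayList<>();
        for (int[] occ : vector.getOrDefault(term, new ArrayList<>())) {
            out.add(text.substring(occ[1], occ[2]));
        }
        return out;
    }
}
```

With the offsets kept next to each term, locating a hit inside the document is a lookup plus a substring call. The cost is storage: every occurrence of every term carries a position and two offsets.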


