This certainly needs more testing (I haven't had the time), but here is
the gist:

I modified the SpanNotQuery to allow a certain number of span crossings,
making it something of a WithinSpanQuery. So instead of only being able to
say find "something" and "something else" and don't let the match cross a
paragraph marker span, you can say find them and let the match cross up to
3 paragraph marker spans. I then made a special standard analyzer that uses
a standard sentence recognizer regex to inject sentence marker tokens.
Paragraphs seem harder to detect, so right now the analyzer just looks for
the paragraph symbol... perhaps a double newline would be better, though. I
still have not worked out the best para/sent marker tokens to put in the
index or the best way to mark paragraphs in the input text. I would also
like a paragraph marker to double as a sentence marker so that the two do
not need to be doubled up.
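
To make the idea concrete, here is a rough plain-Java sketch of the two
pieces: injecting marker tokens at sentence and paragraph boundaries, and
accepting a match only if it crosses at most N markers. It does not use
Lucene and is not the Qsol code. The class name, the "<sent>" and "<para>"
tokens, the naive sentence regex, and the double-newline paragraph rule are
all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch only; not the Qsol or Lucene implementation. */
class WithinSpanSketch {
    // Assumed marker tokens; the real index markers are still undecided.
    static final String SENT = "<sent>";
    static final String PARA = "<para>";

    /** Lowercases words and appends a marker token after each sentence and paragraph. */
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        // Assumption: a blank line (double newline) ends a paragraph.
        for (String para : text.split("\\n\\s*\\n")) {
            if (para.trim().isEmpty()) continue;
            // Naive sentence boundary: '.', '!' or '?' followed by whitespace.
            for (String sentence : para.trim().split("(?<=[.!?])\\s+")) {
                for (String word : sentence.split("\\W+")) {
                    if (!word.isEmpty()) tokens.add(word.toLowerCase());
                }
                tokens.add(SENT);
            }
            tokens.add(PARA);
        }
        return tokens;
    }

    /** True if some occurrence of a and of b has at most maxCrossings markers between them. */
    static boolean within(List<String> tokens, String a, String b,
                          String marker, int maxCrossings) {
        for (int i = 0; i < tokens.size(); i++) {
            if (!tokens.get(i).equals(a)) continue;
            for (int j = 0; j < tokens.size(); j++) {
                if (!tokens.get(j).equals(b)) continue;
                int lo = Math.min(i, j), hi = Math.max(i, j);
                int crossings = 0;
                // Count marker tokens strictly between the two terms.
                for (int k = lo + 1; k < hi; k++) {
                    if (tokens.get(k).equals(marker)) crossings++;
                }
                if (crossings <= maxCrossings) return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        List<String> t = tokenize("The cat sat. It purred.\n\nThe dog barked.");
        System.out.println(t);
        System.out.println(within(t, "cat", "purred", SENT, 0)); // different sentences
        System.out.println(within(t, "cat", "purred", SENT, 1)); // one crossing allowed
        System.out.println(within(t, "cat", "dog", PARA, 1));    // one paragraph crossing
    }
}
```

With maxCrossings set to 0 this behaves like the plain "do not span a
marker" case; raising it gives the "within N sentences/paragraphs"
behaviour. Note that this sketch still emits both a sentence and a
paragraph marker at the end of a paragraph, which is the doubling up
mentioned above.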


- Mark

On 10/15/06, Paul Elschot <[EMAIL PROTECTED]> wrote:

Mark,

you wrote:
> > On another note...http://famestalker.com
> >
...
>
>  http://famestalker.com/devwiki/

Could you explain how "Paragraph/Sentence Proximity Searching"
is implemented in Qsol?

Regards,
Paul Elschot
