Ahh, Exactly what I was looking for. Thanks Patrick.

On Wed, Jun 2, 2021 at 6:16 PM Patrick Zhai <zhai7...@gmail.com> wrote:

> Hi Elbek,
> Maybe go with ContainedByIntervalsSource? ContainingIntervalsSource is
> actually the big source filtered by small source, and
> ContainedByIntervalsSource is the opposite so it should give the expect
> behavior?
>
> Best
> Patrick
>
> elbek kamoliddinov <elbek....@gmail.com> 于2021年6月2日周三 下午2:55写道:
>
> > Hello everyone,
> >
> > I am experimenting with Interval Queries to get phrase match count within
> > parts of an indexed field.
> >
> > ContainingIntervalsSource seemed to be the way to go but, it only
> considers
> > at most a single match per region.
> > Example:
> > Field value: "[a b c d e a c] e f g h [a c k]" (opening and closing
> square
> > braces are not part of the text, but shows region of the field I am
> > interested in)
> >
> > Within the regions in the field I am trying to find all phrase match
> > positions for say 'a c' with slop=1
> > First region has 2 matches and the second region has 1.
> > ContainingIntervalsSource produces an iterator that produces the first
> > match position for the first region ([a b c d e a c]). but there are two
> > matches in this region. It seems this behavior is by design. Is it
> possible
> > to accomplish this with the existing interval sources or one should
> write a
> > custom one for this?
> >
> > On a related note, does it make sense for ContainingIntervalsSource to
> > produce multiple match positions for the first segment?
> >
> > Thanks,
> > Elbek.
> >
>

Reply via email to