Re: XML retrieval with Intervals

2022-05-06 Thread Alan Woodward
I *think* it would be possible to write an IntervalsSource implementation that took opening and closing tags, and did the right thing here - as you say, a standard `contains` will try and minimise things, but you could write something that attempted to match an opening tag with its

Re: XML retrieval with Intervals

2022-05-06 Thread Michael Sokolov
+1 MarkLogic is an excellent product. This Lux thing was inspired by it. On Fri, May 6, 2022 at 11:29 AM Walter Underwood wrote: > > If you need to search XML, consider MarkLogic. It is a very full-featured > database and search engine based on XML. > > https://www.marklogic.com > > Disclaimer:

Re: XML retrieval with Intervals

2022-05-06 Thread Walter Underwood
If you need to search XML, consider MarkLogic. It is a very full-featured database and search engine based on XML. https://www.marklogic.com Disclaimer: I worked there for a couple of years ten years ago. But I’ve been inside that product and it is non-muggle technology. wunder Walter

Re: XML retrieval with Intervals

2022-05-06 Thread Michael Sokolov
Many years ago I had started this Lux project that was designed to build an XML-aware index using Solr; see https://github.com/msokolov/lux/tree/master/src/main/java/lux/index/analysis for the analysis chain I used. Maybe you'll find something useful in this project? It's dormant for years, and

XML retrieval with Intervals

2022-05-06 Thread Mikhail Khludnev
Hi Devs! I found intervals quite nice and natural for retrieving scoped data (thanks, Alan!): foo stuff bar I.containing(I.ordered(I.term(""), I.term("")), I.unordered(I.term("bar"), I.term("foo"))); It works like a charm until it encounter ill nested tags: foo bug bar Due