On Mon, 2020-05-11 at 22:29 +0200, Christian Grün wrote: > Providing access to the starts and ends may be difficult due to all > the logical operators that can be used
A way to go from ($input, $phrases) to a $input autmented with db:milestone elements each containing starts="0 7 23" ends="2 6 18" attributes (where the numbers are positional in the sequene of phrases) might be good. Or the mileston element could iclude the phrase, I saw his <db:milestone> <db:start ref="3">naked hooves</db:start> <db:start ref="6">unshod</db:start> </db:milestonr>bare<db:milestone> <db:end ref="6" /></db:mileston> feet.... as two problems are (1) overlapping results, and (2) query expansion using a thesaurus and/or stemming. Liam > (ftor, ftand, ftnot, not in). A > simple example: > > let $xml := <_>a b c d</_> update {} > return ft:mark($xml[text() contains text 'b c' ftand 'c d']) > > We could possibly make the full data structures available that need > to > be internally generated. I fear people wouldn’t really work with it > as > they are fairly complex (a look into the specification may give you > an > impression of that [1]). > > But thanks for your thoughts, I’ll let them grow. > > [1] https://www.w3.org/TR/xpath-full-text-10/#FTOperatorsSemanticsSec -- Liam Quin, https://www.delightfulcomputing.com/ Available for XML/Document/Information Architecture/XSLT/ XSL/XQuery/Web/Text Processing/A11Y training, work & consulting. Barefoot Web-slave, antique illustrations: http://www.fromoldbooks.org