On Mon, 2020-05-11 at 22:29 +0200, Christian Grün wrote:

> Providing access to the starts and ends may be difficult due to all
> the logical operators that can be used 

A way to go from ($input, $phrases) to a $input autmented with
db:milestone elements each containing starts="0 7 23" ends="2 6 18"
attributes (where the numbers are positional in the sequene of phrases)
might be good. Or the mileston element could iclude the phrase,

I saw his <db:milestone>
   <db:start ref="3">naked hooves</db:start>
   <db:start ref="6">unshod</db:start>
</db:milestonr>bare<db:milestone>
  <db:end ref="6" /></db:mileston> feet....

as two problems are (1) overlapping results, and (2) query expansion
using a thesaurus and/or stemming.

Liam

> (ftor, ftand, ftnot, not in). A
> simple example:
> 
>   let $xml := <_>a b c d</_> update {}
>   return ft:mark($xml[text() contains text 'b c' ftand 'c d'])
> 
> We could possibly make the full data structures available that need
> to
> be internally generated. I fear people wouldn’t really work with it
> as
> they are fairly complex (a look into the specification may give you
> an
> impression of that [1]).
> 
> But thanks for your thoughts, I’ll let them grow.
> 
> [1] https://www.w3.org/TR/xpath-full-text-10/#FTOperatorsSemanticsSec
-- 
Liam Quin, https://www.delightfulcomputing.com/
Available for XML/Document/Information Architecture/XSLT/
XSL/XQuery/Web/Text Processing/A11Y training, work & consulting.
Barefoot Web-slave, antique illustrations:  http://www.fromoldbooks.org

Reply via email to