A bit of experimentation shows that the Unicode soft hyphen character,
U+00AD, is treated as word-dividing for the purposes of MarkLogic word
indexing. I.e. given in one's underlying data

  multi­lingual

then cts:word-query("multilingual") won't match.

Is there any workaround?

-- 
David Sewell, Editorial and Technical Manager
ROTUNDA, The University of Virginia Press
PO Box 801079, Charlottesville, VA 22904-4318 USA
Courier: 310 Old Ivy Way, Suite 302, Charlottesville VA 22903
Email: [email protected]   Tel: +1 434 924 9973
Web: http://rotunda.upress.virginia.edu/
_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general

Reply via email to