You're describing word-through. Still got my fingers crossed for that
RFE ;-)
On Jul 8, 2009, at 10:07 AM, David Sewell wrote:
Phrase-around doesn't work within a word. Even with <hy> defined as a
phrase-around, "multi<hy>-</hy>lingual" will get indexed as two words,
"multi" and "lingual".
David
On Wed, 8 Jul 2009, Mike Sokolov wrote:
Not sure if this will work, but have you tried replacing hyphens with an
element such as:
multi<hy>-</hy>lingual
and defining the element as a phrase-around? If ML actually treats that
as a single word, you should be ok; you would also have to remove
hyphens from search terms.
-Mike
David Sewell wrote:
A bit of experimentation shows that the Unicode soft hyphen character,
U+00AD, is treated as word-dividing for the purposes of MarkLogic word
indexing. I.e. given in one's underlying data
multi­lingual
then cts:word-query("multilingual") won't match.
Is there any workaround?
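
One possible workaround, not confirmed anywhere in this thread, is to
strip U+00AD from the text before it is loaded, and from search terms
before they are passed to cts:word-query. The sketch below is plain
Python string handling that would run outside MarkLogic; the function
names are hypothetical and nothing here is a MarkLogic API.

```python
# Hypothetical pre-ingest helper: remove Unicode soft hyphens (U+00AD)
# so that a word such as "multi\u00adlingual" is stored, and therefore
# indexed, as the single token "multilingual".
SOFT_HYPHEN = "\u00ad"


def strip_soft_hyphens(text: str) -> str:
    """Return text with every U+00AD character removed."""
    return text.replace(SOFT_HYPHEN, "")


def normalize_search_term(term: str) -> str:
    """Apply the same stripping to a user-supplied search term."""
    return strip_soft_hyphens(term)


if __name__ == "__main__":
    stored = strip_soft_hyphens("multi\u00adlingual")
    print(stored == "multilingual")
```

The obvious cost is that the hyphenation hints are lost from the stored
text, so this only fits if they are not needed for display or can be
kept in a separate copy of the data.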
_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general
--
David Sewell, Editorial and Technical Manager
ROTUNDA, The University of Virginia Press
PO Box 801079, Charlottesville, VA 22904-4318 USA
Courier: 310 Old Ivy Way, Suite 302, Charlottesville VA 22903
Email: [email protected] Tel: +1 434 924 9973
Web: http://rotunda.upress.virginia.edu/
--
Shannon Scott Shiflett, XML Programmer
ROTUNDA, The University of Virginia Press
PO Box 801079, Charlottesville, VA 22904-4318 USA
Courier: 310 Old Ivy Way, Suite 302, Charlottesville VA 22903
Email: [email protected] Tel: +1 434 924 4495
Web: http://rotunda.upress.virginia.edu/