You're describing word-through. Still got my fingers crossed for that RFE ;-)

On Jul 8, 2009, at 10:07 AM, David Sewell wrote:

Phrase-around doesn't work within a word. Even with <hy> defined as a
phrase-around, "multi<hy>-</hy>lingual" will get indexed as two words,
"multi" and "lingual".

David

On Wed, 8 Jul 2009, Mike Sokolov wrote:

Not sure if this will work, but have you tried replacing hyphens with an
element such as:
multi<hy>-</hy>lingual and defining the element as a phrase- around? If ML actually treats that as a single word, you should be ok; you would also have
to remove hyphens from search terms

-Mike

David Sewell wrote:
A bit of experimentation shows that the Unicode soft hyphen character, U+00AD, is treated as word-dividing for the purposes of MarkLogic word
indexing. I.e. given in one's underlying data

 multi&#xAD;lingual

then cts:word-query("multilingual") won't match.

Is there any workaround?


_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general


--
David Sewell, Editorial and Technical Manager
ROTUNDA, The University of Virginia Press
PO Box 801079, Charlottesville, VA 22904-4318 USA
Courier: 310 Old Ivy Way, Suite 302, Charlottesville VA 22903
Email: [email protected]   Tel: +1 434 924 9973
Web: http://rotunda.upress.virginia.edu/
_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general

--
Shannon Scott Shiflett, XML Programmer
ROTUNDA, The University of Virginia Press
PO Box 801079, Charlottesville, VA 22904-4318 USA
Courier: 310 Old Ivy Way, Suite 302, Charlottesville VA 22903
Email: [email protected]   Tel: +1 434 924 4495
Web: http://rotunda.upress.virginia.edu/

_______________________________________________
General mailing list
[email protected]
http://xqzone.com/mailman/listinfo/general

Reply via email to