Unfortunately, a phrase-around doesn't work within a word. Even with <hy>
defined as a phrase-around, "multi<hy>-</hy>lingual" still gets indexed as
two words, "multi" and "lingual".
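
One possible fallback, offered only as a sketch: strip U+00AD from the text
before it is loaded, so the indexer never sees a break inside the word. The
snippet below is plain Python run as a preprocessing step outside MarkLogic.
The function name is hypothetical, and it assumes the soft hyphens are not
needed in the stored copy (if they are needed for display, the original
would have to be kept separately).

```python
# Sketch of a pre-load cleanup step; not a MarkLogic API.
SOFT_HYPHEN = "\u00ad"  # Unicode SOFT HYPHEN, written &#xAD; in XML


def strip_soft_hyphens(text: str) -> str:
    """Return text with every U+00AD removed (hypothetical helper)."""
    return text.replace(SOFT_HYPHEN, "")


if __name__ == "__main__":
    sample = "multi\u00adlingual"
    print(strip_soft_hyphens(sample))
```

With the character removed, the stored text is simply "multilingual", which
is the form a search for that word expects.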

David

On Wed, 8 Jul 2009, Mike Sokolov wrote:

> Not sure if this will work, but have you tried replacing hyphens with an
> element such as:
> multi<hy>-</hy>lingual and defining the element as a phrase-around?  If ML
> actually treats that as a single word, you should be ok; you would also have
> to remove hyphens from search terms.
>
> -Mike
>
> David Sewell wrote:
> > A bit of experimentation shows that the Unicode soft hyphen character,
> > U+00AD, is treated as word-dividing for the purposes of MarkLogic word
> > indexing. That is, given the following in one's underlying data
> >
> >   multi&#xAD;lingual
> >
> > then cts:word-query("multilingual") won't match.
> >
> > Is there any workaround?
> >
> >
> _______________________________________________
> General mailing list
> [email protected]
> http://xqzone.com/mailman/listinfo/general
>

-- 
David Sewell, Editorial and Technical Manager
ROTUNDA, The University of Virginia Press
PO Box 801079, Charlottesville, VA 22904-4318 USA
Courier: 310 Old Ivy Way, Suite 302, Charlottesville VA 22903
Email: [email protected]   Tel: +1 434 924 9973
Web: http://rotunda.upress.virginia.edu/