On Sun, 29 Aug 2010, Michael Sokolov wrote:

> We have used the approach of generating a thesaurus using a preprocessor that
> finds all words with the set of characters we're interested in.  The results
> are good in our use cases, but I do think it will be data-dependent since if
> you have a large number of expansions in the thesaurus for a term, that would
> probably be less performant. Also (Jason, you may remember this from a few
> years back), implementing an efficient character-finding thesaurus generator in
> XQuery is quite difficult; we ended up writing that part in Perl.

What made this difficult in XQuery? Was it the less capable regular
expression support compared to Perl, or something else?

-- 
David Sewell, Editorial and Technical Manager
ROTUNDA, The University of Virginia Press
PO Box 400314, Charlottesville, VA 22904-4314 USA
Email: [email protected]   Tel: +1 434 924 9973
Web: http://rotunda.upress.virginia.edu/
_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general
