On Sun, 29 Aug 2010, Michael Sokolov wrote: > We have used the approach of generating a thesaurus using a preprocessor that > finds all words with the set of characters we're interested in. The results > are good in our use cases, but I do think it will be data-dependent since if > you have a large number of expansions in the thesaurus for a term, that would > probably be less performant. Also, (Jason you may remember this from a few > years back) implementing an efficient character-finding thesaurus generator in > xquery is quite difficult; we ended up writing that part in Perl.
What made this difficult using XQuery? The less rich implementation of regular expressions compared to Perl, or something else? -- David Sewell, Editorial and Technical Manager ROTUNDA, The University of Virginia Press PO Box 400314, Charlottesville, VA 22904-4314 USA Email: [email protected] Tel: +1 434 924 9973 Web: http://rotunda.upress.virginia.edu/ _______________________________________________ General mailing list [email protected] http://developer.marklogic.com/mailman/listinfo/general
