As far as I know there are no options to change word-tokenization in the manner you describe. So you would have to find a way to enumerate your word-pairs in a thesaurus, or reprocess your content so that it will be indexed as desired.
Or... back away from the immediate technical problem and describe the wider problem. Someone might see a better answer. -- Mike On 12 Apr 2012, at 10:42 , Abhishek53 S wrote: > Hi Mike, > > Thanks for asking. We don't have any predefined term list for thesaurus > which will help us in expanding "walmart" against "wal-mart".. So still the > problem lies in getting those documents where term "wal-mart" (with hyphen) > is actually present by the term "walmart" (without hyphen).. > > Can we have any configuration or query level option that will allow to ignore > the punctuation and treat the adjacent words as single word. If not do we > have any work around to achieve that. > > Kind Regards > Abhishek Srivastav > Systems Engineer > Tata Consultancy Services > Cell:- +91-9883389968 > Mailto: [email protected] > Website: http://www.tcs.com > ____________________________________________ > Experience certainty. IT Services > Business Solutions > Outsourcing > ____________________________________________ > > [email protected] wrote: ----- > > To: MarkLogic Developer Discussion <[email protected]> > From: Michael Blakeley <[email protected]> > Sent by: [email protected] > Date: 04/12/2012 10:47PM > Subject: Re: [MarkLogic Dev General] issue with words separated by > punctuations. > > Why does a large number of words preclude thesaurus expansion? > > -- Mike > > On 12 Apr 2012, at 03:27 , Abhishek53 S wrote: > > > > > Hi Folks, > > > > I am not confident if the requirement is possible through any configuration > > change in Marklogic. I have content with values > > <company>wal-mart</company>. There are thousand of values with hypen(-) > > separated. I want to get those documents back when user is searching with > > "walmart" i.e term without hyphen. I can not assign those words to > > thesaurus as they are large in count. > > > > Thanks > > -Abhishek > > > > =====-----=====-----===== > > Notice: The information contained in this e-mail > > message and/or attachments to it may contain > > confidential or privileged information. If you are > > not the intended recipient, any dissemination, use, > > review, distribution, printing or copying of the > > information contained in this e-mail message > > and/or attachments to it are strictly prohibited. If > > you have received this communication in error, > > please notify us by reply e-mail or telephone and > > immediately and permanently delete the message > > and any attachments. Thank you > > > > > > _______________________________________________ > > General mailing list > > [email protected] > > http://developer.marklogic.com/mailman/listinfo/general > > _______________________________________________ > General mailing list > [email protected] > http://developer.marklogic.com/mailman/listinfo/general > > _______________________________________________ > General mailing list > [email protected] > http://developer.marklogic.com/mailman/listinfo/general _______________________________________________ General mailing list [email protected] http://developer.marklogic.com/mailman/listinfo/general
