As far as I know there are no options to change word-tokenization in the manner 
you describe. So you would have to find a way to enumerate your word-pairs in a 
thesaurus, or reprocess your content so that it will be indexed as desired.

Or... back away from the immediate technical problem and describe the wider 
problem. Someone might see a better answer.

-- Mike

On 12 Apr 2012, at 10:42 , Abhishek53 S wrote:

> Hi Mike,
> 
> Thanks for asking.   We don't have any predefined term list for thesaurus 
> which will help us in expanding "walmart" against "wal-mart".. So still the 
> problem lies in getting those documents where term "wal-mart" (with hyphen) 
> is actually present by the term "walmart" (without hyphen).. 
> 
> Can we have any configuration or query level option that will allow to ignore 
> the punctuation and treat the adjacent words as single word. If not do we 
> have any work around to achieve that.
> 
> Kind Regards
> Abhishek Srivastav
> Systems Engineer
> Tata Consultancy Services
> Cell:- +91-9883389968
> Mailto: [email protected]
> Website: http://www.tcs.com
> ____________________________________________
> Experience certainty. IT Services
> Business Solutions
> Outsourcing
> ____________________________________________
> 
> [email protected] wrote: -----
> 
> To: MarkLogic Developer Discussion <[email protected]>
> From: Michael Blakeley <[email protected]>
> Sent by: [email protected]
> Date: 04/12/2012 10:47PM
> Subject: Re: [MarkLogic Dev General] issue with words separated by    
> punctuations.
> 
> Why does a large number of words preclude thesaurus expansion?
> 
> -- Mike
> 
> On 12 Apr 2012, at 03:27 , Abhishek53 S wrote:
> 
> > 
> > Hi Folks, 
> > 
> > I am not confident if the requirement is possible through any configuration 
> > change in Marklogic. I have content with values 
> > <company>wal-mart</company>. There are thousand of values with hypen(-) 
> > separated. I want to get those documents back when user is searching with 
> > "walmart" i.e term without hyphen. I can not assign those words to 
> > thesaurus as they are large in count. 
> > 
> > Thanks 
> > -Abhishek 
> > 
> > =====-----=====-----=====
> > Notice: The information contained in this e-mail
> > message and/or attachments to it may contain 
> > confidential or privileged information. If you are 
> > not the intended recipient, any dissemination, use, 
> > review, distribution, printing or copying of the 
> > information contained in this e-mail message 
> > and/or attachments to it are strictly prohibited. If 
> > you have received this communication in error, 
> > please notify us by reply e-mail or telephone and 
> > immediately and permanently delete the message 
> > and any attachments. Thank you
> > 
> > 
> > _______________________________________________
> > General mailing list
> > [email protected]
> > http://developer.marklogic.com/mailman/listinfo/general
> 
> _______________________________________________
> General mailing list
> [email protected]
> http://developer.marklogic.com/mailman/listinfo/general
> 
> _______________________________________________
> General mailing list
> [email protected]
> http://developer.marklogic.com/mailman/listinfo/general

_______________________________________________
General mailing list
[email protected]
http://developer.marklogic.com/mailman/listinfo/general

Reply via email to