RE: Fuzzy search in solr
Thank you jack for the response. Fuzzy search is the syntax for a term, not a handler. For example: alpha~1 will match terms that have an editing distance of 0 or 1 from alpha. So the search query string will be like - /term?q= alpha~1 But, are you sure you really mean fuzzy search, as opposed to, say, spellcheck? Yes, I need to implement fuzzy search. Spell check matches exact string but fuzzy search is done for approximately matching strings. Please correct me if my understanding is wrong. -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: Friday, May 24, 2013 6:40 PM To: solr-user@lucene.apache.org Subject: Re: Fuzzy search in solr Fuzzy search is the syntax for a term, not a handler. For example: alpha~1 will match terms that have an editing distance of 0 or 1 from alpha. All of the search handlers support fuzzy search. Some query parsers, such as dismax, do not, but the standard Solr query parser and edismax query parser do. But, are you sure you really mean fuzzy search, as opposed to, say, spellcheck? -- Jack Krupansky -Original Message- From: Sagar Chaturvedi Sent: Friday, May 24, 2013 2:43 AM To: solr-user@lucene.apache.org Subject: Fuzzy search in solr Hi, How to perform fuzzy search in solr? Which request handler is used for fuzzy search by default? Regards, Sagar DISCLAIMER: --- The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. It shall not attach any liability on the originator or NEC or its affiliates. Any views or opinions presented in this email are solely those of the author and may not necessarily reflect the opinions of NEC or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of the author of this e-mail is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. . --- DISCLAIMER: --- The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. It shall not attach any liability on the originator or NEC or its affiliates. Any views or opinions presented in this email are solely those of the author and may not necessarily reflect the opinions of NEC or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of the author of this e-mail is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. . ---
Re: Fuzzy search in solr
Fuzzy query is invoked just like any other query: .../select?q=alpha~1 -- Jack Krupansky -Original Message- From: Sagar Chaturvedi Sent: Sunday, May 26, 2013 11:27 PM To: solr-user@lucene.apache.org Subject: RE: Fuzzy search in solr Thank you jack for the response. Fuzzy search is the syntax for a term, not a handler. For example: alpha~1 will match terms that have an editing distance of 0 or 1 from alpha. So the search query string will be like - /term?q= alpha~1 But, are you sure you really mean fuzzy search, as opposed to, say, spellcheck? Yes, I need to implement fuzzy search. Spell check matches exact string but fuzzy search is done for approximately matching strings. Please correct me if my understanding is wrong. -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent: Friday, May 24, 2013 6:40 PM To: solr-user@lucene.apache.org Subject: Re: Fuzzy search in solr Fuzzy search is the syntax for a term, not a handler. For example: alpha~1 will match terms that have an editing distance of 0 or 1 from alpha. All of the search handlers support fuzzy search. Some query parsers, such as dismax, do not, but the standard Solr query parser and edismax query parser do. But, are you sure you really mean fuzzy search, as opposed to, say, spellcheck? -- Jack Krupansky -Original Message- From: Sagar Chaturvedi Sent: Friday, May 24, 2013 2:43 AM To: solr-user@lucene.apache.org Subject: Fuzzy search in solr Hi, How to perform fuzzy search in solr? Which request handler is used for fuzzy search by default? Regards, Sagar DISCLAIMER: --- The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. It shall not attach any liability on the originator or NEC or its affiliates. Any views or opinions presented in this email are solely those of the author and may not necessarily reflect the opinions of NEC or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of the author of this e-mail is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. . --- DISCLAIMER: --- The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. It shall not attach any liability on the originator or NEC or its affiliates. Any views or opinions presented in this email are solely those of the author and may not necessarily reflect the opinions of NEC or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of the author of this e-mail is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. . ---
Fuzzy search in solr
Hi, How to perform fuzzy search in solr? Which request handler is used for fuzzy search by default? Regards, Sagar DISCLAIMER: --- The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. It shall not attach any liability on the originator or NEC or its affiliates. Any views or opinions presented in this email are solely those of the author and may not necessarily reflect the opinions of NEC or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of the author of this e-mail is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. . ---
Re: Fuzzy search in solr
Fuzzy search is the syntax for a term, not a handler. For example: alpha~1 will match terms that have an editing distance of 0 or 1 from alpha. All of the search handlers support fuzzy search. Some query parsers, such as dismax, do not, but the standard Solr query parser and edismax query parser do. But, are you sure you really mean fuzzy search, as opposed to, say, spellcheck? -- Jack Krupansky -Original Message- From: Sagar Chaturvedi Sent: Friday, May 24, 2013 2:43 AM To: solr-user@lucene.apache.org Subject: Fuzzy search in solr Hi, How to perform fuzzy search in solr? Which request handler is used for fuzzy search by default? Regards, Sagar DISCLAIMER: --- The contents of this e-mail and any attachment(s) are confidential and intended for the named recipient(s) only. It shall not attach any liability on the originator or NEC or its affiliates. Any views or opinions presented in this email are solely those of the author and may not necessarily reflect the opinions of NEC or its affiliates. Any form of reproduction, dissemination, copying, disclosure, modification, distribution and / or publication of this message without the prior written consent of the author of this e-mail is strictly prohibited. If you have received this email in error please delete it and notify the sender immediately. . ---
Re: Question about Fuzzy search in Solr
Hello! Is this what you are looking for https://lucene.apache.org/core/old_versioned_docs/versions/3_0_0/queryparsersyntax.html#Fuzzy%20Searches ? -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch Hi, I need to know how we can implement fuzzy searches using Solr. Can someone provide any links to any relevant documentation ?
Re: Question about Fuzzy search in Solr
Thanks. Is any extra configuration from the Solr side to make this work ? Any additional text files like synonyms.txt, any additional fields or any changes in schema.xml or solrconfig.xml ? On Mon, Sep 17, 2012 at 4:45 PM, Rafał Kuć r@solr.pl wrote: Hello! Is this what you are looking for https://lucene.apache.org/core/old_versioned_docs/versions/3_0_0/queryparsersyntax.html#Fuzzy%20Searches ? -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch Hi, I need to know how we can implement fuzzy searches using Solr. Can someone provide any links to any relevant documentation ? -- Thanks and Regards Rahul A. Warawdekar
Re: Question about Fuzzy search in Solr
Hello! There is no need to include any changes or additional component to have fuzzy search working in Solr. -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch Thanks. Is any extra configuration from the Solr side to make this work ? Any additional text files like synonyms.txt, any additional fields or any changes in schema.xml or solrconfig.xml ? On Mon, Sep 17, 2012 at 4:45 PM, Rafał Kuć r@solr.pl wrote: Hello! Is this what you are looking for https://lucene.apache.org/core/old_versioned_docs/versions/3_0_0/queryparsersyntax.html#Fuzzy%20Searches ? -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch Hi, I need to know how we can implement fuzzy searches using Solr. Can someone provide any links to any relevant documentation ?
Re: Question about Fuzzy search in Solr
Got it. Thanks Rafał ! On Mon, Sep 17, 2012 at 6:37 PM, Rafał Kuć r@solr.pl wrote: Hello! There is no need to include any changes or additional component to have fuzzy search working in Solr. -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch Thanks. Is any extra configuration from the Solr side to make this work ? Any additional text files like synonyms.txt, any additional fields or any changes in schema.xml or solrconfig.xml ? On Mon, Sep 17, 2012 at 4:45 PM, Rafał Kuć r@solr.pl wrote: Hello! Is this what you are looking for https://lucene.apache.org/core/old_versioned_docs/versions/3_0_0/queryparsersyntax.html#Fuzzy%20Searches ? -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch Hi, I need to know how we can implement fuzzy searches using Solr. Can someone provide any links to any relevant documentation ? -- Thanks and Regards Rahul A. Warawdekar
Re: Question about Fuzzy search in Solr
That doc is out of date for 4.0. See the 4.0 Javadoc on FuzzyQuery for updated info. The tilda right operand is now an integer editing distance (number of times to insert char, delete char, change char, or transpose two adjacent chars to map index term to query term) that is limited to 2. Be aware that if you use fuzzy query in 3.6/3.6.1 or earlier, it will change when you go to 4.0. -- Jack Krupansky -Original Message- From: Rafał Kuć Sent: Monday, September 17, 2012 7:15 AM To: solr-user@lucene.apache.org Subject: Re: Question about Fuzzy search in Solr Hello! Is this what you are looking for https://lucene.apache.org/core/old_versioned_docs/versions/3_0_0/queryparsersyntax.html#Fuzzy%20Searches ? -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch Hi, I need to know how we can implement fuzzy searches using Solr. Can someone provide any links to any relevant documentation ?
Re: Question about Fuzzy search in Solr
Thanks Jack. We are using Solr 3.4. On Mon, Sep 17, 2012 at 8:18 PM, Jack Krupansky j...@basetechnology.comwrote: That doc is out of date for 4.0. See the 4.0 Javadoc on FuzzyQuery for updated info. The tilda right operand is now an integer editing distance (number of times to insert char, delete char, change char, or transpose two adjacent chars to map index term to query term) that is limited to 2. Be aware that if you use fuzzy query in 3.6/3.6.1 or earlier, it will change when you go to 4.0. -- Jack Krupansky -Original Message- From: Rafał Kuć Sent: Monday, September 17, 2012 7:15 AM To: solr-user@lucene.apache.org Subject: Re: Question about Fuzzy search in Solr Hello! Is this what you are looking for https://lucene.apache.org/**core/old_versioned_docs/**versions/3_0_0/** queryparsersyntax.html#Fuzzy%**20Searcheshttps://lucene.apache.org/core/old_versioned_docs/versions/3_0_0/queryparsersyntax.html#Fuzzy%20Searches ? -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch Hi, I need to know how we can implement fuzzy searches using Solr. Can someone provide any links to any relevant documentation ? -- Thanks and Regards Rahul A. Warawdekar
Re: Fuzzy search and solr 4.0
Well the good news is FuzzyQuery is indeed much faster in Lucene/Solr 4.0. But the bad news is... FuzzyQuery won't do what you need here. You need some sort of FuzzyPhraseQuery, which is able to replace terms similar to one another (comp/company/corporation) by some metric. I don't know of such a query in Lucene/Solr... but it'd be a nice addition. Others have asked about this before. FuzzyQuery finds terms close to other terms, when measured by edit distance, eg fuzzy/wuzzy/muzzy are all edit distance one from each other. Mike http://blog.mikemccandless.com On Wed, May 18, 2011 at 8:03 PM, Guilherme Aiolfi grad...@gmail.com wrote: Hi, I want to do a fuzzy search that compare a phrase to a field in solr. For example: abc company ltda will be compared to abc comp, abc corporation, def company ltda, nothing to match here. The thing is the it has to always returns documents sorted by its score. I've found some good algorithms to do that, like StrikeAMatch[1] and JaroWinkler. Using the JaroWinkler with strdist() I can do exactly that. But, I rather prefer to use the StrikeAMatch that had a patch in the lucene jira that was never commited. So, I contacted the author of that patch and he told me that I should use the solr 4.0 that it has now some pretty good new fuzzy search enhancements that made StrikeAMatch seems toys for kids. Anyone know how can I achieve that using solr 4.0? [1] http://www.catalysoft.com/articles/StrikeAMatch.html
Re: Fuzzy search and solr 4.0
You, or any other solr member, knows a good fuzzy string matching library to recommend? On Thu, May 19, 2011 at 9:39 AM, Michael McCandless luc...@mikemccandless.com wrote: Well the good news is FuzzyQuery is indeed much faster in Lucene/Solr 4.0. But the bad news is... FuzzyQuery won't do what you need here. You need some sort of FuzzyPhraseQuery, which is able to replace terms similar to one another (comp/company/corporation) by some metric. I don't know of such a query in Lucene/Solr... but it'd be a nice addition. Others have asked about this before. FuzzyQuery finds terms close to other terms, when measured by edit distance, eg fuzzy/wuzzy/muzzy are all edit distance one from each other. Mike http://blog.mikemccandless.com On Wed, May 18, 2011 at 8:03 PM, Guilherme Aiolfi grad...@gmail.com wrote: Hi, I want to do a fuzzy search that compare a phrase to a field in solr. For example: abc company ltda will be compared to abc comp, abc corporation, def company ltda, nothing to match here. The thing is the it has to always returns documents sorted by its score. I've found some good algorithms to do that, like StrikeAMatch[1] and JaroWinkler. Using the JaroWinkler with strdist() I can do exactly that. But, I rather prefer to use the StrikeAMatch that had a patch in the lucene jira that was never commited. So, I contacted the author of that patch and he told me that I should use the solr 4.0 that it has now some pretty good new fuzzy search enhancements that made StrikeAMatch seems toys for kids. Anyone know how can I achieve that using solr 4.0? [1] http://www.catalysoft.com/articles/StrikeAMatch.html
Fuzzy search and solr 4.0
Hi, I want to do a fuzzy search that compare a phrase to a field in solr. For example: abc company ltda will be compared to abc comp, abc corporation, def company ltda, nothing to match here. The thing is the it has to always returns documents sorted by its score. I've found some good algorithms to do that, like StrikeAMatch[1] and JaroWinkler. Using the JaroWinkler with strdist() I can do exactly that. But, I rather prefer to use the StrikeAMatch that had a patch in the lucene jira that was never commited. So, I contacted the author of that patch and he told me that I should use the solr 4.0 that it has now some pretty good new fuzzy search enhancements that made StrikeAMatch seems toys for kids. Anyone know how can I achieve that using solr 4.0? [1] http://www.catalysoft.com/articles/StrikeAMatch.html