Re: Solr system and numbers
if i wanna search on subsets of number,what can i do? -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-system-and-numbers-tp482519p4057134.html Sent from the Solr - User mailing list archive at Nabble.com.
Re: Solr system and numbers
Do you mean a range (e.g. [4 TO 17]) or a prefix (e.g. 10*)? For range you need to index it as a number. For prefix, string is probably better. Than, just use standard query parameters. Regards, Alex. Personal blog: http://blog.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Thu, Apr 18, 2013 at 9:29 PM, uohzoaix johncho...@gmail.com wrote: if i wanna search on subsets of number,what can i do? -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-system-and-numbers-tp482519p4057134.html Sent from the Solr - User mailing list archive at Nabble.com.
RE: Solr system and numbers
great info ,,, thanks a lot all Date: Mon, 9 Jun 2008 05:58:50 -0700 From: [EMAIL PROTECTED] Subject: Re: Solr system and numbers To: solr-user@lucene.apache.org Hi, Solr/Lucene can treat phone numbers as strings. If you want to clean them up and normalize them outside of Solr, you can do that and feed them into Solr as pure numbers. How the phone numbers will be treated after you pump them into Solr depends on the analyzer you choose to use for this data. If you don't need to search on subsets of phone numbers, then just don't tokenize them (i.e. use string type if the phone numbers contain any non-numeric characters, sint otherwise). Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: dudes dudes To: solr-user@lucene.apache.org Sent: Monday, June 9, 2008 2:10:20 PM Subject: Solr system and numbers Hello experts, How does Solr deal with numbers or phone numbers .. For example if you have 1234 and 12 34 or 1 234... with spaces between the numbers .. Or this is dealt by lucene ? any documentations or tutorial on this ? many thanks, ak _ All new Live Search at Live.com http://clk.atdmt.com/UKM/go/msnnkmgl001006ukm/direct/01/ _ All new Live Search at Live.com http://clk.atdmt.com/UKM/go/msnnkmgl001006ukm/direct/01/
Re: Solr system and numbers
I got a similar question: how would one normalize or even detect if a string is a phone number? On Mon, Jun 9, 2008 at 4:17 PM, dudes dudes [EMAIL PROTECTED] wrote: great info ,,, thanks a lot all Date: Mon, 9 Jun 2008 05:58:50 -0700 From: [EMAIL PROTECTED] Subject: Re: Solr system and numbers To: solr-user@lucene.apache.org Hi, Solr/Lucene can treat phone numbers as strings. If you want to clean them up and normalize them outside of Solr, you can do that and feed them into Solr as pure numbers. How the phone numbers will be treated after you pump them into Solr depends on the analyzer you choose to use for this data. If you don't need to search on subsets of phone numbers, then just don't tokenize them (i.e. use string type if the phone numbers contain any non-numeric characters, sint otherwise). Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: dudes dudes To: solr-user@lucene.apache.org Sent: Monday, June 9, 2008 2:10:20 PM Subject: Solr system and numbers Hello experts, How does Solr deal with numbers or phone numbers .. For example if you have 1234 and 12 34 or 1 234... with spaces between the numbers .. Or this is dealt by lucene ? any documentations or tutorial on this ? many thanks, ak _ All new Live Search at Live.com http://clk.atdmt.com/UKM/go/msnnkmgl001006ukm/direct/01/ _ All new Live Search at Live.com http://clk.atdmt.com/UKM/go/msnnkmgl001006ukm/direct/01/
Re: Solr system and numbers
Not sure. Perhaps it can be done by training a language model and treating phone numbers as named entities? Not sure if it would work. But I know there are a few NLP people subscribed, maybe they'll have some good ideas. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Cam Bazz [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Monday, June 9, 2008 4:24:48 PM Subject: Re: Solr system and numbers I got a similar question: how would one normalize or even detect if a string is a phone number? On Mon, Jun 9, 2008 at 4:17 PM, dudes dudes wrote: great info ,,, thanks a lot all Date: Mon, 9 Jun 2008 05:58:50 -0700 From: [EMAIL PROTECTED] Subject: Re: Solr system and numbers To: solr-user@lucene.apache.org Hi, Solr/Lucene can treat phone numbers as strings. If you want to clean them up and normalize them outside of Solr, you can do that and feed them into Solr as pure numbers. How the phone numbers will be treated after you pump them into Solr depends on the analyzer you choose to use for this data. If you don't need to search on subsets of phone numbers, then just don't tokenize them (i.e. use string type if the phone numbers contain any non-numeric characters, sint otherwise). Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: dudes dudes To: solr-user@lucene.apache.org Sent: Monday, June 9, 2008 2:10:20 PM Subject: Solr system and numbers Hello experts, How does Solr deal with numbers or phone numbers .. For example if you have 1234 and 12 34 or 1 234... with spaces between the numbers .. Or this is dealt by lucene ? any documentations or tutorial on this ? many thanks, ak _ All new Live Search at Live.com http://clk.atdmt.com/UKM/go/msnnkmgl001006ukm/direct/01/ _ All new Live Search at Live.com http://clk.atdmt.com/UKM/go/msnnkmgl001006ukm/direct/01/
Re: Solr system and numbers
Doh, I forgot. Regular expressions worked well for me when I dealt with that problem many years ago. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Otis Gospodnetic [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Monday, June 9, 2008 5:36:34 PM Subject: Re: Solr system and numbers Not sure. Perhaps it can be done by training a language model and treating phone numbers as named entities? Not sure if it would work. But I know there are a few NLP people subscribed, maybe they'll have some good ideas. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Cam Bazz To: solr-user@lucene.apache.org Sent: Monday, June 9, 2008 4:24:48 PM Subject: Re: Solr system and numbers I got a similar question: how would one normalize or even detect if a string is a phone number? On Mon, Jun 9, 2008 at 4:17 PM, dudes dudes wrote: great info ,,, thanks a lot all Date: Mon, 9 Jun 2008 05:58:50 -0700 From: [EMAIL PROTECTED] Subject: Re: Solr system and numbers To: solr-user@lucene.apache.org Hi, Solr/Lucene can treat phone numbers as strings. If you want to clean them up and normalize them outside of Solr, you can do that and feed them into Solr as pure numbers. How the phone numbers will be treated after you pump them into Solr depends on the analyzer you choose to use for this data. If you don't need to search on subsets of phone numbers, then just don't tokenize them (i.e. use string type if the phone numbers contain any non-numeric characters, sint otherwise). Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: dudes dudes To: solr-user@lucene.apache.org Sent: Monday, June 9, 2008 2:10:20 PM Subject: Solr system and numbers Hello experts, How does Solr deal with numbers or phone numbers .. For example if you have 1234 and 12 34 or 1 234... with spaces between the numbers .. Or this is dealt by lucene ? any documentations or tutorial on this ? many thanks, ak _ All new Live Search at Live.com http://clk.atdmt.com/UKM/go/msnnkmgl001006ukm/direct/01/ _ All new Live Search at Live.com http://clk.atdmt.com/UKM/go/msnnkmgl001006ukm/direct/01/