Re: Boosting of words
Hi Arslan, Yes,I am using Solr as an input to carrot. Yes,I am using org.carrot2.source.solr.SolrDocumentSource just to cluster search results. Currently we are focusing to Solr search results only. In future we will focuse to clustered search results. Now i am using Solr 1.3. Regards Bhaskar --- On Sat, 10/17/09, AHMET ARSLAN iori...@yahoo.com wrote: From: AHMET ARSLAN iori...@yahoo.com Subject: Re: Boosting of words To: solr-user@lucene.apache.org Date: Saturday, October 17, 2009, 1:55 PM I am using Solr 1.3. I access Solr through carrot and use Java. What is the meaning of accessing solr through carrot? Are you using solr as an input to carrot? Using org.carrot2.source.solr.SolrDocumentSource just to cluster search results? Can we say that you are interested in clustered search results rather than search results them selfs? If yes solr 1.4 will have Grant Ingersoll's ClusteringComponent [1] which uses carrot2 to cluster search results. [1] http://wiki.apache.org/solr/ClusteringComponent
Re: Boosting of words
Hi, I am using Solr 1.3. I access Solr through carrot and use Java. Regards Bhaskar --- On Thu, 10/15/09, AHMET ARSLAN iori...@yahoo.com wrote: From: AHMET ARSLAN iori...@yahoo.com Subject: Re: Boosting of words To: solr-user@lucene.apache.org Date: Thursday, October 15, 2009, 8:58 AM Hi, I am able to see the results when i pass the values in the query browser. When i pass the below query i am able to see the difference in output. http://localhost:8983/solr/select/?q=java^100%20technology^1 Each time user cannot pass the values in the query browser to see the output. But where exactly java^100 technology^1 this value should be set. In which file and which location to be precise?. Please help me. Althought I do not understand you, you need to URL encode your parameter values before you invoke a HTTP GET. paramater=urlencode(value,UTF-8) Try this url : /select/?q=java%5E100+OR+technology%5E1version=2.2 Note that space is encoded into +. Also ^ is encoded into %5E. What kind of solr client are you using? How are you accessing to solr? From java, php, rubby?
Re: Boosting of words
Hi, I am able to see the results when i pass the values in the query browser. When i pass the below query i am able to see the difference in output. http://localhost:8983/solr/select/?q=java^100%20technology^1 Each time user cannot pass the values in the query browser to see the output. But where exactly java^100 technology^1 this value should be set.In which file and which location to be precise?. Please help me. Regards Bhaskar --- On Wed, 10/14/09, AHMET ARSLAN iori...@yahoo.com wrote: From: AHMET ARSLAN iori...@yahoo.com Subject: Re: Boosting of words To: solr-user@lucene.apache.org Date: Wednesday, October 14, 2009, 6:41 AM Hi Clark, Thanks for your input. I have a query. I have my XML which contains the following: add doc field name=urlhttp://www.sun.com/field field name=titleinformation/field field name=descriptionjava plays a important role in computer industry for web users/field /doc doc field name=urlhttp://www.askguru.com/field field name=titlehomepage/field field name=descriptionInformation about technology is stored in the web sites/field /doc doc field name=urlhttp://www.techie.com/field field name=titlepost queries/field field name=descriptionThis web site have more java technology related to web/field /doc /add When I give “java technology” as my input in Solr admin page ,At present I get output as doc field name=urlhttp://www.techie.com/field field name=titlepost queries/field field name=descriptionThis web site have more java technology related to web/field /doc Now I need to get doc which has “technology” also When I give “java technology “ I need to get output as,I need to give boosting to doc which has “technology”. It should display in the below order.The output should come as doc field name=urlhttp://www.techie.com/field field name=titlepost queries/field field name=descriptionThis web site have more java technology related to web/field /doc doc field name=urlhttp://www.askguru.com/field field name=titlehomepage/field field name=descriptionInformation about technology is stored in the web sites/field /doc doc field name=urlhttp://www.sun.com/field field name=titleinformation/field field name=descriptionjava plays a important role in computer industry for web users/field /doc Let me know how to achieve the same? The query : java^1 OR technology^100 will do it. Results will be in this order: 1-)This web site have more java technology related to web 2-)Information about technology is stored in the web sites 3-)java plays a important role in computer industry for web users 1-) contains both java and technology 2-) contains only technology 3-) contains only java Is that what you want? Note that there is no quotes in the query above. And you can adjust boost factors (1 and 100) according to your needs. Use OR operator between terms. You set individual terms boost with ^ operator. hope this helps.
Re: Boosting of words
Hi Clark, Thanks for your input.I have a query. I have my XML which contains the following: add doc field name=urlhttp://www.sun.com/field field name=titleinformation/field field name=descriptionjava plays a important role in computer industry for web users/field /doc doc field name=urlhttp://www.askguru.com/field field name=titlehomepage/field field name=descriptionInformation about technology is stored in the web sites/field /doc doc field name=urlhttp://www.techie.com/field field name=titlepost queries/field field name=descriptionThis web site have more java technology related to web/field /doc /add When I give “java technology” as my input in Solr admin page ,At present I get output as doc field name=urlhttp://www.techie.com/field field name=titlepost queries/field field name=descriptionThis web site have more java technology related to web/field /doc Now I need to get doc which has “technology” also When I give “java technology “ I need to get output as,I need to give boosting to doc which has “technology”. It should display in the below order.The output should come as doc field name=urlhttp://www.techie.com/field field name=titlepost queries/field field name=descriptionThis web site have more java technology related to web/field /doc doc field name=urlhttp://www.askguru.com/field field name=titlehomepage/field field name=descriptionInformation about technology is stored in the web sites/field /doc doc field name=urlhttp://www.sun.com/field field name=titleinformation/field field name=descriptionjava plays a important role in computer industry for web users/field /doc Let me know how to achieve the same? Regards Bhaskar --- On Tue, 10/13/09, Nicholas Clark clark...@gmail.com wrote: From: Nicholas Clark clark...@gmail.com Subject: Re: Boosting of words To: solr-user@lucene.apache.org Date: Tuesday, October 13, 2009, 1:01 PM Bhaskar, Read this page, specifically how to query data. http://lucene.apache.org/solr/tutorial.html#Querying+Data It sounds like you are very new to Solr, so I would also suggest reading the wiki. http://wiki.apache.org/solr/ -Nick On Mon, Oct 12, 2009 at 10:02 PM, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: Hi Nicholas, Thanks for your input.Where exactly the query q=product:red color:red^10 should be used and defined?. Help me. Regards Bhaskar --- On Mon, 10/12/09, Nicholas Clark clark...@gmail.com wrote: From: Nicholas Clark clark...@gmail.com Subject: Re: Boosting of words To: solr-user@lucene.apache.org Date: Monday, October 12, 2009, 2:13 PM The easiest way to boost your query is to modify your query string. q=product:red color:red^10 In the above example, I have boosted the color field. If red is found in that field, it will get a boost of 10. If it is only found in the product field, then there will be no boost. Here's more information: http://wiki.apache.org/solr/SolrRelevancyCookbook#Boosting_Ranking_Terms Once you're comfortable with that, I suggest that you look into using the DisMax request handler. It will allow you to easily search across multiple fields with custom boost values. http://wiki.apache.org/solr/DisMaxRequestHandler -Nick On Sun, Oct 11, 2009 at 12:26 PM, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: Hi, I would like to know how can i give boosting to search input in Solr. Where exactly should i make the changes?. Regards Bhaskar
Search results order
Hi, I have indexed my xml which contains the following data. add doc field name=urlhttp://www.yahoo.com /field field name=titleyahoomail/field field name=descriptionyahoo has various links and gives in detail about the all the links in it/field /doc doc field name=urlhttp://www.rediff.com/field field name=titleIt is a good website/field field name=descriptionRediff has a interesting homepage/field /doc doc field name=urlhttp://www.ndtv.com/field field name=titleNdtv has a variety of good links/field field name=descriptionThe homepage of Ndtv is very good/field /doc /add In my solr home page , when I search input as “good” It displays the docs which has “good” as highest occurrences by default. The output comes as follows. doc field name=urlhttp://www.ndtv.com/field field name=titleNdtv has a variety of good links/field field name=descriptionThe homepage of Ndtv is very good/field /doc doc field name=urlhttp://www.rediff.com/field field name=titleIt is a good website/field field name=descriptionRediff has a interesting homepage/field /doc If I need to display doc which has least occurrence of search input “good” as first result. What changes should I make in solrconfig file to achieve the same?. Any suggestions would be helpful. For me the output should come as below. doc field name=urlhttp://www.rediff.com/field field name=titleIt is a good website/field field name=descriptionRediff has a interesting homepage/field /doc doc field name=urlhttp://www.ndtv.com/field field name=titleNdtv has a variety of good links/field field name=descriptionThe homepage of Ndtv is very good/field /doc Regards Bhaskar
Re: Boosting of words
Hi Nicholas, Thanks for your input.Where exactly the query q=product:red color:red^10 should be used and defined?. Help me. Regards Bhaskar --- On Mon, 10/12/09, Nicholas Clark clark...@gmail.com wrote: From: Nicholas Clark clark...@gmail.com Subject: Re: Boosting of words To: solr-user@lucene.apache.org Date: Monday, October 12, 2009, 2:13 PM The easiest way to boost your query is to modify your query string. q=product:red color:red^10 In the above example, I have boosted the color field. If red is found in that field, it will get a boost of 10. If it is only found in the product field, then there will be no boost. Here's more information: http://wiki.apache.org/solr/SolrRelevancyCookbook#Boosting_Ranking_Terms Once you're comfortable with that, I suggest that you look into using the DisMax request handler. It will allow you to easily search across multiple fields with custom boost values. http://wiki.apache.org/solr/DisMaxRequestHandler -Nick On Sun, Oct 11, 2009 at 12:26 PM, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: Hi, I would like to know how can i give boosting to search input in Solr. Where exactly should i make the changes?. Regards Bhaskar
Boosting of words
Hi, I would like to know how can i give boosting to search input in Solr. Where exactly should i make the changes?. Regards Bhaskar
Re: Ranking of search results
Hi Amith, I tried with the options you gave and gave debug=true at the end of the URL. I am getting output as lst name=debug str name=rawquerystringchannel/str str name=querystringchannel/str str name=parsedquerytext:channel/str str name=parsedquery_toStringtext:channel/str - lst name=explain str name=http://hotmail;1.2682627 = (MATCH) fieldWeight(text:channel in 3), product of: 2.828427 = tf(termFreq(text:channel)=8) 2.049822 = idf(docFreq=6, numDocs=20) 0.21875 = fieldNorm(field=text, doc=3)/str str name=http://share;1.0026497 = (MATCH) fieldWeight(text:channel in 19), product of: 2.236068 = tf(termFreq(text:channel)=5) 2.049822 = idf(docFreq=6, numDocs=20) 0.21875 = fieldNorm(field=text, doc=19)/str str name=http://metacreek;0.6341314 = (MATCH) fieldWeight(text:channel in 10), product of: 1.4142135 = tf(termFreq(text:channel)=2) 2.049822 = idf(docFreq=6, numDocs=20) 0.21875 = fieldNorm(field=text, doc=10)/str str name=http://yahoo;0.5124555 = (MATCH) fieldWeight(text:channel in 0), product of: 1.0 = tf(termFreq(text:channel)=1) 2.049822 = idf(docFreq=6, numDocs=20) 0.25 = fieldNorm(field=text, doc=0)/str str name=http://sharemarket;0.4483986 = (MATCH) fieldWeight(text:channel in 1), product of: 1.0 = tf(termFreq(text:channel)=1) 2.049822 = idf(docFreq=6, numDocs=20) 0.21875 = fieldNorm(field=text, doc=1)/str str name=http://Altavista;0.4483986 = (MATCH) fieldWeight(text:channel in 5), product of: 1.0 = tf(termFreq(text:channel)=1) 2.049822 = idf(docFreq=6, numDocs=20) 0.21875 = fieldNorm(field=text, doc=5)/str /lst What does the numeric terms denotes?.With this numeric value will i be able to i set preference for my search links?.If so how?. Regards Bhaskar - On Thu, 10/1/09, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: From: bhaskar chandrasekar bas_s...@yahoo.co.in Subject: Re: Ranking of search results To: solr-user@lucene.apache.org Date: Thursday, October 1, 2009, 7:34 PM --- On Wed, 9/23/09, Amit Nithian anith...@gmail.com wrote: Hi Amith, Thanks for your reply.How do i set preference for the links , which should appear first,second in the search results. Which configuration file in Solr needs to be modified to achieve the same?. Regards Bhaskar From: Amit Nithian anith...@gmail.com Subject: Re: Ranking of search results To: solr-user@lucene.apache.org Date: Wednesday, September 23, 2009, 11:33 AM It depends on several things:1) The query handler that you are using 2) The fields that you are searching on and default fields specified For the default handler, it will issue a query for the default field and return results accordingly. To see what is going on pass the debugQuery=true to the end of the URL to see detailed output. If you are using the DisMaxHandler (DisJoint Max) then you will have a qf, pf and bf (query fields, phrase fields, boosting function). I would start looking at http://wiki.apache.org/solr/DisMaxRequestHandler http://wiki.apache.org/solr/DisMaxRequestHandler- Amit On Wed, Sep 23, 2009 at 10:25 AM, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: Hi, When i give a input string for search in Solr , it displays me the corresponding results for the given input string. How the results are ranked and displayed.On what basis the search results are displayed. Is there any algorithm followed for displaying the results with first result and so on. Regards Bhaskar
Solr configuration file
Hi, In my Solrconfig file, can any one let me know what the below does str name=qf and st name=mm represents?. in the below mentioned. requestHandler name=partitioned class=solr.SearchHandler lst name=defaults str name=defTypedismax/str str name=echoParamsexplicit/str str name=qftext^0.5 features^1.0 name^1.2 sku^1.5 id^10.0/str str name=mm2lt;-1 5lt;-2 6lt;90%/str str name=bqincubationdate_dt:[* TO NOW/DAY-1MONTH]^2.2/str /lst ex:text^0.5 features^1.0 name^1.2 sku^1.5 id^10.0.What ^ and numeric part represents?. Does it have any thing to do with ranking of web pages?. Regards Bhaskar
Re: Ranking of search results
--- On Wed, 9/23/09, Amit Nithian anith...@gmail.com wrote: Hi Amith, Thanks for your reply.How do i set preference for the links , which should appear first,second in the search results. Which configuration file in Solr needs to be modified to achieve the same?. Regards Bhaskar From: Amit Nithian anith...@gmail.com Subject: Re: Ranking of search results To: solr-user@lucene.apache.org Date: Wednesday, September 23, 2009, 11:33 AM It depends on several things:1) The query handler that you are using 2) The fields that you are searching on and default fields specified For the default handler, it will issue a query for the default field and return results accordingly. To see what is going on pass the debugQuery=true to the end of the URL to see detailed output. If you are using the DisMaxHandler (DisJoint Max) then you will have a qf, pf and bf (query fields, phrase fields, boosting function). I would start looking at http://wiki.apache.org/solr/DisMaxRequestHandler http://wiki.apache.org/solr/DisMaxRequestHandler- Amit On Wed, Sep 23, 2009 at 10:25 AM, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: Hi, When i give a input string for search in Solr , it displays me the corresponding results for the given input string. How the results are ranked and displayed.On what basis the search results are displayed. Is there any algorithm followed for displaying the results with first result and so on. Regards Bhaskar
Exact match
Hi, I am doing exact search in Solr .In Solr admin page I am giving the search input string for search. For ex: I am giving “channeL12” as search input string in solr home page it displays search results as doc str name=urlhttp://rediff/field str name=titlefirst/field str name=descriptionchanneL12/field /doc As there is a matching input for “channeL12”. If I give “channel12” as search input string with L in lower case I am not getting any search results. In fact I changed ignoreCase =”true” in schema.xml schema.xml analyzer type=query tokenizer class=solr.WhitespaceTokenizerFactory ignoreCase=true / /analyzer I want to ignore casesensitive search in my search results. Please let me know if I need to make changes any where else or what to do to achieve the desired output. Regards Bhaskar
Re: Exact word search in Solr
Hi, I followed with the below metioned steps and i am not getting the desired results. Let me know if anything else to be done. Regards Bhaskar --- On Fri, 9/18/09, AHMET ARSLAN iori...@yahoo.com wrote: From: AHMET ARSLAN iori...@yahoo.com Subject: Re: Exact word search in Solr To: solr-user@lucene.apache.org Date: Friday, September 18, 2009, 2:59 AM Hi, I am doing exact word search in Solr 1.3 and I am not getting the expected results. I am giving you the sample XML file along with the mail from where search results are fetched. The following steps were followed to achieve exact word search result in Solr. You can simply use the fieldType below to achieve this: fieldType name=text_ws class=solr.TextField positionIncrementGap=100 analyzer tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.LowerCaseFilterFactory/ /analyzer /fieldType Note that there is no WordDelimiterFilterFactory in this type. But probably yours has it. Hope this helps.
Exact word search in Solr
Hi, I am doing exact word search in Solr 1.3 and I am not getting the expected results. I am giving you the sample XML file along with the mail from where search results are fetched. The following steps were followed to achieve exact word search result in Solr. 1) Schema.xml is configured for title, url and description field name=url type=string indexed=true stored=true required=true/ field name=title type=text indexed=true stored=true required=true / field name=description type=text indexed=true stored=true required=true/ Commented below lines !--filter class=solr.EnglishPorterFilterFactory protected=protwords.txt/-- !--filter class=solr.EnglishPorterFilterFactory protected=protwords.txt/-- 2) Started Solr server 3) Indexed sample data with title, url description 5) Assume I am giving (say channelone) as my input search string for exact word search in Solr admin page. I am getting the following output.It sould show output pertaining to channelone only.It should not display combination of words with “channelone”.I am not looking for case sensitive search here. doc field name=urlhttp://c2search1/contactus3.html/field field name=titlec2Search1: Contactus3/field field name=descriptionchannelOne/field /doc - doc field name=urlhttp://c2search1/contactus4.html/field field name=titlec2Search1: Contactus4/field field name=descriptionChannelone/field /doc - doc field name=urlhttp://c2search1/contactus5.html/field field name=titlec2Search1: Contactus5/field field name=descriptionchannel...@$/field /doc - doc field name=urlhttp://c2search1/contactus6.html/field field name=titlec2Search1: Contactus6/field field name=descriptionchannelon...@$/field /doc - doc field name=urlhttp://c2search1/contactus7.html/field field name=titlec2Search1: Contactus7/field field name=descriptionchannelon...@$ab/field /doc Expected Result doc field name=urlhttp://c2search1/contactus3.html/field field name=titlec2Search1: Contactus3/field field name=descriptionchannelOne/field /doc - doc field name=urlhttp://c2search1/contactus4.html/field field name=titlec2Search1: Contactus4/field field name=descriptionChannelone/field /doc Please help me with the above scenario to achieve the desired output. Regards Bhaskar
Re: Exact Word Search
Hi Shalin, My search is based on the following fields in schema.xml field name=url type=string indexed=true stored=true/ field name=content type=text indexed=true stored=true/ field name=description type=string indexed=true stored=true/ Let me know if you need anything else? Regards Bhaskar --- On Fri, 9/4/09, Shalin Shekhar Mangar shalinman...@gmail.com wrote: From: Shalin Shekhar Mangar shalinman...@gmail.com Subject: Re: Exact Word Search To: solr-user@lucene.apache.org Date: Friday, September 4, 2009, 5:51 AM On Fri, Sep 4, 2009 at 6:06 PM, bhaskar chandrasekar bas_s...@yahoo.co.inwrote: Hi, I have integrated Solr with Carrot2 Cluster Engine (v 3.1.0). Carrot2 is used as a presentation layer. Carrot2 sends requested query to external source (Solr) and get results from Solr. Carrot2 may not be responsible for forming Query. It would have been handled from Solr end. Can you post the exact query that your application or Carrot2 is sending to Solr? Can you also list the Solr field and type defined in schema.xml which is being searched? Please help me with the below scenarios. Scenario: (Please DO NOT consider any case sensitive) Assuming I give bhaskar as input string It should give me search results pertaining to word ‘bhaskar’ only. I am expecting output like below database query Select * from MASTER where name =’bhaskar’; Above query suppose to return matched records for ‘bhaskar’.. Use a solr.TextField with KeywordTokenizer and LowerCaseFilter and search with q=field-name:field-value -- Regards, Shalin Shekhar Mangar.
Re: Exact Word Search
Hi Shalin, Where on in which file should i set the values you have mentioned?. Let me know how to set it. Regards Bhaskar --- On Fri, 9/4/09, Shalin Shekhar Mangar shalinman...@gmail.com wrote: From: Shalin Shekhar Mangar shalinman...@gmail.com Subject: Re: Exact Word Search To: solr-user@lucene.apache.org Date: Friday, September 4, 2009, 1:47 AM If you want to do a exact match (case sensitive) in Solr, you should have a string type field and the query should be made as fieldname:value However, reading your mail I get the feeling that the query is actually being made by Carrot2. On Fri, Sep 4, 2009 at 7:41 AM, bhaskar chandrasekar bas_s...@yahoo.co.inwrote: Hi shalin, Thanks for your reply. I am not sure as how the query is formed in Solr. If you could throw some light on this , it will be helpful. Is it achievable?. Regards Bhaskar --- On Thu, 9/3/09, Shalin Shekhar Mangar shalinman...@gmail.com wrote: From: Shalin Shekhar Mangar shalinman...@gmail.com Subject: Re: Exact Word Search To: solr-user@lucene.apache.org Date: Thursday, September 3, 2009, 5:14 AM On Thu, Sep 3, 2009 at 1:33 PM, bhaskar chandrasekar bas_s...@yahoo.co.inwrote: Hi, Can any one help me with the below scenario?. Scenario : I have integrated Solr with Carrot2. The issue is Assuming i give bhaskar as input string for search. It should give me search results pertaining to bhaskar only. Example: It should not display search results as chandarbhaskar or bhaskarc. Basically search should happen based on the exact word match. I am not bothered about case sensitive here How to achieve the above Scenario in Carrot2 ?. Bhaskar, I think this question is better suited for the Carrot mailing lists. Unless you yourself control how the solr query is created, we will not be able to help you. -- Regards, Shalin Shekhar Mangar. -- Regards, Shalin Shekhar Mangar.
Re: Exact Word Search
Hi, I have integrated Solr with Carrot2 Cluster Engine (v 3.1.0). Carrot2 is used as a presentation layer. Carrot2 sends requested query to external source (Solr) and get results from Solr. Carrot2 may not be responsible for forming Query. It would have been handled from Solr end. Please help me with the below scenarios. Scenario: (Please DO NOT consider any case sensitive) Assuming I give bhaskar as input string It should give me search results pertaining to word ‘bhaskar’ only. I am expecting output like below database query Select * from MASTER where name =’bhaskar’; Above query suppose to return matched records for ‘bhaskar’.. My Carrot2 search result should have similar out. It should not display search results prefixed or suffixed with bhaskar. Regards Bhaskar --- On Fri, 9/4/09, Shalin Shekhar Mangar shalinman...@gmail.com wrote: From: Shalin Shekhar Mangar shalinman...@gmail.com Subject: Re: Exact Word Search To: solr-user@lucene.apache.org Date: Friday, September 4, 2009, 1:47 AM If you want to do a exact match (case sensitive) in Solr, you should have a string type field and the query should be made as fieldname:value However, reading your mail I get the feeling that the query is actually being made by Carrot2. On Fri, Sep 4, 2009 at 7:41 AM, bhaskar chandrasekar bas_s...@yahoo.co.inwrote: Hi shalin, Thanks for your reply. I am not sure as how the query is formed in Solr. If you could throw some light on this , it will be helpful. Is it achievable?. Regards Bhaskar --- On Thu, 9/3/09, Shalin Shekhar Mangar shalinman...@gmail.com wrote: From: Shalin Shekhar Mangar shalinman...@gmail.com Subject: Re: Exact Word Search To: solr-user@lucene.apache.org Date: Thursday, September 3, 2009, 5:14 AM On Thu, Sep 3, 2009 at 1:33 PM, bhaskar chandrasekar bas_s...@yahoo.co.inwrote: Hi, Can any one help me with the below scenario?. Scenario : I have integrated Solr with Carrot2. The issue is Assuming i give bhaskar as input string for search. It should give me search results pertaining to bhaskar only. Example: It should not display search results as chandarbhaskar or bhaskarc. Basically search should happen based on the exact word match. I am not bothered about case sensitive here How to achieve the above Scenario in Carrot2 ?. Bhaskar, I think this question is better suited for the Carrot mailing lists. Unless you yourself control how the solr query is created, we will not be able to help you. -- Regards, Shalin Shekhar Mangar. -- Regards, Shalin Shekhar Mangar.
Exact Word Search
Hi, Can any one help me with the below scenario?. Scenario : I have integrated Solr with Carrot2. The issue is Assuming i give bhaskar as input string for search. It should give me search results pertaining to bhaskar only. Example: It should not display search results as chandarbhaskar or bhaskarc. Basically search should happen based on the exact word match. I am not bothered about case sensitive here How to achieve the above Scenario in Carrot2 ?. Regards Bhaskar
Re: Exact Word Search
Hi shalin, Thanks for your reply. I am not sure as how the query is formed in Solr. If you could throw some light on this , it will be helpful. Is it achievable?. Regards Bhaskar --- On Thu, 9/3/09, Shalin Shekhar Mangar shalinman...@gmail.com wrote: From: Shalin Shekhar Mangar shalinman...@gmail.com Subject: Re: Exact Word Search To: solr-user@lucene.apache.org Date: Thursday, September 3, 2009, 5:14 AM On Thu, Sep 3, 2009 at 1:33 PM, bhaskar chandrasekar bas_s...@yahoo.co.inwrote: Hi, Can any one help me with the below scenario?. Scenario : I have integrated Solr with Carrot2. The issue is Assuming i give bhaskar as input string for search. It should give me search results pertaining to bhaskar only. Example: It should not display search results as chandarbhaskar or bhaskarc. Basically search should happen based on the exact word match. I am not bothered about case sensitive here How to achieve the above Scenario in Carrot2 ?. Bhaskar, I think this question is better suited for the Carrot mailing lists. Unless you yourself control how the solr query is created, we will not be able to help you. -- Regards, Shalin Shekhar Mangar.
Re: Pattern matching in Solr
Hi, In Schema.xml file,I am not able ot find splitOnCaseChange=1. I am not looking for case sensitive search. Let me know what file you are refering to?. I am looking for exact match search only Moreover for scenario 2 the KeywordTokenizerFactory and EdgeNGramFilterFactory refers which link in Solr wiki. Regards Bhaskar --- On Wed, 8/26/09, Avlesh Singh avl...@gmail.com wrote: From: Avlesh Singh avl...@gmail.com Subject: Re: Pattern matching in Solr To: solr-user@lucene.apache.org Date: Wednesday, August 26, 2009, 11:31 AM You could have used your previous thread itself ( http://www.lucidimagination.com/search/document/31c1ebcedd4442b/exact_pattern_search_in_solr), Bhaskar. In your scenario one, you need an exact token match, right? You are getting expected results if your field type is text. Look for the WordDelimiterFilterFactory in your field type definition for the text field inside schema.xml. You'll find an attribute splitOnCaseChange=1. Because of this, ChandarBhaskar is converted into two tokens Chandra and Bhaskar and hence the matches. You may choose to remove this attribute if the behaviour is not desired. For your scenario two, you may want to look at the KeywordTokenizerFactory and EdgeNGramFilterFactory on Solr wiki. Generally, for all such use cases people create multiple fields in their schema storing the same data analyzed in different ways. Cheers Avlesh On Wed, Aug 26, 2009 at 10:58 PM, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: Hi, Can any one help me with the below scenario?. Scenario 1: Assume that I give Google as input string i am using Carrot with Solr Carrot is for front end display purpose the issue is Assuming i give BHASKAR as input string It should give me search results pertaining to BHASKAR only. Select * from MASTER where name =Bhaskar; Example:It should not display search results as ChandarBhaskar or BhaskarC. Should display Bhaskar only. Scenario 2: Select * from MASTER where name like %BHASKAR%; It should display records containing the word BHASKAR Ex: Bhaskar ChandarBhaskar BhaskarC Bhaskarabc How to achieve Scenario 1 in Solr ?. Regards Bhaskar __ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com
Count of records
Hi, We have integrated Solr index with Carrot2 Search Engine and able to get search results. In my search results page, by default Total Number of records matched for the particular query is not getting displayed. http://localhost:8089/carrot2-webapp-3.0.1/search?source=Solrview=treeskin=simplequery=javaresults=100algorithm=lingoSolrDocumentSource.solrTitleFieldName=title SolrDocumentSource.solrSummaryFieldName=descriptionSolrDocumentSource.solrUrlFieldName=url Currently I am getting like, Results 1 - 100 of about 100 for java Consider I searched for Java; In my Solr index, total number of matches found are 1000. I am interested to display only top 100 results. I should also get total match for the search query. Display should be similar to below: Results 1 - 100 of about 1000 for java Regards Bhaskar
Re: Pattern matching in Solr
Hi, In Schema.xml file,I am not able ot find splitOnCaseChange=1. I am not looking for case sensitive search. Let me know what file you are refering to?. I am looking for exact match search only Moreover for scenario 2 the KeywordTokenizerFactory and EdgeNGramFilterFactory refers which link in Solr wiki. Regards Bhaskar --- On Thu, 8/27/09, Avlesh Singh avl...@gmail.com wrote: From: Avlesh Singh avl...@gmail.com Subject: Re: Pattern matching in Solr To: solr-user@lucene.apache.org Date: Thursday, August 27, 2009, 2:10 AM In Schema.xml file,I am not able ot find splitOnCaseChange=1. Unless you have modified the stock field type definition of text field in your core's schema.xml you should be able to find this property set for the WordDelimiterFilterFactory. Read more here - http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#head-1c9b83870ca7890cd73b193cefed83c283339089 Moreover for scenario 2 the KeywordTokenizerFactory and EdgeNGramFilterFactory refers which link in Solr wiki. Google for these two. Cheers Avlesh On Thu, Aug 27, 2009 at 12:21 PM, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: Hi, In Schema.xml file,I am not able ot find splitOnCaseChange=1. I am not looking for case sensitive search. Let me know what file you are refering to?. I am looking for exact match search only Moreover for scenario 2 the KeywordTokenizerFactory and EdgeNGramFilterFactory refers which link in Solr wiki. Regards Bhaskar --- On Wed, 8/26/09, Avlesh Singh avl...@gmail.com wrote: From: Avlesh Singh avl...@gmail.com Subject: Re: Pattern matching in Solr To: solr-user@lucene.apache.org Date: Wednesday, August 26, 2009, 11:31 AM You could have used your previous thread itself ( http://www.lucidimagination.com/search/document/31c1ebcedd4442b/exact_pattern_search_in_solr ), Bhaskar. In your scenario one, you need an exact token match, right? You are getting expected results if your field type is text. Look for the WordDelimiterFilterFactory in your field type definition for the text field inside schema.xml. You'll find an attribute splitOnCaseChange=1. Because of this, ChandarBhaskar is converted into two tokens Chandra and Bhaskar and hence the matches. You may choose to remove this attribute if the behaviour is not desired. For your scenario two, you may want to look at the KeywordTokenizerFactory and EdgeNGramFilterFactory on Solr wiki. Generally, for all such use cases people create multiple fields in their schema storing the same data analyzed in different ways. Cheers Avlesh On Wed, Aug 26, 2009 at 10:58 PM, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: Hi, Can any one help me with the below scenario?. Scenario 1: Assume that I give Google as input string i am using Carrot with Solr Carrot is for front end display purpose the issue is Assuming i give BHASKAR as input string It should give me search results pertaining to BHASKAR only. Select * from MASTER where name =Bhaskar; Example:It should not display search results as ChandarBhaskar or BhaskarC. Should display Bhaskar only. Scenario 2: Select * from MASTER where name like %BHASKAR%; It should display records containing the word BHASKAR Ex: Bhaskar ChandarBhaskar BhaskarC Bhaskarabc How to achieve Scenario 1 in Solr ?. Regards Bhaskar __ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com
Pattern matching in Solr
Hi, Can any one help me with the below scenario?. Scenario 1: Assume that I give Google as input string i am using Carrot with Solr Carrot is for front end display purpose the issue is Assuming i give BHASKAR as input string It should give me search results pertaining to BHASKAR only. Select * from MASTER where name =Bhaskar; Example:It should not display search results as ChandarBhaskar or BhaskarC. Should display Bhaskar only. Scenario 2: Select * from MASTER where name like %BHASKAR%; It should display records containing the word BHASKAR Ex: Bhaskar ChandarBhaskar BhaskarC Bhaskarabc How to achieve Scenario 1 in Solr ?. Regards Bhaskar
Total count of records
Hi, When Solr retrives records based on a input match , it gives total count of records. Say for Ex , it displays like : 1 out of 20,000 for the particular search string. How the total count of records are fetched in Solr , does it refer any Schema or XML file?. Regards Bhaskar
Exact pattern search in Solr
Hi, Can any one helpe me with the below scenario?. Scenario 1: Assume that I give Google as input string i am using Carrot with Solr Carrot is for front end display purpose the issue is Assuming i give BHASKAR as input string It should give me search results pertaining to BHASKAR only. Select * from MASTER where name =Bhaskar; Example:It should not display search results as ChandarBhaskar or BhaskarC. Should display Bhaskar only. Scenario 2: Select * from MASTER where name like %BHASKAR%; It should display records containing the word BHASKAR Ex: Bhaskar ChandarBhaskar BhaskarC Bhaskarabc How to achieve Scenario 1 in Solr ?. .. Regards Bhaskar
Exact word search
Hi, Can any one helpe me with the below scenario?. Scenario 1: Assume that I give Google as input string i am using Carrot with Solr Carrot is for front end display purpose the issue is Assuming i give BHASKAR as input string It should give me search results pertaining to BHASKAR only. Select * from MASTER where name =Bhaskar; Example:It should not display search results as ChandarBhaskar or BhaskarC. Should display Bhaskar only. Scenario 2: Select * from MASTER where name like %BHASKAR%; It should display records containing the word BHASKAR Ex: Bhaskar ChandarBhaskar BhaskarC Bhaskarabc How to achieve Scenario 1 in Solr ?. ... Regards Bhaskar
Fetching Query Results from Solr
Hi, I am using Solr to retrive records and display them thru carrot. Assuming that i give Google as search , it will display me all the records relating to that. It should give me exact word search only. It shld not look for combination of words in the search query and display them. How can i restrict Solr to achieve the same. Regards Bhaskar
Re: Fetching Query Results from Solr
Hi, Which Java class needs to be modified to get the exact search results in Solr. either 1) SearchServlet.java 2) SolrQuerySession.java 3) SolrQuery.java and where it should be modified. Scenario: I am using Solr to retrive records and display them thru carrot. Assuming that i give Google as search , it will display me all the records relating to that. It should give me exact word search only. It shld not look for combination of words in the search query and display them. How can i restrict Solr to achieve the same. Regards Bhaskar --- On Thu, 8/20/09, bhaskar chandrasekar bas_s...@yahoo.co.in wrote: From: bhaskar chandrasekar bas_s...@yahoo.co.in Subject: Fetching Query Results from Solr To: solr-user@lucene.apache.org Date: Thursday, August 20, 2009, 2:48 AM Hi, I am using Solr to retrive records and display them thru carrot. Assuming that i give Google as search , it will display me all the records relating to that. It should give me exact word search only. It shld not look for combination of words in the search query and display them. How can i restrict Solr to achieve the same. Regards Bhaskar
Results from Solr
Hi, Can some one help me with the below situation? To elaborate more on this. Assuming i give BHASKAR as input string. Scenario 1: It should give me search results pertaining to BHASKAR only. Select * from MASTER where name =Bhaskar; Example:It should not display search results as ChandarBhaskar or BhaskarC. Should display Bhaskar only. Scenario 2: Select * from MASTER where name like %BHASKAR%; It should display records containing the word BHASKAR Ex: Bhaskar ChandarBhaskar BhaskarC Bhaskarabc How to achieve Scenario 1 in Solr ?. Thanks Bhaskar