I've posted a few questions on synonyms before, finally understood how they
work, and settled on index-time synonyms. They seem to work much better than
query-time synonyms. But now at my work, they have a special request. They
want certain synonyms to be applied only to certain sections of the
That's right. Simple. I can very well do that. Why didn't I think of it.
Thanks.
rswart wrote:
What is stopping you from defining different field types for faqs and
attorneys? One with index-time synonyms and one without.
anuvenk wrote:
I've posted a few questions on synonyms
could bring out some wrong match as the one
noticed by Anuvenk
for instance
dui = drunk driving defense
or
dui,drunk driving defense,drunk driving law
query for dui matches dui = drunk driving defense and dui,drunk
driving defense,drunk driving law
in order to prevent this kind
the number of
synonyms unless you are turning a single term into dozens of ORed terms.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
From: anuvenk anuvenkat...@hotmail.com
To: solr-user@lucene.apache.org
Sent: Tuesday, June 2, 2009 11:28
A small addition to my earlier post. I wonder if it's because of the 'mm'
param, which requires that for search phrases of up to 3 words, all the words
must be matched. If I alter this now, I'd get irrelevant results for a
lot of popular 1, 2, 3 word search terms. How can I solve this?
anuvenk
?
Shalin Shekhar Mangar wrote:
On Tue, Jun 2, 2009 at 12:53 AM, anuvenk anuvenkat...@hotmail.com wrote:
title state
dui faq1 california
dui faq2 florida
dui faq3 federal
Now I want to be able to return federal results
In my index I have legal FAQs, forms, legal videos etc. with a state field for
each resource.
Now if I search for real estate san diego, I want to be able to return other
'california' results, i.e. results from san francisco.
I have the following fields in the index
title
.
anuvenk wrote:
In my index I have legal FAQs, forms, legal videos etc. with a state field
for each resource.
Now if I search for real estate san diego, I want to be able to return
other 'california' results, i.e. results from san francisco.
I have the following fields in the index
title
Hello,
I'm using the dismax handler for the phrase matching. I have a few legal
resources in my index in the following format for example
title state
dui faq1 california
dui faq2 florida
dui faq3 federal
Somebody please help clear up this doubt. What more could I do with the dismax
handler so that documents in which 'word1', 'word2', 'word3' etc. of a search
phrase are not within 5 words of one another do not come up in the
results?
anuvenk wrote:
From the solr wiki, it sounded like if qs
somewhere. I thought if there is a way to just not return any results if
they are not within close proximity, it would be great.
Yonik Seeley wrote:
On Sun, Nov 23, 2008 at 11:51 PM, anuvenk [EMAIL PROTECTED]
wrote:
Please help someone...i've been waiting for an answer for the last couple
From the solr wiki, it sounded like if qs is set to 5 for example, if the
search term is 'child custody', only docs with 'child' 'custody' within 5
words of one another would be returned in results. Is this correct? If so,
it doesn't seem to be working for me. I see docs with 'child' 'custody'
all
states are returned although the user specifically asked for a florida will. I
don't want to alter the 'mm' either because it's working fine for other
searches. Just for the search terms with a 'location', I want to be able to
match all words. Is there an easy way to do this? Someone, please?
anuvenk wrote
I use the 'dismax handler' for my phrase matching. And i have the 'mm' set
this way:
Up to 3 words, match all
up to 4, match 3
up to 4, match 3 so on
It's been working fine, but for certain phrases like 'san diego drunk driving
defense attorney', it brings up dui attorneys for other cities first
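For readers trying to picture what an 'mm' rule of this kind does, here is a minimal sketch of how a conditional minimum-should-match rule could be evaluated. The rule format (a list of `(threshold, adjustment)` pairs) and the function name `required_matches` are assumptions made for illustration. This is not Solr's own parser, and only integer adjustments are modeled, not percentages.

```python
def required_matches(num_terms, rules):
    """Return how many query terms must match under a simplified 'mm' rule set.

    rules is a list of (threshold, adjustment) pairs (hypothetical format).
    A rule applies when num_terms is greater than its threshold; if several
    apply, the one with the largest threshold wins. A negative adjustment is
    the number of terms allowed to be missing, a positive one is the number
    of terms required. If no rule applies, every term is required.
    """
    applicable = [adj for threshold, adj in sorted(rules) if num_terms > threshold]
    if not applicable:
        return num_terms
    adjustment = applicable[-1]
    required = num_terms + adjustment if adjustment < 0 else adjustment
    # Clamp to a sensible range: at least one term, at most all of them.
    return max(1, min(num_terms, required))


# "Up to 3 words, match all; above that, one word may be missing."
rules = [(3, -1)]
for n in (2, 3, 4, 6):
    print(n, "terms ->", required_matches(n, rules), "must match")
```

Under this assumed rule, a six-word phrase such as 'san diego drunk driving defense attorney' needs only five of its words to match, which is one way a document that lacks the city name could still be returned.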
Question about sorting with Solr. I want to group results in a certain sort
order so I can split them and display them in tabs easily.
I want to be able to have a custom sort order instead of sort=cat asc score
desc
In the above-mentioned way, categories are grouped in ascending order. But I
want certain
. I
appreciate your help !!
scott.tabar wrote:
Anuvenk,
Sorry for this Third email, but I was reading your question below and I
think it warrants yet another reply.
Just some background from my focus and involvement, and hence the
generation of the JavaDocs. I was primarily interested
Thanks. But i'm looking at this
http://.../spellchecker?indent=on&onlyMorePopular=true&accuracy=.6&suggestionCount=20&q=facial+salophosphoprotein
on
http://lucene.apache.org/solr/api/org/apache/solr/handler/SpellCheckerRequestHandler.html
It seems to return results (well in the example)
with and
I followed your instructions exactly, but I still have trouble with multi-word
queries.
For example, q=grapics returns 'graphics',
but q=grapics card returns nothing.
I even tried the latest nightly build, but that didn't solve the problem. Is
any solution available?
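One possible client-side workaround for the multi-word case is to split the query and correct each word separately, then rejoin the result. The sketch below illustrates only that idea. It uses Python's standard `difflib` instead of the Solr spellchecker, and the vocabulary list and the function name `correct_query` are made up for the example.

```python
import difflib

# Hypothetical vocabulary; in practice this would come from the indexed terms.
VOCABULARY = ["graphics", "card", "video", "memory"]


def correct_query(query, vocabulary=VOCABULARY, cutoff=0.6):
    """Correct each whitespace-separated word on its own and rejoin them."""
    corrected = []
    for term in query.split():
        matches = difflib.get_close_matches(term.lower(), vocabulary, n=1, cutoff=cutoff)
        # Keep the original word when nothing in the vocabulary is close enough.
        corrected.append(matches[0] if matches else term)
    return " ".join(corrected)


print(correct_query("grapics card"))  # graphics card
```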
scott.tabar wrote:
Matthew,
Thanks
Here is what it means by injecting at query time:
This is the text field definition i have in my schema
<fieldType name="text" class="solr.TextField" positionIncrementGap="100">
<analyzer type="index">
<tokenizer class="solr.WhitespaceTokenizerFactory"/>
<filter
I have a hard time understanding the synonym behaviour, especially because I
don't have the synonym filter at index time.
If I have this synonym at index time
Alternative Sentence,Probation before Judgement,Pretrial Diversion
do all occurrences of 'alternative sentence' also get indexed as
I need to understand this synonym behaviour
I have this synonym
divorce mediation,alternative dispute resolution
so when i do a debug this is the parsedquery_tostring i see:
(((text:divorc^0.8 | name:divorc^2.0)~0.01 (text:mediat^0.8 |
name:mediat^2.0)~0.01)~2) (text:(divorc altern) (disput
- Nutch
- Original Message
From: anuvenk [EMAIL PROTECTED]
To: solr-user@lucene.apache.org
Sent: Monday, January 21, 2008 12:35:52 AM
Subject: Re: spellcheckhandler
I followed the steps outlined in
http://wiki.apache.org/solr/SpellCheckerRequestHandler
with regards to setting
Gospodnetic wrote:
You don't need to wait for 1.3 to be released - you can simply use a
recent nightly build.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
From: anuvenk [EMAIL PROTECTED]
To: solr-user@lucene.apache.org
Sent: Monday, January
!!
Otis Gospodnetic wrote:
You don't need to wait for 1.3 to be released - you can simply use a
recent nightly build.
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
From: anuvenk [EMAIL PROTECTED]
To: solr-user@lucene.apache.org
Sent
When will this be released? Where can I find the list of
improvements/enhancements in 1.3, if it's been documented already?
--
View this message in context:
http://www.nabble.com/solr-1.3-tp14989395p14989395.html
Sent from the Solr - User mailing list archive at Nabble.com.
the trunk and I can tell you
it's quite stable. The only problem is that the documentation sometimes doesn't
cover the latest changes in the code. You'll probably have to dig into the
code
itself or post a question here and many people will be happy to help you.
On Jan 21, 2008 12:07 PM, anuvenk [EMAIL
Could you please let me know the location from where i can get it.
climbingrose wrote:
I'm using code pulled directly from Subversion.
On Jan 21, 2008 12:34 PM, anuvenk [EMAIL PROTECTED] wrote:
Thanks. Would this be the latest code from the trunk that you mentioned?
http
What are term vectors? How do they help with mlt?
--
View this message in context:
http://www.nabble.com/Term-vector-tp14990408p14990408.html
http://wiki.apache.org/solr/UpdateXmlMessages
Is this what you are looking for? Index the document again and it should
overwrite the older one with the same id.
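As a rough illustration of the re-indexing approach, the sketch below builds an `<add>` update message of the kind described on the UpdateXmlMessages wiki page. The field names and values are assumptions for the example (it presumes `id` is the unique key in the schema), and `build_add_message` is a hypothetical helper, not part of any Solr client.

```python
import xml.etree.ElementTree as ET


def build_add_message(fields):
    """Build an <add><doc>...</doc></add> XML update message from a dict."""
    add = ET.Element("add")
    doc = ET.SubElement(add, "doc")
    for name, value in fields.items():
        field = ET.SubElement(doc, "field", name=name)
        field.text = str(value)
    return ET.tostring(add, encoding="unicode")


# Sending a document whose id already exists is expected to replace the old one.
print(build_add_message({"id": "faq1", "title": "dui faq1", "state": "california"}))
```

The message would then be posted to the update handler and followed by a commit before the change becomes visible to searches.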
Gavin-39 wrote:
Hi,
Can someone point me to a location where it describes how to update an
already indexed document? I was
Is it possible to add a spell check component so I don't have to issue a
separate request to Solr to do the spell checking? Sorry if this question is
naive; I am just learning to use Solr.
<searchComponent name="spellcheck"
class="org.apache.solr.handler.component.SpellCheckComponent"/>
and add it
' and 'name' to
build my spell index?
How to use it effectively for spell checking on multi-word queries?
anuvenk wrote:
Is it possible to implement something like this with the spellcheckhandler,
like Google does?
Say I search for 'chater 13 bakrupcy',
should be able to display
How does adding a phrase slop in the handler help?
I tried ps=25 along with some pf values. I assumed it means the following: for
a search term such as 'child custody battle', documents which have the
words 'child', 'custody', 'battle' within 25 words of one another will rank
high. Is that correct?
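As far as I understand the dismax handler, pf together with ps only boosts documents where the terms occur close together; it does not remove documents that fail the proximity test. A small sketch of how such a request could be assembled follows. The base URL is a placeholder, the field names and boosts are borrowed from the debug output elsewhere in this thread, and `defType=dismax` is an assumption (older setups select the handler with `qt` instead).

```python
from urllib.parse import urlencode


def build_dismax_url(base_url, query, qf, pf, ps, mm=None):
    """Assemble a dismax select URL with phrase-boost parameters."""
    params = {
        "defType": "dismax",   # assumption: handler chosen via defType
        "q": query,
        "qf": qf,              # fields searched for the individual terms
        "pf": pf,              # fields used for the phrase boost
        "ps": ps,              # slop allowed for that phrase boost
        "debugQuery": "true",  # include scoring explanations in the response
    }
    if mm is not None:
        params["mm"] = mm
    return f"{base_url}?{urlencode(params)}"


# Placeholder host and path, not a real deployment.
print(build_dismax_url("http://localhost:8983/solr/select",
                       "child custody battle",
                       qf="text^0.8 name^2.0", pf="text name", ps=25))
```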
I understand tf means term frequency. For example, if the search term is 'chapter
7', does tf mean how frequently 'chapter 7' occurs in the docs? Does it take
into account the total number of words in a doc to determine frequency?
Also, what are idf, fieldNorm and queryNorm? Trying to understand how
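To make these terms a little more concrete, here is a minimal sketch of the classic Lucene TF-IDF factors as I understand them. The function names are my own, index-time boosts are left out, and real Lucene stores fieldNorm in a lossy one-byte encoding, so actual explain output will not match these numbers exactly.

```python
import math


def tf(freq):
    """Term frequency factor: grows with the square root of the raw count."""
    return math.sqrt(freq)


def idf(num_docs, doc_freq):
    """Inverse document frequency: rarer terms get a larger weight."""
    return 1.0 + math.log(num_docs / (doc_freq + 1))


def field_norm(num_terms_in_field):
    """Length normalisation: shorter fields score higher (boosts ignored)."""
    return 1.0 / math.sqrt(num_terms_in_field)


def query_norm(term_weights):
    """Query normalisation: makes scores comparable across queries."""
    return 1.0 / math.sqrt(sum(w * w for w in term_weights))


# A term occurring 4 times in a 16-term field, present in 9 of 100 documents.
print(tf(4), idf(100, 9), field_norm(16))
```

So the document length enters through fieldNorm, not through tf itself.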
Is the parsedquery_toString the query that is passed to Solr after all the
tokenizing and analyzing of the query?
For the search term 'chapter 7' I have this parsedquery_toString
<str name="parsedquery_toString">
+(text:(bankruptci chap 7) (7 chapter chap) 7 bankruptci^0.8 |
((name:bankruptci
Is it possible to implement something like this with the spellcheckhandler,
like Google does?
Say I search for 'chater 13 bakrupcy',
it should be able to display this:
did you search for 'chapter 13 bankruptcy'
Has someone been able to do this?
I've been using the Solr admin form with debug=true to do some in-depth
analysis on some results. Could someone explain how to make sense of
this? This is the debugging info for the first result I got.
10.201284 = (MATCH) sum of:
6.2467875 = (MATCH) max plus 0.01 times others of:
6.236769
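The line "max plus 0.01 times others" describes how a disjunction-max clause combines the per-field scores: the best field score is taken, and the remaining field scores are added in scaled by the tie-breaker (0.01 here). A minimal sketch of that arithmetic follows. The second sub-score, 1.00185, is not from the output above; it is back-computed from the two numbers shown, purely for illustration.

```python
def dismax_score(sub_scores, tie):
    """Best sub-score plus tie * (sum of the remaining sub-scores)."""
    best = max(sub_scores)
    others = sum(sub_scores) - best
    return best + tie * others


# 6.236769 is the maximum shown above; 1.00185 is an assumed "other" score.
print(dismax_score([6.236769, 1.00185], tie=0.01))
```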
I have the word delimiter filter factory in the text field definition both at
index and query time.
But it does have some negative effects on some search terms like h1-b visa.
It splits this into three tokens: h, 1, b. Now if I understand right, does
Solr look for matches for 'h' separately, '1'
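The splitting described here can be imitated with a few lines of code. This is a deliberately simplified emulation, not the filter itself: it only breaks on non-alphanumeric characters and on letter/digit boundaries, and it ignores the filter's other options (for example generateWordParts, catenateAll, splitOnCaseChange).

```python
import re


def split_like_word_delimiter(token):
    """Split a token into runs of letters and runs of digits."""
    return re.findall(r"[A-Za-z]+|[0-9]+", token)


print(split_like_word_delimiter("h1-b"))  # ['h', '1', 'b']
```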