I am looking for a quick solution to expand queries so they will look for synonms as well. The same way WordNet is doing - it will looks for other words that mean the same as written in the query.
So Synonyms and WordNet are better categories to describe what I need. Any idea? Currently what I do, is expending the query in my own code - for every word in the query I also add the 10 most close words to that one and parse it to the query parser. It takes a bit time and I wondered whether I am missing something and it can be done in a better way. Thanks again, Liat 2009/3/18 Grant Ingersoll <gsing...@apache.org> > > On Mar 17, 2009, at 5:44 AM, liat oren wrote: > > Thanks for all the answers. >> >> I am new to Lucene and in the emails its the first time I heard of the >> bigrams and thus read about them a bit. >> >> Question - if I query for "cat animal" - or use boosting - "cat^2 >> animal^0.5" - will the results return ONLY documents that contain both? >> From what I saw until now - it can also show documents that contain one of >> them, no? >> > > I think if you are using bigrams, then you would only match on one, but if > you do the prefix/wildard approach you could match on either. I'm not sure > if you will be able to pull off doing the individual term boosting and the > bigrams. You will likely need to write your own Query classes to do that. > > If you don't mind me asking, what is the problem you are trying to solve? > I know the solution you want (I think, namely boosted bigrams of some > sort), but I'm still clueless on the problem and I think that is really > hindering me helping. It sounds like it is some type of co-occurrence > problem, but I'm not sure. Is there a bigger category that what you are > doing fits in? If you can't say, that is fine, too. It may be some > proprietary thing. > > >> >> Can you please elaborate a bit more on your suggestion? >> >> I read a bit on the synonyms and the wordNet package. >> Isn't there a way to use an index that is structured in the same way the >> index of the wordNet (any idea how is this index built?), but stores other >> values? >> > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > >