[search > edismax] compound words different result issue

2019-02-11 Thread 유정인
Hi I use 'edismax'. Our main language uses compound words. There is an issue here. For example, assume that 'ab' => 'a' and 'b' are analyzed. The results are different when searching with 'ab' and 'a b'. I want to get the

Re: DictionaryCompoundWordTokenFilterFactory - Dictionary/Compound-Words File

2015-04-07 Thread Mike L.
Typo:   *even when the user delimits with a space. (e.g. base ball should find baseball). Thanks, From: Mike L. To: "solr-user@lucene.apache.org" Sent: Tuesday, April 7, 2015 9:05 AM Subject: DictionaryCompoundWordTokenFilterFactory - Dictionary/Compound-Words File

DictionaryCompoundWordTokenFilterFactory - Dictionary/Compound-Words File

2015-04-07 Thread Mike L.
Solr User Group -    I have a case where I need to be able to search against compound words, even when the user delimits with a space. (e.g. baseball => base ball).  I think I've solved this by creating a compound-words dictionary file containing the split words that I wo

Re: Having trouble with German compound words in Solr 4.7

2014-04-24 Thread Siegfried Goeschl
nz'. If anybody has a better solution to what I've posted I would be more than happy to read up on it as I'm quite new to Solr and I think my way is a bit convoluted to be honest. Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4132478.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Having trouble with German compound words in Solr 4.7

2014-04-22 Thread Alistair
posted I would be more than happy to read up on it as I'm quite new to Solr and I think my way is a bit convoluted to be honest. Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4132478.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Having trouble with German compound words in Solr 4.7

2014-04-21 Thread Alistair
Ali -- View this message in context: http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4132338.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Siegfried Goeschl
ord:schwarz AND keyword:kleid I only get 20283 > results (which is the correct one). > > > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4131973.html > Sent from the Solr - User mailing list archive at Nabble.com.

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Alistair
ts, whereas if I manually enter the query as keyword:schwarz AND keyword:kleid I only get 20283 results (which is the correct one). -- View this message in context: http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4131973.html Sent from the

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Jack Krupansky
18, 2014 6:11 AM To: solr-user@lucene.apache.org Subject: Having trouble with German compound words in Solr 4.7 Hello all, I'm a fairly new Solr user and I need my search function to handle compound words in German. I've searched through the archives and found that Solr already has a Fil

Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Alistair
Hello all, I'm a fairly new Solr user and I need my search function to handle compound words in German. I've searched through the archives and found that Solr already has a Filter Factory made for such words called DictionaryCompoundWordTokenFilterFactory. I've already built a lis

Re: Compound words

2013-10-29 Thread Parvesh Garg
Hi Erick, I tried with expand=true and got exactly the same tokens i.e., seabiscuit sea bird at 1,2 and 3 positions respectively. As per solr documentation at http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory, explicit mappings ignore the expand parameter in the

Re: Compound words

2013-10-28 Thread Parvesh Garg
Hi Roman, thanks for the link, will go through it. Erick, will try with expand=true once and check out the results. Will update this thread with the findings. I remember we rejected expand=true because of some weird spaghetti problem. Will check it out again. Thanks, Parvesh Garg http://www.zett

Re: Compound words

2013-10-28 Thread Roman Chyla
Hi Parvesh, I think you should check the following jira https://issues.apache.org/jira/browse/SOLR-5379. You will find there links to other possible solutions/problems:-) Roman On 28 Oct 2013 09:06, "Erick Erickson" wrote: > Consider setting expand=true at index time. That > puts all the tokens i

Re: Compound words

2013-10-28 Thread Erick Erickson
Consider setting expand=true at index time. That puts all the tokens in your index, and then you may not need to have any synonym processing at query time since all the variants will already be in the index. As it is, you've replaced the words in the original with synonyms, essentially collapsed t

Re: Compound words

2013-10-28 Thread Parvesh Garg
Hi Erick, Thanks for the suggestion. Like I said, I'm an infant. We tried synonyms both ways. sea biscuit => seabiscuit and seabiscuit => sea biscuit and didn't understand exactly how it worked. But I just checked the analysis tool, and it seems to work perfectly fine at index time. Now, I can ha

Re: Compound words

2013-10-28 Thread Erick Erickson
Why did you reject using synonyms? You can have multi-word synonyms just fine at index time, and at query time, since the multiple words are already substituted in the index you don't need to do the same substitution, just query the raw strings. I freely acknowledge you may have very good reasons

Re: Compound words

2013-10-28 Thread Parvesh Garg
One more thing, Is there a way to remove my "accidentally sent phone number in the signature" from the previous mail? aarrrggghhh

Compound words

2013-10-28 Thread Parvesh Garg
Hi, I'm an infant in Solr/Lucene family, just a couple of months old. We are trying to find a way to combine words into a single compound word at index and query time. E.g. if the document has "sea bird" in it, it should be indexed as seabird and any query having sea bird in it should also look f

Re: Adding the Lucene org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter to solr for german compound words

2008-07-23 Thread Chris Hostetter
rote: : : > Hi can anybody point me in the right direction in how I go about adding : > the : > : > org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter : > : > Token filter to the solr schema.xml. : > : > : > : > : > : > I need to be a

Re: Adding the Lucene org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter to solr for german compound words

2008-07-23 Thread Grant Ingersoll
how I go about adding the org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter Token filter to the solr schema.xml. I need to be able to break German compound words, and from what I have read this Token filter would seem to be what I need to use, my question is how do I

Adding the Lucene org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter to solr for german compound words

2008-07-23 Thread Barry Harding
Hi can anybody point me in the right direction in how I go about adding the org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter Token filter to the solr schema.xml. I need to be able to break German compound words, and from what I have read this Token filter would seem