[search > edismax] compound words different result issue

2019-02-11 Thread 유정인
Hi, I use 'edismax'. Our main language uses compound words, and this causes a problem. For example, assume that 'ab' is analyzed into 'a' and 'b'. The results differ between searching for 'ab' and searching for 'a b'. I want to get the same result as searching 'a b' when searching

Re: DictionaryCompoundWordTokenFilterFactory - Dictionary/Compound-Words File

2015-04-07 Thread Mike L.
Solr User Group - I have a case where I need to be able to search against compound words, even when the user delimits with a space (e.g. baseball = base ball). I think I've solved this by creating a compound-words dictionary file containing the split words

DictionaryCompoundWordTokenFilterFactory - Dictionary/Compound-Words File

2015-04-07 Thread Mike L.
Solr User Group - I have a case where I need to be able to search against compound words, even when the user delimits with a space (e.g. baseball = base ball). I think I've solved this by creating a compound-words dictionary file containing the split words that I would want
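For readers skimming this thread, here is a rough Python sketch of the idea behind dictionary-based decompounding. It is not Solr's implementation: the function name `decompound`, the `min_subword_size` parameter, and the two-word dictionary are all hypothetical and only illustrate why indexing the parts of "baseball" lets a query for "base ball" match.

```python
def decompound(token, dictionary, min_subword_size=2):
    """Return the original token followed by every dictionary word found inside it.

    Hypothetical helper for illustration only; it scans all substrings of the
    token and keeps those that appear in the supplied dictionary.
    """
    token_lower = token.lower()
    found = []
    for start in range(len(token_lower)):
        for end in range(start + min_subword_size, len(token_lower) + 1):
            candidate = token_lower[start:end]
            # Skip the whole token itself; it is already emitted first.
            if candidate in dictionary and candidate != token_lower:
                found.append(candidate)
    return [token] + found


# Assumed dictionary contents: the split words from the example in the post.
dictionary = {"base", "ball"}
tokens = decompound("baseball", dictionary)
print(tokens)
```

With the original token kept alongside its parts, a document containing "baseball" carries the terms "base" and "ball" as well, so a space-delimited query can find it.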

Re: Having trouble with German compound words in Solr 4.7

2014-04-24 Thread Siegfried Goeschl
. -- View this message in context: http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4132478.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Having trouble with German compound words in Solr 4.7

2014-04-22 Thread Alistair
to read up on it as I'm quite new to Solr and I think my way is a bit convoluted to be honest. Thanks.

Re: Having trouble with German compound words in Solr 4.7

2014-04-21 Thread Alistair
-- View this message in context: http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4132338.html Sent from the Solr - User mailing list archive at Nabble.com.

Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Alistair
Hello all, I'm a fairly new Solr user and I need my search function to handle compound words in German. I've searched through the archives and found that Solr already has a Filter Factory made for such words called DictionaryCompoundWordTokenFilterFactory. I've already built a list of words

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Jack Krupansky
AM To: solr-user@lucene.apache.org Subject: Having trouble with German compound words in Solr 4.7 Hello all, I'm a fairly new Solr user and I need my search function to handle compound words in German. I've searched through the archives and found that Solr already has a Filter Factory made

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Alistair
enter the query as keyword:schwarz AND keyword:kleid I only get 20283 results (which is the correct one). -- View this message in context: http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4131973.html Sent from the Solr - User mailing list

Re: Having trouble with German compound words in Solr 4.7

2014-04-18 Thread Siegfried Goeschl
20283 results (which is the correct one).

Re: Compound words

2013-10-29 Thread Parvesh Garg
Hi Erick, I tried with expand=true and got exactly the same tokens, i.e., seabiscuit, sea, and bird at positions 1, 2, and 3 respectively. As per the Solr documentation at http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory, explicit mappings ignore the expand parameter in

Compound words

2013-10-28 Thread Parvesh Garg
Hi, I'm an infant in the Solr/Lucene family, just a couple of months old. We are trying to find a way to combine words into a single compound word at index and query time. E.g. if the document has "sea bird" in it, it should be indexed as "seabird", and any query having "sea bird" in it should also look
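The behaviour being asked for can be sketched in a few lines of Python. This is only an illustration under assumptions: `join_compounds` and the `compounds` mapping are hypothetical names, and the sketch handles two-word compounds only. Running the same function over document text and query text is what makes both sides agree on "seabird".

```python
def join_compounds(tokens, compounds):
    """Merge adjacent token pairs listed in `compounds` into one compound token.

    Hypothetical helper: `compounds` maps a (first, second) pair to the
    compound word that should replace it.
    """
    out = []
    i = 0
    while i < len(tokens):
        pair = tuple(tokens[i:i + 2])
        if pair in compounds:
            out.append(compounds[pair])
            i += 2  # consume both words of the pair
        else:
            out.append(tokens[i])
            i += 1
    return out


# Assumed mapping, taken from the example in the post.
compounds = {("sea", "bird"): "seabird"}

indexed = join_compounds("a sea bird flew by".split(), compounds)
query = join_compounds("sea bird".split(), compounds)
print(indexed, query)
```

Because the same rewrite is applied on both sides, the query term "seabird" is present in the indexed tokens.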

Re: Compound words

2013-10-28 Thread Parvesh Garg
One more thing: is there a way to remove the phone number I accidentally sent in the signature of the previous mail? aarrrggghhh

Re: Compound words

2013-10-28 Thread Erick Erickson
Why did you reject using synonyms? You can have multi-word synonyms just fine at index time, and at query time, since the multiple words are already substituted in the index you don't need to do the same substitution, just query the raw strings. I freely acknowledge you may have very good reasons

Re: Compound words

2013-10-28 Thread Parvesh Garg
Hi Erick, Thanks for the suggestion. Like I said, I'm an infant. We tried synonyms both ways. sea biscuit = seabiscuit and seabiscuit = sea biscuit and didn't understand exactly how it worked. But I just checked the analysis tool, and it seems to work perfectly fine at index time. Now, I can

Re: Compound words

2013-10-28 Thread Erick Erickson
Consider setting expand=true at index time. That puts all the tokens in your index, and then you may not need to have any synonym processing at query time since all the variants will already be in the index. As it is, you've replaced the words in the original with synonyms, essentially collapsed
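A loose Python sketch of the suggestion above, under stated assumptions: `index_terms`, `matches`, and the synonym group are hypothetical, and the sketch ignores token positions, which Solr does track. It only shows the general idea that when every variant is written into the index, the raw query words can match without query-time synonym processing.

```python
def index_terms(text, synonym_groups):
    """Return the set of terms stored for a document, with synonym variants expanded.

    Hypothetical helper: if any variant of a group occurs in the text, the
    words of every variant in that group are added to the stored terms.
    """
    tokens = text.lower().split()
    terms = set(tokens)
    joined = " ".join(tokens)
    for group in synonym_groups:
        # Pad with spaces so variants only match on whole-word boundaries.
        if any(f" {variant} " in f" {joined} " for variant in group):
            for variant in group:
                terms.update(variant.split())
    return terms


def matches(query, terms):
    """A raw query matches when every query word is among the stored terms."""
    return all(word in terms for word in query.lower().split())


# Assumed synonym group, based on the example discussed in this thread.
groups = [["sea biscuit", "seabiscuit"]]
doc = index_terms("Sea Biscuit won the race", groups)
print(matches("seabiscuit", doc), matches("sea biscuit", doc))
```

The trade-off in this sketch is a larger index in exchange for a simpler query side: the query is matched as typed, with no substitution step.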

Re: Compound words

2013-10-28 Thread Roman Chyla
Hi Parvesh, I think you should check the following jira https://issues.apache.org/jira/browse/SOLR-5379. You will find there links to other possible solutions/problems:-) Roman On 28 Oct 2013 09:06, Erick Erickson erickerick...@gmail.com wrote: Consider setting expand=true at index time. That

Re: Compound words

2013-10-28 Thread Parvesh Garg
Hi Roman, thanks for the link, will go through it. Erick, will try with expand=true once and check out the results. Will update this thread with the findings. I remember we rejected expand=true because of some weird spaghetti problem. Will check it out again. Thanks, Parvesh Garg

Re: Adding the Lucene org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter to solr for german compound words

2008-07-23 Thread Chris Hostetter
can anybody point me in the right direction in how I go about adding the org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter Token filter to the solr schema.xml. I need to be able to break German compound words, and from what I have read