Hi
I use 'edismax'.
Our main language uses compound words.
There is an issue here.
For example, assume that 'ab' => 'a' and 'b' are analyzed.
The results are different when searching with 'ab' and 'a b'.
I want to get the same result as searching 'a b' when searching
- Dictionary/Compound-Words
File
Solr User Group -
I have a case where I need to be able to search against compound words, even
when the user delimits with a space. (e.g. baseball = base ball). I think
I've solved this by creating a compound-words dictionary file containing the
split words
Solr User Group -
I have a case where I need to be able to search against compound words, even
when the user delimits with a space. (e.g. baseball = base ball). I think
I've solved this by creating a compound-words dictionary file containing the
split words that I would want
.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4132478.html
Sent from the Solr - User mailing list archive at Nabble.com.
to read up on it as I'm quite new to Solr and I think my way is a bit
convoluted to be honest.
Thanks.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4132478.html
Sent from the Solr - User mailing list archive
--
View this message in context:
http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4132338.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hello all,
I'm a fairly new Solr user and I need my search function to handle compound
words in German. I've searched through the archives and found that Solr
already has a Filter Factory made for such words called
DictionaryCompoundWordTokenFilterFactory. I've already built a list of words
AM
To: solr-user@lucene.apache.org
Subject: Having trouble with German compound words in Solr 4.7
Hello all,
I'm a fairly new Solr user and I need my search function to handle compound
words in German. I've searched through the archives and found that Solr
already has a Filter Factory made
enter the query as keyword:schwarz AND keyword:kleid I only get 20283
results (which is the correct one).
--
View this message in context:
http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4131973.html
Sent from the Solr - User mailing list
20283
results (which is the correct one).
--
View this message in context:
http://lucene.472066.n3.nabble.com/Having-trouble-with-German-compound-words-in-Solr-4-7-tp4131964p4131973.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi Erick,
I tried with expand=true and got exactly the same tokens i.e., seabiscuit
sea bird at 1,2 and 3 positions respectively. As per solr documentation at
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory,
explicit mappings ignore the expand parameter in
Hi,
I'm an infant in Solr/Lucene family, just a couple of months old.
We are trying to find a way to combine words into a single compound word at
index and query time. E.g. if the document has sea bird in it, it should
be indexed as seabird and any query having sea bird in it should also look
One more thing, Is there a way to remove my accidentally sent phone number
in the signature from the previous mail? aarrrggghhh
Why did you reject using synonyms? You can have multi-word
synonyms just fine at index time, and at query time, since the
multiple words are already substituted in the index you don't
need to do the same substitution, just query the raw strings.
I freely acknowledge you may have very good reasons
Hi Erick,
Thanks for the suggestion. Like I said, I'm an infant.
We tried synonyms both ways. sea biscuit = seabiscuit and seabiscuit =
sea biscuit and didn't understand exactly how it worked. But I just checked
the analysis tool, and it seems to work perfectly fine at index time. Now,
I can
Consider setting expand=true at index time. That
puts all the tokens in your index, and then you
may not need to have any synonym
processing at query time since all the variants will
already be in the index.
As it is, you've replaced the words in the original with
synonyms, essentially collapsed
Hi Parvesh,
I think you should check the following jira
https://issues.apache.org/jira/browse/SOLR-5379. You will find there links
to other possible solutions/problems:-)
Roman
On 28 Oct 2013 09:06, Erick Erickson erickerick...@gmail.com wrote:
Consider setting expand=true at index time. That
Hi Roman, thanks for the link, will go through it.
Erick, will try with expand=true once and check out the results. Will
update this thread with the findings. I remember we rejected expand=true
because of some weird spaghetti problem. Will check it out again.
Thanks,
Parvesh Garg
can anybody point me in the right direction in how I go about adding
: the
:
: org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter
:
: Token filter to the solr schema.xml.
:
:
:
:
:
: I need to be able to break German compound words, and from what I have
: read
19 matches
Mail list logo