[ 
https://issues.apache.org/jira/browse/SOLR-3574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Despot Jakimovski updated SOLR-3574:
------------------------------------

    Description: 
When having the following use case:
We have 2 words "penslot" and "knoppen". One of them presents a compound word 
("penslot"), the other one is a plural form of knop.

When using the compound word filter, if we place the words "pen" "slot" and 
"knop" in the dictionary, for a search containing "knoppen", we get results 
containing "pen" also, which shouldn't be the case, because "knoppen" is only a 
plural form (not a compound word). 

We need another dictionary to specify the words that are exceptions to the 
filter (like in this case "knoppen"). In this case, the filter would find 
compound words containing "pen" "slot" and "knop", but will leave out dividing 
"knoppen" and searching on its parts.

More info on the subject: 
http://stackoverflow.com/questions/11159839/can-we-make-the-compound-word-filter-not-divide-some-words-in-solr

  was:
When having the following use case:
We have 2 words "penslot" and "knoppen". One of them presents a compound word 
("penslot"), the other one is a plural form of knop.

When using the compound word filter, if we place the words "pen" "slot" and 
"knop" in the dictionary, for a search containing "knoppen", we get results 
containing "pen" also, which shouldn't be the case, because "knoppen" is only a 
plural form (not a compound word). 

We need another dictionary to specify the words that are exceptions to the 
filter (like in this case "knoppen"). In this case, the filter would find 
compound words containing "pen" "slot" and "knop", but will leave out dividing 
"knoppen" and searching on its parts.



    
> Create a Compound Word Filter (and Factory) extension that will allow support 
> for (word) exceptions
> ---------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-3574
>                 URL: https://issues.apache.org/jira/browse/SOLR-3574
>             Project: Solr
>          Issue Type: New Feature
>          Components: SearchComponents - other
>    Affects Versions: 3.6
>            Reporter: Despot Jakimovski
>              Labels: compound-word, dictionary, feature, filter, 
> word-exception
>             Fix For: 3.6
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> When having the following use case:
> We have 2 words "penslot" and "knoppen". One of them presents a compound word 
> ("penslot"), the other one is a plural form of knop.
> When using the compound word filter, if we place the words "pen" "slot" and 
> "knop" in the dictionary, for a search containing "knoppen", we get results 
> containing "pen" also, which shouldn't be the case, because "knoppen" is only 
> a plural form (not a compound word). 
> We need another dictionary to specify the words that are exceptions to the 
> filter (like in this case "knoppen"). In this case, the filter would find 
> compound words containing "pen" "slot" and "knop", but will leave out 
> dividing "knoppen" and searching on its parts.
> More info on the subject: 
> http://stackoverflow.com/questions/11159839/can-we-make-the-compound-word-filter-not-divide-some-words-in-solr

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to