[
https://issues.apache.org/jira/browse/SOLR-3574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Despot Jakimovski updated SOLR-3574:
------------------------------------
Attachment: SOLR-3574.patch
> Create a Compound Word Filter (and Factory) extension that will allow support
> for (word) exceptions
> ---------------------------------------------------------------------------------------------------
>
> Key: SOLR-3574
> URL: https://issues.apache.org/jira/browse/SOLR-3574
> Project: Solr
> Issue Type: New Feature
> Components: SearchComponents - other
> Affects Versions: 5.0
> Reporter: Despot Jakimovski
> Assignee: Despot Jakimovski
> Labels: compound-word, dictionary, feature, filter,
> word-exception
> Fix For: 5.0
>
> Attachments: SOLR-3574.patch
>
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> When having the following use case:
> We have 2 words "penslot" and "knoppen". One of them presents a compound word
> ("penslot"), the other one is a plural form of knop.
> When using the compound word filter, if we place the words "pen" "slot" and
> "knop" in the dictionary, for a search containing "knoppen", we get results
> containing "pen" also, which shouldn't be the case, because "knoppen" is only
> a plural form (not a compound word).
> We need another dictionary to specify the words that are exceptions to the
> filter (like in this case "knoppen"). In this case, the filter would find
> compound words containing "pen" "slot" and "knop", but will leave out
> dividing "knoppen" and searching on its parts.
> More info on the subject:
> http://stackoverflow.com/questions/11159839/can-we-make-the-compound-word-filter-not-divide-some-words-in-solr
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]