[
https://issues.apache.org/jira/browse/SOLR-248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ryan McKinley updated SOLR-248:
-------------------------------
Attachment: SOLR-248-CapitalizationFilter.patch
Implementation and test...
<filter class="solr.CapitalizationFilterFactory" onlyFirstWord="false"
keep="and or the is my or de" maxTokenLength="40" maxWordCount="4"
okPrefix="McK" forceFirstLetter="true" />
onlyFirstWord="false" -- this capatalizes every word
keep="and or the is my or de" -- don't change capitalization for these words
forceFirstLetter="true" -- capitalize the first letter of the Token (not word)
even if it is in the "keep" list
maxTokenLength="40" -- if the token is longer then 40 chars, don't even try to
capitalize it
maxWordCount="4" -- if there are more then 4 words, don't try capitalizing
> Capitalization Filter Factory
> -----------------------------
>
> Key: SOLR-248
> URL: https://issues.apache.org/jira/browse/SOLR-248
> Project: Solr
> Issue Type: New Feature
> Reporter: Ryan McKinley
> Priority: Minor
> Attachments: SOLR-248-CapitalizationFilter.patch
>
>
> For tokens that are used in faceting, it is nice to have standard
> capitalization.
> I want "Aerial views" and "Aerial Views" to both be: "Aerial Views"
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.