[
https://issues.apache.org/jira/browse/SOLR-248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498711
]
Ryan McKinley commented on SOLR-248:
------------------------------------
It is a little strange, but (in my case anyway) i think it makes sense...
I am indexing a bunch of metadata from a bunch of libraries (OAI-PMH) -- I want
to display the data exactly as it came from the source, but for faceted
browsing I need to normalize capitalization.
Implemented at the indexing level, I can have different values for the stored
value and indexed terms. Also, at the indexing level I can leverage existing
Tokenizers and Filters to build the tokens that need capitalization -- it keeps
all the configuration in schema.xml and lets the OAI -> solr xml be a simple
transformation, this way whoever takes care of this need only learn solr
configuration, not ryan+solr configuration.
If it is not generally useful I can keep it elsewhere - that is why we have the
nice plugin framework!
> Capitalization Filter Factory
> -----------------------------
>
> Key: SOLR-248
> URL: https://issues.apache.org/jira/browse/SOLR-248
> Project: Solr
> Issue Type: New Feature
> Reporter: Ryan McKinley
> Priority: Minor
> Attachments: SOLR-248-CapitalizationFilter.patch
>
>
> For tokens that are used in faceting, it is nice to have standard
> capitalization.
> I want "Aerial views" and "Aerial Views" to both be: "Aerial Views"
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.