Osma Suominen created JENA-1058:
-----------------------------------
Summary: add ASCIIFoldingLowerCaseKeywordAnalyzer to jena-text
Key: JENA-1058
URL: https://issues.apache.org/jira/browse/JENA-1058
Project: Apache Jena
Issue Type: New Feature
Components: Text
Reporter: Osma Suominen
Assignee: Osma Suominen
I'd like to have an Analyzer for jena-text which is otherwise like
LowerCaseKeywordAnalyzer that I've implemented before, but also includes the
ASCIIFoldingFilter from Lucene. This means that the comparison will ignore
accents, so that for example "deja vu" will match "déjà vu".
For some background on why I need this, see
https://github.com/NatLibFi/Skosmos/issues/313
I already have an implementation of this ready, will make a PR shortly.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)