Osma Suominen created JENA-1062:
-----------------------------------

             Summary: add ConfigurableAnalyzer to jena-text
                 Key: JENA-1062
                 URL: https://issues.apache.org/jira/browse/JENA-1062
             Project: Apache Jena
          Issue Type: New Feature
          Components: Text
            Reporter: Osma Suominen
            Assignee: Osma Suominen


This is an alternative to JENA-1058 (which implemented a very specific Lucene 
Analyzer for jena-text). The idea here, based on a comment by Claude Warren on 
JENA-1058, is to provide a ConfigurableAnalyzer that can be configured with a 
Tokenizer and (optionally) one or more TokenFilters, like this:

text:analyzer [
  a text:ConfigurableAnalyzer ;
  text:tokenizer text:KeywordTokenizer ;
  text:filters (text:ASCIIFoldingFilter, text:LowerCaseFilter)
]

I have some code ready to implement this and will open a PR shortly.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to