[jira] [Updated] (LUCENE-6177) Add CustomAnalyzer - a builder that creates Analyzers from the factory classes

Uwe Schindler (JIRA) Mon, 12 Jan 2015 07:51:02 -0800

     [ 
https://issues.apache.org/jira/browse/LUCENE-6177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Uwe Schindler updated LUCENE-6177:
----------------------------------
    Description: 
I prepared some "generic Analyzer class {{CustomAnalyzer}}, that makes it easy 
to build analyzers like in Solr or Elasticsearch. Under the hood it uses the 
factory classes. The class is made like a builder:

{code:java}
Analyzer ana = CustomAnalyzer.builder(Path.get("/path/to/config/dir"))
  .withTokenizer("standard")
  .addTokenFilter("standard")
  .addTokenFilter("lowercase")
  .addTokenFilter("stop", "ignoreCase", "false", "words", "stopwords.txt", 
"format", "wordset")
  .build();
{code}

It is possible to give the resource loader (used by stopwords and similar). By 
default it tries to load stuff from context classloader (without any class as 
reference so paths must be absolute - this is the behaviour 
ClasspathResourseLoader defaults to).

In addition you can give a Lucene MatchVersion, by default it would use 
Version.LATEST (once LUCENE-5900 is completely fixed).

  was:
I prepared some "generic Analyzer class {{CustomAnalyzer}}, that makes it easy 
to build analyzers like in Solr or Elasticsearch. Under the hood it uses the 
factory classes. The class is made like a builder:

{code:java}
Analyzer ana = CustomAnalyzer.builder(Path.get("/path/to/config/dir"))
  .withTokenizer("standard", key1, value1, key2, value2)
  .addTokenFilter("standard")
  .addTokenFilter("lowercase").
  .addTokenFilter("stop", "ignoreCase", "true", "words", "stopwords.txt", 
"format", "wordset")
  .build();
{code}

It is possible to give the resource loader (used by stopwords and similar). By 
default it tries to load stuff from context classloader (without any class as 
reference so paths must be absolute - this is the behaviour 
ClasspathResourseLoader defaults to).

In addition you can give a Lucene MatchVersion, by default it would use 
Version.LATEST (once LUCENE-5900 is completely fixed).


> Add CustomAnalyzer - a builder that creates Analyzers from the factory classes
> ------------------------------------------------------------------------------
>
>                 Key: LUCENE-6177
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6177
>             Project: Lucene - Core
>          Issue Type: New Feature
>          Components: modules/analysis
>            Reporter: Uwe Schindler
>            Assignee: Uwe Schindler
>             Fix For: 5.0, Trunk
>
>
> I prepared some "generic Analyzer class {{CustomAnalyzer}}, that makes it 
> easy to build analyzers like in Solr or Elasticsearch. Under the hood it uses 
> the factory classes. The class is made like a builder:
> {code:java}
> Analyzer ana = CustomAnalyzer.builder(Path.get("/path/to/config/dir"))
>   .withTokenizer("standard")
>   .addTokenFilter("standard")
>   .addTokenFilter("lowercase")
>   .addTokenFilter("stop", "ignoreCase", "false", "words", "stopwords.txt", 
> "format", "wordset")
>   .build();
> {code}
> It is possible to give the resource loader (used by stopwords and similar). 
> By default it tries to load stuff from context classloader (without any class 
> as reference so paths must be absolute - this is the behaviour 
> ClasspathResourseLoader defaults to).
> In addition you can give a Lucene MatchVersion, by default it would use 
> Version.LATEST (once LUCENE-5900 is completely fixed).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Updated] (LUCENE-6177) Add CustomAnalyzer - a builder that creates Analyzers from the factory classes

Reply via email to