[jira] [Commented] (SOLR-13593) Allow to specify analyzer components by their SPI names in schema definition

Tomoko Uchida (JIRA) Sun, 04 Aug 2019 03:18:20 -0700


    [ 
https://issues.apache.org/jira/browse/SOLR-13593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16899602#comment-16899602
 ]


Tomoko Uchida commented on SOLR-13593:
--------------------------------------

I have updated the pull request.

1. If both "name" and "class" are specified, the redundancy does not cause an 
error, but a warning is emitted when the schema is loaded. In this case "name" 
takes priority over "class". (In a future release "class" could be deprecated, 
so this behaviour makes sense to me. Any comments?)
2. Added unit tests for loading field types from schema.xml and for creating 
them via the REST API.
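
To illustrate point 1, here is a minimal sketch that reuses the field type 
from the issue description and gives the tokenizer both attributes. Assuming 
the behaviour described above for the updated pull request (it may still 
change during review), this should load with a warning, and the tokenizer 
should be resolved from name="whitespace" rather than from "class":
{code:xml}
<fieldtype name="myfieldtype" class="solr.TextField">
  <analyzer>
    <!-- Both attributes given: per point 1, "name" takes priority and a warning is logged. -->
    <tokenizer name="whitespace" class="solr.WhitespaceTokenizerFactory"/>
    <filter name="porterStem" />
  </analyzer>
</fieldtype>
{code}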

LUCENE-8778 was backported with proper backwards compatibility (LUCENE-8911), 
so I think we can expose this feature starting from an 8.x minor release. After 
the pull request is reviewed, I'd like to commit the changes to the master and 
8x branches, then migrate the default schema file(s) and the examples in the 
Ref Guide.

> Allow to specify analyzer components by their SPI names in schema definition
> ----------------------------------------------------------------------------
>
>                 Key: SOLR-13593
>                 URL: https://issues.apache.org/jira/browse/SOLR-13593
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>            Reporter: Tomoko Uchida
>            Priority: Major
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> Each analysis factory now has an explicitly documented SPI name, which is 
> stored in the static "NAME" field (LUCENE-8778).
> Solr uses the factories' simple class names in schema definitions (like 
> class="solr.WhitespaceTokenizerFactory"), but we should also be able to use 
> the more concise SPI names (like name="whitespace").
> e.g.:
> {code:xml}
> <fieldtype name="myfieldtype" class="solr.TextField">
>   <analyzer>
>     <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>     <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt" />
>     <filter class="solr.PorterStemFilterFactory" />
>   </analyzer>
> </fieldtype>
> {code}
> would be
> {code:xml}
> <fieldtype name="myfieldtype" class="solr.TextField">
>   <analyzer>
>     <tokenizer name="whitespace"/>
>     <filter name="keywordMarker" protected="protwords.txt" />
>     <filter name="porterStem" />
>   </analyzer>
> </fieldtype>
> {code}



