[jira] [Updated] (SOLR-3013) Add UIMA based tokenizers / filters that can be used in the schema.xml

Tommaso Teofili (Updated) (JIRA) Mon, 23 Jan 2012 03:41:07 -0800

     [ 
https://issues.apache.org/jira/browse/SOLR-3013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Tommaso Teofili updated SOLR-3013:
----------------------------------

    Attachment: SOLR-3013.patch

patch overview:
 - moved the 'ae' package out of 'processor' package since it's to be used by 
tokenizers too
 - created an 'analysis' package which contains 
tokenizers/analyzers/tokenizerfactories
 - updated the 'Introduction' section inside CHANGES.txt 
 

The UIMAAnnotationsTokenizer creates tokens using annotations created over the 
input Reader.
The UIMATypeAwareAnnotationsTokenizer creates tokens using annotations created 
over the input Reader adding also the TypeAttribute according to the specified 
UIMA FeaturePath.
                
> Add UIMA based tokenizers / filters that can be used in the schema.xml
> ----------------------------------------------------------------------
>
>                 Key: SOLR-3013
>                 URL: https://issues.apache.org/jira/browse/SOLR-3013
>             Project: Solr
>          Issue Type: Improvement
>          Components: update
>    Affects Versions: 3.5
>            Reporter: Tommaso Teofili
>            Priority: Minor
>              Labels: uima, update_request_handler
>             Fix For: 3.6, 4.0
>
>         Attachments: SOLR-3013.patch
>
>
> Add UIMA based tokenizers / filters that can be declared and used directly 
> inside the schema.xml.
> Thus instead of using the UIMA UpdateRequestProcessor one could directly 
> define per-field NLP capable tokenizers / filters.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-3013) Add UIMA based tokenizers / filters that can be used in the schema.xml

Reply via email to