[ https://issues.apache.org/jira/browse/SOLR-3013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tommaso Teofili updated SOLR-3013: ---------------------------------- Attachment: SOLR-3013.patch patch overview: - moved the 'ae' package out of 'processor' package since it's to be used by tokenizers too - created an 'analysis' package which contains tokenizers/analyzers/tokenizerfactories - updated the 'Introduction' section inside CHANGES.txt The UIMAAnnotationsTokenizer creates tokens using annotations created over the input Reader. The UIMATypeAwareAnnotationsTokenizer creates tokens using annotations created over the input Reader adding also the TypeAttribute according to the specified UIMA FeaturePath. > Add UIMA based tokenizers / filters that can be used in the schema.xml > ---------------------------------------------------------------------- > > Key: SOLR-3013 > URL: https://issues.apache.org/jira/browse/SOLR-3013 > Project: Solr > Issue Type: Improvement > Components: update > Affects Versions: 3.5 > Reporter: Tommaso Teofili > Priority: Minor > Labels: uima, update_request_handler > Fix For: 3.6, 4.0 > > Attachments: SOLR-3013.patch > > > Add UIMA based tokenizers / filters that can be declared and used directly > inside the schema.xml. > Thus instead of using the UIMA UpdateRequestProcessor one could directly > define per-field NLP capable tokenizers / filters. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org