[
https://issues.apache.org/jira/browse/SOLR-477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Alexandre Rafalovitch closed SOLR-477.
--------------------------------------
Resolved long time ago, but was not "closed".
> AnalysisRequestHandler
> ----------------------
>
> Key: SOLR-477
> URL: https://issues.apache.org/jira/browse/SOLR-477
> Project: Solr
> Issue Type: New Feature
> Reporter: Grant Ingersoll
> Assignee: Grant Ingersoll
> Priority: Minor
> Attachments: SOLR-477.patch, SOLR-477.patch
>
>
> Being able to programmatically access tokenization information can be quite
> useful not only in Solr, but in other NLP applications where token vectors
> are necessary.
> The patch to follow creates an AnalysisRequestHandler which processes a
> document through the analysis process and returns a response filled with
> tokens, their offsets, position inc., type and value.
> Patch also adds some character array processing to Xml and adds Token
> handling to XMLWriter.
> I only implemented Xml output, as I don't know JSON or the other types. If
> someone else is so motivated, they can add those.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]