[ https://issues.apache.org/jira/browse/SOLR-477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12567955#action_12567955 ]
Ryan McKinley commented on SOLR-477: ------------------------------------ > I've been meaning to create a handler like this... > I'm not sure that Token should be directly added to XMLWriter and friends > though... seems more flexible for the handler to handle the transformation > from token(s) to more primitive types. Or did you actually add a <token> tag > to the xml? > I've been using something similar -- but it is implemented as a SearchComponent -- it adds another bit to the response like highlighting does. If possible, i think we should avoid custom XMLWriter implementations... the NamedList suff is a bit akward, but having stuff work in JSON/ruby/etc is a big deal > AnalysisRequestHandler > ---------------------- > > Key: SOLR-477 > URL: https://issues.apache.org/jira/browse/SOLR-477 > Project: Solr > Issue Type: New Feature > Reporter: Grant Ingersoll > Assignee: Grant Ingersoll > Priority: Minor > Attachments: SOLR-477.patch > > > Being able to programmatically access tokenization information can be quite > useful not only in Solr, but in other NLP applications where token vectors > are necessary. > The patch to follow creates an AnalysisRequestHandler which processes a > document through the analysis process and returns a response filled with > tokens, their offsets, position inc., type and value. > Patch also adds some character array processing to Xml and adds Token > handling to XMLWriter. > I only implemented Xml output, as I don't know JSON or the other types. If > someone else is so motivated, they can add those. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.