[
https://issues.apache.org/jira/browse/SOLR-2400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13002533#comment-13002533
]
Stefan Matheis (steffkes) commented on SOLR-2400:
-------------------------------------------------
Uwe, thanks for your reply.
bq. I wonder a little bit about your xml file, it only contains text and
position, but it should also contain rawTerm, startOffset, endOffset. When I
call analysis i get all of those attributes not only two of them. Is this a
hand-made file or what is the problem? Which Solr version?
My fault, indeed it's not the original output - i thought it would be enough to
demonstrate the point which i was talking about, sorry for that.
My Solr 4.x nighlty-build from last week only has the following output; there
is no rawTerm - which would be extremly helpful, because with this information
it should be possible to establish to relation i talked about earlier.
{code}<!-- .. -->
<arr name="org.apache.lucene.analysis.standard.StandardTokenizer">
<lst>
<str name="text">this</str>
<str name="type"><ALPHANUM></str>
<int name="start">0</int>
<int name="end">4</int>
<int name="position">1</int>
</lst>
<!-- .. -->
</arr>
<!-- .. -->{code}
May i miss an important configuration-setting for having rawTerm in
Analysis-Output?
> FieldAnalysisRequestHandler; add information about token-relation
> -----------------------------------------------------------------
>
> Key: SOLR-2400
> URL: https://issues.apache.org/jira/browse/SOLR-2400
> Project: Solr
> Issue Type: Improvement
> Components: Schema and Analysis
> Reporter: Stefan Matheis (steffkes)
> Priority: Minor
> Attachments: 110303_FieldAnalysisRequestHandler_output.xml,
> 110303_FieldAnalysisRequestHandler_view.png
>
>
> The XML-Output (simplified example attached) is missing one small information
> .. which could be very useful to build an nice Analysis-Output, and that's
> "Token-Relation" (if there is special/correct word for this, please correct
> me).
> Meaning, that is actually not possible to "follow" the Analysis-Process
> (completly) while the Tokenizers/Filters will drop out Tokens (f.e. StopWord)
> or split it into multiple Tokens (f.e. WordDelimiter).
> Would it be possible to include this Information? If so, it would be possible
> to create an improved Analysis-Page for the new Solr Admin (SOLR-2399) -
> short scribble attached
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]