[jira] Commented: (SOLR-2400) FieldAnalysisRequestHandler; add information about token-relation

Stefan Matheis (steffkes) (JIRA) Fri, 04 Mar 2011 02:50:00 -0800

    [ 
https://issues.apache.org/jira/browse/SOLR-2400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13002533#comment-13002533
 ]


Stefan Matheis (steffkes) commented on SOLR-2400:
-------------------------------------------------

Uwe, thanks for your reply.

bq. I wonder a little bit about your xml file, it only contains text and 
position, but it should also contain rawTerm, startOffset, endOffset. When I 
call analysis I get all of those attributes, not only two of them. Is this a 
hand-made file or what is the problem? Which Solr version?
My fault, it is indeed not the original output. I thought it would be enough to 
demonstrate the point I was talking about; sorry for that.

My Solr 4.x nightly build from last week produces only the following output. There 
is no rawTerm, which would be extremely helpful, because with this information 
it should be possible to establish the relation I talked about earlier.

{code}<!-- .. -->
<arr name="org.apache.lucene.analysis.standard.StandardTokenizer">
  <lst>
    <str name="text">this</str>
    <str name="type">&lt;ALPHANUM&gt;</str>
    <int name="start">0</int>
    <int name="end">4</int>
    <int name="position">1</int>
  </lst>
  <!-- .. -->
</arr>
<!-- .. -->{code}

Am I missing an important configuration setting that is needed to get rawTerm 
in the analysis output?
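
To check which attributes a given build actually returns, a fragment like the one above can be inspected with a few lines of Python. This is only a rough sketch: `SAMPLE` is a hand-copied fragment rather than a live response, the type value is entity-escaped so that the XML parses, and the names `SAMPLE` and `token_attributes` are made up for this illustration.

```python
import xml.etree.ElementTree as ET

# Hand-copied fragment modelled on the output above (not a live response).
# The type value is entity-escaped so that the fragment is well-formed XML.
SAMPLE = """
<arr name="org.apache.lucene.analysis.standard.StandardTokenizer">
  <lst>
    <str name="text">this</str>
    <str name="type">&lt;ALPHANUM&gt;</str>
    <int name="start">0</int>
    <int name="end">4</int>
    <int name="position">1</int>
  </lst>
</arr>
"""


def token_attributes(xml_text):
    """Return one dict per <lst> token, mapping each attribute name to its text."""
    stage = ET.fromstring(xml_text.strip())
    return [
        {child.get("name"): child.text for child in lst}
        for lst in stage.findall("lst")
    ]


tokens = token_attributes(SAMPLE)
for token in tokens:
    # Report whether the attribute asked about in this comment is present.
    has_raw_term = "rawTerm" in token
    print(token["text"], sorted(token), "rawTerm present:", has_raw_term)
```

All values come back as strings, so `start`, `end` and `position` would need an `int()` conversion before they are compared between analysis stages.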

> FieldAnalysisRequestHandler; add information about token-relation
> -----------------------------------------------------------------
>
>                 Key: SOLR-2400
>                 URL: https://issues.apache.org/jira/browse/SOLR-2400
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>            Reporter: Stefan Matheis (steffkes)
>            Priority: Minor
>         Attachments: 110303_FieldAnalysisRequestHandler_output.xml, 
> 110303_FieldAnalysisRequestHandler_view.png
>
>
> The XML output (simplified example attached) is missing one small piece of 
> information that could be very useful for building a nice analysis output: the 
> "token relation" (if there is a special/correct word for this, please correct 
> me).
> That is, it is currently not possible to "follow" the analysis process 
> completely when the tokenizers/filters drop tokens (e.g. StopWord) 
> or split one into multiple tokens (e.g. WordDelimiter).
> Would it be possible to include this information? If so, it would be possible 
> to create an improved analysis page for the new Solr Admin (SOLR-2399); a 
> short scribble is attached.


        
