[jira] Commented: (SOLR-314) Store Analyzed token text from an incoming SolrInputDocument

Ryan McKinley (JIRA) Sat, 21 Jul 2007 11:51:27 -0700

    [ 
https://issues.apache.org/jira/browse/SOLR-314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12514432
 ]


Ryan McKinley commented on SOLR-314:
------------------------------------

Right, the point of this is to process *stored* fields.  Any documentation for 
this would make the purpose clear and suggest that you will have more 
flexibility doing the processing on the client side.

I need to find a user configurable way to  have someone process incoming 
fields.  In some cases that is splitting them into multiple tokens, but in 
others it is doing things like 'toLowerCase' and remove duplicates.  Rather 
then build my own interface for this, It would be great to use the existing 
configurable analyzer framework.

If this is something that ought to stay of of core, I'm fine with that.  But it 
does feel generally useful.



> Store Analyzed token text from an incoming SolrInputDocument
> ------------------------------------------------------------
>
>                 Key: SOLR-314
>                 URL: https://issues.apache.org/jira/browse/SOLR-314
>             Project: Solr
>          Issue Type: New Feature
>          Components: update
>            Reporter: Ryan McKinley
>         Attachments: SOLR-314-StoreAnalysis.patch
>
>
> This is an UpdateRequestProcessor that runs incoming fields through a Field 
> Analyzer and stores the output of each token as a field value.
> For Example.  If you have a field type defined:
>   <fieldType name="text_ws" class="solr.TextField" >
>       <analyzer>
>         <tokenizer class="solr.WhitespaceTokenizerFactory"/>
>       </analyzer>
>   </fieldType>
> And send a request:
> /update?store.analysis=true&f.feature.analysis=text_ws
> <add> <doc>
>  <field name="feature">aaa bbb ccc</field>
> </doc></add>
> The returned document will look like:
> <doc>
>  <arr name="feature">
>   <str>aaa</str>
>   <str>bbb</str>
>   <str>ccc</str>
>  </arr>
> </doc>

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (SOLR-314) Store Analyzed token text from an incoming SolrInputDocument

Reply via email to