[ 
https://issues.apache.org/jira/browse/SOLR-11859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jaap de Jong updated SOLR-11859:
--------------------------------
    Description: 
I'm using the CloneFieldUpdateProcessorFactory to copy content from all string 
fields _except some predefined fields_ to a multivalued "text_final" field. 
This seems to work; however, each value is prefixed with "\{set=" and 
suffixed with "}".

Expected result:
 All the original values are cloned into the multivalued field +without 
"\{set=}".+  

In my schema this field is defined as:

{{<field name="text_final" type="text_nl" indexed="true" stored="true" multiValued="true"/>}}

The fieldType is defined as:

{{<fieldType name="text_nl" class="solr.TextField" positionIncrementGap="100">}}
 {{<analyzer>}}
 {{<charFilter class="solr.PatternReplaceCharFilterFactory" pattern='>' replacement=' ' />}}
 {{<tokenizer class="solr.StandardTokenizerFactory"/>}}
 {{<filter class="solr.LowerCaseFilterFactory"/>}}
 {{<filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_nl.txt" format="snowball"/>}}
 {{<filter class="solr.HunspellStemFilterFactory" dictionary="lang/nederlands/nl_NL.dic" affix="lang/nederlands/nl_NL.aff" ignoreCase="true"/>}}
 {{</analyzer>}}
 {{</fieldType>}}

In my updateRequestProcessorChain the processor is defined as:

{{<processor class="solr.CloneFieldUpdateProcessorFactory">}}
 {{<lst name="source">}}
 {{<str name="fieldRegex">s_.*</str>}}
 {{<lst name="exclude">}}
 {{<str name="fieldName">s_description</str>}}
 {{<str name="fieldName">s_image_link</str>}}
 {{<str name="fieldName">s_link</str>}}
 {{</lst>}}
 {{</lst>}}
 {{<str name="dest">text_final</str>}}
 {{</processor>}}
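
A possible explanation, which this report does not confirm: the incoming documents may be atomic updates, where each field value arrives as a map such as \{"set": "..."\}. Java's Map.toString() renders such a map as \{set=...\}, which matches the wrapper seen in "text_final". The sketch below only demonstrates that string form in plain Java. The class and helper names (AtomicSetSketch, naiveClone, unwrapSet) are hypothetical and are not Solr APIs.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class AtomicSetSketch {

    // Hypothetical helper: copies a value the naive way, via its string form.
    static String naiveClone(Object value) {
        return String.valueOf(value);
    }

    // Hypothetical helper: unwraps a single-entry {"set": x} map before copying.
    // Any other value is returned unchanged.
    static Object unwrapSet(Object value) {
        if (value instanceof Map) {
            Map<?, ?> map = (Map<?, ?>) value;
            if (map.size() == 1 && map.containsKey("set")) {
                return map.get("set");
            }
        }
        return value;
    }

    public static void main(String[] args) {
        // Assumed shape of an atomic-update field value: {"set": "rode fiets"}
        Map<String, Object> atomic = new LinkedHashMap<>();
        atomic.put("set", "rode fiets");

        System.out.println(naiveClone(atomic));            // {set=rode fiets}
        System.out.println(naiveClone(unwrapSet(atomic))); // rode fiets
    }
}
```

If this is what happens, the source fields would still contain the atomic-update maps at the point where the clone processor runs, so checking the position of the processor in the chain may be a useful first step.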

  was:
I'm using the CloneFieldUpdateProcessorFactory to copy content from all string 
fields _except some predefined fields_ to a multivalued "text_final" field. 
This seems to work; however, each value is prefixed with "\{set=" and 
suffixed with "}".

Expected result:
All the original values are cloned into the multivalued field +without 
"\{set=}".+  

In my schema this field is defined as:

{{<field name="text_final" type="text_nl" indexed="true" stored="true" multiValued="true"/>}}

The fieldType is defined as:

{{<fieldType name="text_nl" class="solr.TextField" positionIncrementGap="100">}}
{{<analyzer>}}
{{<charFilter class="solr.PatternReplaceCharFilterFactory" pattern='>' replacement=' ' />}}
{{<charFilter class="solr.PatternReplaceCharFilterFactory" pattern="\\{set=(.*)\}" replacement="$1" />}}
{{<tokenizer class="solr.StandardTokenizerFactory"/>}}
{{<filter class="solr.LowerCaseFilterFactory"/>}}
{{<filter class="solr.StopFilterFactory" ignoreCase="true" words="lang/stopwords_nl.txt" format="snowball"/>}}
{{<filter class="solr.HunspellStemFilterFactory" dictionary="lang/nederlands/nl_NL.dic" affix="lang/nederlands/nl_NL.aff" ignoreCase="true"/>}}
{{</analyzer>}}
{{</fieldType>}}
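
The second charFilter in this earlier version looks like a workaround that strips the "\{set=...}" wrapper during analysis. As a rough illustration of what such a pattern does to a single value, here is a plain-Java sketch. It assumes the intended regex is \{set=(.*)\} (the extra backslash in the text above is taken to be JIRA escaping), and the class and method names are hypothetical. Note that a char filter only affects the indexed tokens, so the stored value would still show the wrapper.

```java
public class SetWrapperRegexSketch {

    // Greedy variant, assumed to match the pattern used in the charFilter.
    static String stripGreedy(String value) {
        return value.replaceAll("\\{set=(.*)\\}", "$1");
    }

    // Reluctant variant: stops at the first closing brace after "{set=".
    static String stripLazy(String value) {
        return value.replaceAll("\\{set=(.*?)\\}", "$1");
    }

    public static void main(String[] args) {
        // A single wrapped value is unwrapped by both variants.
        System.out.println(stripGreedy("{set=rode fiets}")); // rode fiets

        // With two wrappers in one string the greedy group spans both.
        System.out.println(stripGreedy("{set=a} {set=b}"));  // a} {set=b
        System.out.println(stripLazy("{set=a} {set=b}"));    // a b
    }
}
```

The greedy form is sufficient as long as every value holds exactly one wrapper, which is the case described in this report.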

In my updateRequestProcessorChain the processor is defined as:

{{<processor class="solr.CloneFieldUpdateProcessorFactory">}}
{{<lst name="source">}}
{{<str name="fieldRegex">s_.*</str>}}
{{<lst name="exclude">}}
{{<str name="fieldName">s_description</str>}}
{{<str name="fieldName">s_image_link</str>}}
{{<str name="fieldName">s_link</str>}}
{{</lst>}}
{{</lst>}}
{{<str name="dest">text_final</str>}}
{{</processor>}}


> CloneFieldUpdateProcessorFactory should not add {set=} to content when cloned 
> to multivalued field
> --------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-11859
>                 URL: https://issues.apache.org/jira/browse/SOLR-11859
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: UpdateRequestProcessors
>    Affects Versions: 7.2
>            Reporter: Jaap de Jong
>            Priority: Minor


