Hello,

I had some differences in solr score between solr 3.1 and solr 4.
I have a searchfield with the following type:

<fieldType name="text_delimiter" class="solr.TextField">
      <analyzer type="index">
        <charFilter class="solr.HTMLStripCharFilterFactory"/>
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.ASCIIFoldingFilterFactory"/>
        <filter class="solr.TrimFilterFactory"/>
        <filter class="solr.WordDelimiterFilterFactory"
generateWordParts="1" catenateWords="1" splitOnCaseChange="0"
splitOnNumerics="0" stemEnglishPossessive="0" />
      </analyzer>
      <analyzer type="query">
        <charFilter class="solr.HTMLStripCharFilterFactory"/>
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
        <filter class="solr.LowerCaseFilterFactory"/>
        <filter class="solr.ASCIIFoldingFilterFactory"/>
        <filter class="solr.TrimFilterFactory"/>
        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
catenateWords="0" splitOnCaseChange="0" splitOnNumerics="0"
stemEnglishPossessive="0" />
      </analyzer>
    </fieldType>


An example of fieldnorms:

SearchTerm = *barcelona*

solr 3.1:
fc *barcelona* soccer club -> 0.5
fc-*barcelona* soccer club -> 0.5

solr 4:
fc *barcelona* soccer club -> 0.5
fc-*barcelona* soccer club -> 0.4375

It could be the catenateWords of the fieldtype conf:
fc,barcelona,fcbarcelona,soccer,club(5 terms = 0.4375)
Strange that in solr 3.1 it was just counting for 4 terms with the same
filter.

Why is fieldnorm different? I need some help with this:)

Thanks 
Roy







--
View this message in context: 
http://lucene.472066.n3.nabble.com/Fieldnorm-solr-4-specialchars-worddelimiter-tp4036248.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to