I did a quick grep through the directory listing of the Solr 3.1 source, the only part of your analysis chain that came up empty was HTMLStripWhitespaceTokenizerFactory. I think you'll have to replace it with something like this:

        <charFilter class="solr.HTMLStripCharFilterFactory"/>
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>

Also, the ISOLatin1AccentFilterFactory is deprecated, replaced with ASCIIFoldingFilterFactory. It's still around, but could be removed at any time.

Shawn


On 11/18/2010 4:56 AM, Moritz Krinke wrote:
Hello,

i have a running solr 1.3 installation and would like to migrate it to
solr 3 in order to get speed improvements by using the multiple threads
for indexing.

When starting SOLR 3, i get the following error message:
SEVERE: org.apache.solr.common.SolrException: Unknown fieldtype 'textfc'
specified on field descr

I'm using the exact same schema.xml as with solr 1.3.
In the schema.xml, the fieldTyper "textfc" is specified as follows:

     <fieldType name="textfc" class="solr.TextField"
positionIncrementGap="100">
       <analyzer type="index">
         <tokenizer class="solr.HTMLStripWhitespaceTokenizerFactory"/>
         <!--<tokenizer class="solr.WhitespaceTokenizerFactory"/>  -->
         <filter class="solr.ISOLatin1AccentFilterFactory"/>
         <!--
         <filter class="solr.StopFilterFactory"
            ignoreCase="true"
            words="stopwords.txt"
            enablePositionIncrements="true"
         />
         -->
         <filter class="solr.WordDelimiterFilterFactory"
generateWordParts="1" generateNumberParts="1" catenateWords="1"
catenateNumbers="1" catenateAll="0" splitOnCaseChange="0"/>
         <filter class="solr.LowerCaseFilterFactory"/>
         <filter class="solr.TrimFilterFactory" />
         <filter class="solr.EnglishPorterFilterFactory"
protected="protwords.txt"/>
         <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       </analyzer>

      <analyzer type="query">
         <tokenizer class="solr.HTMLStripWhitespaceTokenizerFactory"/>
         <!--<tokenizer class="solr.WhitespaceTokenizerFactory"/>  -->
         <filter class="solr.ISOLatin1AccentFilterFactory"/>
         <filter class="solr.SynonymFilterFactory"
synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
         <!--<filter class="solr.StopFilterFactory" ignoreCase="true"
words="stopwords.txt"/>  -->
         <filter class="solr.WordDelimiterFilterFactory"
generateWordParts="1" generateNumberParts="1" catenateWords="0"
catenateNumbers="0" catenateAll="0" splitOnCaseChange="0"/>
         <filter class="solr.LowerCaseFilterFactory"/>
         <filter class="solr.TrimFilterFactory" />
         <filter class="solr.EnglishPorterFilterFactory"
protected="protwords.txt"/>
         <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
       </analyzer>

     </fieldType>


Any ideas why this does not work?

Thanks a lot,
Moritz



Reply via email to