Hi,

I am using Solr 6.6 and trying to test automatic language detection. I've
added the following configuration to my solrconfig.xml:

    <updateRequestProcessorChain name="langid">
      <processor class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory">
        <lst name="invariants">
          <str name="langid.fl">content</str>
          <str name="langid.whitelist">en,tr</str>
          <str name="langid.langField">language_code</str>
          <str name="langid.fallback">other</str>
          <bool name="langid.map">true</bool>
          <bool name="langid.map.keepOrig">true</bool>
        </lst>
      </processor>
      <processor class="solr.LogUpdateProcessorFactory" />
      <processor class="solr.RunUpdateProcessorFactory" />
    </updateRequestProcessorChain>
...
  <requestHandler name="/update/extract"
                  startup="lazy"
                  class="solr.extraction.ExtractingRequestHandler" >
    <lst name="defaults">
      <str name="lowernames">true</str>
      <str name="captureAttr">true</str>
      <str name="fmap.meta">ignored_</str>
      <str name="fmap.content">content</str>
      <str name="fmap.div">ignored_</str>
      <str name="fmap.a">ignored_</str>
    </lst>
    <lst name="invariants">
      <str name="update.chain">dedupe</str>
      <str name="update.chain">langid</str>
      <str name="update.chain">ignore-commit-from-client</str>
    </lst>
  </requestHandler>

The content field is populated, but the content_en, content_tr, content_other,
and language_code fields are empty.
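
One variation that may be worth testing is to point the handler at the langid chain alone, on the assumption that only a single update.chain value is honored per request. This is an untested sketch, not a confirmed fix. Dropping the dedupe and ignore-commit-from-client entries is done here only to isolate the language detection step:

```xml
<requestHandler name="/update/extract"
                startup="lazy"
                class="solr.extraction.ExtractingRequestHandler" >
  <lst name="defaults">
    <str name="lowernames">true</str>
    <str name="captureAttr">true</str>
    <str name="fmap.meta">ignored_</str>
    <str name="fmap.content">content</str>
    <str name="fmap.div">ignored_</str>
    <str name="fmap.a">ignored_</str>
  </lst>
  <lst name="invariants">
    <!-- Assumption: only one update.chain value takes effect per request,
         so this sketch names the langid chain by itself. -->
    <str name="update.chain">langid</str>
  </lst>
</requestHandler>
```

If language_code is populated with this variant, the processors from the other chains would presumably need to be merged into one chain rather than listed as separate update.chain entries.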

What am I missing?

Kind Regards,
Furkan KAMACI
