You should use the language detection update processor factory, configured like this:

<processor class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory">
  <str name="langid.fl">content</str>
  <str name="langid.langField">language</str>
  <str name="langid.fallback">en</str>
  <!-- these two settings map the listed fields to language-specific fields, e.g. content -> content_en -->
  <str name="langid.map">true</str>
  <str name="langid.map.fl">content,fullname</str>
  <str name="langid.map.keepOrig">true</str>
  <str name="langid.whitelist">en,fr,de,es,ru,it</str>
  <str name="langid.threshold">0.7</str>
</processor>
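
For context, the processor normally sits inside an update chain in solrconfig.xml. A rough sketch, assuming a chain named "langid" (the name is my own choice, and the two trailing processors are the usual log/run pair rather than anything specific to language detection):

<updateRequestProcessorChain name="langid">
  <!-- the <processor> element shown above goes here, before the run processor -->
  <processor class="solr.LogUpdateProcessorFactory" />
  <processor class="solr.RunUpdateProcessorFactory" />
</updateRequestProcessorChain>

The chain then has to be selected for your updates, for example by passing update.chain=langid on the update request or setting it as a default on the update handler.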

With langid.map enabled, once you have defined fields such as content_en and
content_fr in your schema, they are filled in automatically according to the
detected language.
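
As an illustration, the target fields could be declared in schema.xml roughly as follows. This is only a sketch: the field types text_en and text_fr are assumed to exist in your schema (the stock example schema ships types with these names, but yours may differ), and the indexed/stored settings are placeholders to adjust.

<!-- holds the detected language code, as named by langid.langField -->
<field name="language" type="string" indexed="true" stored="true"/>

<!-- language-specific targets for the mapped "content" field -->
<field name="content_en" type="text_en" indexed="true" stored="true"/>
<field name="content_fr" type="text_fr" indexed="true" stored="true"/>

<!-- alternatively, dynamic fields cover every mapped field at once,
     e.g. both content_en and fullname_en -->
<dynamicField name="*_en" type="text_en" indexed="true" stored="true"/>
<dynamicField name="*_fr" type="text_fr" indexed="true" stored="true"/>

The same pattern would be repeated for the other languages in langid.whitelist (de, es, ru, it).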

See http://wiki.apache.org/solr/LanguageDetection



