You should use language detection processor factory, like below:
<processor
class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory">
<str name="langid.fl">content</str>
<str name="langid.langField">language</str>
<str name="langid.fallback">en</str>
*<str name="langid.map">true</str>
<str name="langid.map.fl">content,fullname</str>*
<str name="langid.map.keepOrig">true</str>
<str name="langid.whitelist">en,fr,de,es,ru,it</str>
<str name="langid.threshold">0.7</str>
</processor>
Once you have defined fields like content_en, content_fr etc., they will be
filled in automatically according to the recognized language
See http://wiki.apache.org/solr/LanguageDetection
--
View this message in context:
http://lucene.472066.n3.nabble.com/language-specific-fields-of-text-tp3698985p4031180.html
Sent from the Solr - User mailing list archive at Nabble.com.