Hi All,I am trying to detect the language of the business name filed and the
address field. I am using Solr's lang Detect(Google Library) , not Tika. It
works ok in most of the cases but in some it detects the language
wrongly.For an example the document -"OrgName": "EXPLOITS VALLEY
HIGHGREENWOOD",        "StreetLine1": "19 GREENWOOD AVE",       
"StreetLine2": "",        "SOrgName": "EXPLOITS VALLEY HIGHGREENWOOD",       
"StandardizedStreetLine1": "19 GREENWOOD AVE",        "language_s": [         
"de"        ]Language is detected as German(de) here , which is wrong.Below
is my
configuration-+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
                              
OrgName,StreetLine1,StreetLine2,SOrgName,StandardizedStreetLine1        
language_s               0.9                      en                            
         
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++Why
there is an issue?Why the language detection is wrong ?Please help !Vibhor 



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Issues-with-Language-detection-in-Solr-tp4096433.html
Sent from the Solr - User mailing list archive at Nabble.com.

Reply via email to