Did you try specifying encoding for your source, for example like 
this:a1.sources.r1.inputCharset = ISO8859-1 << whatever charset you need>>?
Marina

      From: "Vishwakarma, Chhaya" <[email protected]>
 To: "[email protected]" <[email protected]> 
 Sent: Tuesday, May 12, 2015 4:20 AM
 Subject: Unicode data handling with flume
   
 <!--#yiv0710838864 _filtered #yiv0710838864 {font-family:Helvetica;panose-1:2 
11 6 4 2 2 2 2 2 4;} _filtered #yiv0710838864 {font-family:Helvetica;panose-1:2 
11 6 4 2 2 2 2 2 4;} _filtered #yiv0710838864 {font-family:Calibri;panose-1:2 
15 5 2 2 2 4 3 2 4;} _filtered #yiv0710838864 {font-family:Tahoma;panose-1:2 11 
6 4 3 5 4 4 2 4;} _filtered #yiv0710838864 {font-family:Consolas;panose-1:2 11 
6 9 2 2 4 3 2 4;}#yiv0710838864 #yiv0710838864 p.yiv0710838864MsoNormal, 
#yiv0710838864 li.yiv0710838864MsoNormal, #yiv0710838864 
div.yiv0710838864MsoNormal 
{margin:0in;margin-bottom:.0001pt;font-size:11.0pt;font-family:"Calibri", 
"sans-serif";}#yiv0710838864 a:link, #yiv0710838864 
span.yiv0710838864MsoHyperlink 
{color:blue;text-decoration:underline;}#yiv0710838864 a:visited, #yiv0710838864 
span.yiv0710838864MsoHyperlinkFollowed 
{color:purple;text-decoration:underline;}#yiv0710838864 p 
{margin-right:0in;margin-left:0in;font-size:12.0pt;font-family:"Times New 
Roman", "serif";}#yiv0710838864 code {font-family:"Courier New";}#yiv0710838864 
pre {margin:0in;margin-bottom:.0001pt;font-size:10.0pt;font-family:"Courier 
New";}#yiv0710838864 span.yiv0710838864EmailStyle17 {font-family:"Calibri", 
"sans-serif";color:windowtext;}#yiv0710838864 
span.yiv0710838864HTMLPreformattedChar {font-family:"Courier 
New";}#yiv0710838864 span.yiv0710838864apple-converted-space {}#yiv0710838864 
.yiv0710838864MsoChpDefault {font-family:"Calibri", "sans-serif";} _filtered 
#yiv0710838864 {margin:1.0in 1.0in 1.0in 1.0in;}#yiv0710838864 
div.yiv0710838864WordSection1 {}-->Hi all, I'm trying to put a CSV file into 
HDFS using flume, file contains some unicode characters also. Once the file is 
there in HDFS I tried to view the content, but unable to see the records 
properly. File content Name    age  sal    msg    Abc     21  1200    Lukè 
éxample àpple    Xyz     23  1400    er stîget ûf mit grôzer Output in console 
I did hdfs dfs -get /flume/events/csv/events.1234567 Below is the output 
Name,age,sal,msg    Abc,21,1200,Luk��xample��pple    Xyz,23,1400,er st�get�f 
mit gr�zer Does flume supports Unicode characters? If not how it can be 
handled? Flume version is 1.4.0    

  

Reply via email to