On Fri, Dec 17, 2010 at 2:02 AM, Sheeba George wrote:
> This must be a simple question . But somehow I am not able to get it to
> work.
> I have a text file which has ISO Latin characters like "CancĂșn".
> The mapper is taking "Text" as the input value.
>
> public
> void map(LongWritable key, Text value, OutputCollector<Text, IntWritable>
> output, Reporter reporter) throws IOException
>
> But the Latin characters are not recognized correctly and it throws a
> MalInputException when I try
> Text.validateUTF8(value.getBytes());

Is recoding your text file as UTF-8 an option?

-- 
WBR, Mikhail Yakshin

Reply via email to