I have a tab delimited text file and I read using TextInputFormat. I have problems reading lines from the txt file with ascii code > 127 e.g.
P 676827 Martin Plachý amg gets read as P 676827 Martin Plach? with missing 3rd tab delimited column. Whats the best way to handle this kind of input? thanks -- View this message in context: http://www.nabble.com/Text-encoding-tp24684865p24684865.html Sent from the Hadoop core-user mailing list archive at Nabble.com.
