That's right. The TextInputFormat handles situations where records cross split boundaries. What your mapper will see is "whole" records.
-----Original Message----- From: maha [mailto:[email protected]] Sent: Friday, February 18, 2011 1:14 PM To: common-user Subject: Quick question Hi all, I want to check if the following statement is right: If I use TextInputFormat to process a text file with 2000 lines (each ending with \n) with 20 mappers. Then each map will have a sequence of COMPLETE LINES . In other words, the input is not split byte-wise but by lines. Is that right? Thank you, Maha
