A problem about splitting a large file into several FileSplits

caoyuzhong Tue, 15 Jul 2008 03:17:43 -0700

Hi,

A large file is split into several FileSplits in
FileInputFormat.java#getSplits().
FileInputFormat presents a byte-oriented view of the input file, so a
whole record (for instance, a line) might be broken at a split boundary
when several FileSplits are generated for a single file. One part of the
record would then be in one InputSplit and the rest in another, and the
two InputSplits might be processed on different nodes.
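
To make the concern concrete, here is a minimal sketch of purely
byte-oriented splitting. It is not Hadoop's code: the class SplitDemo, the
method byteSplits, and the 16-byte split size are all made up for
illustration, and the input is a tiny in-memory buffer rather than a real
file.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

public class SplitDemo {
    // Hypothetical helper: cut the range [0, length) into consecutive
    // byte ranges of at most splitSize bytes. Each entry is {start, length}.
    // Record (line) boundaries are deliberately ignored.
    static List<long[]> byteSplits(long length, long splitSize) {
        List<long[]> splits = new ArrayList<long[]>();
        for (long start = 0; start < length; start += splitSize) {
            splits.add(new long[] {start, Math.min(splitSize, length - start)});
        }
        return splits;
    }

    // Return the bytes covered by one split as text.
    static String slice(byte[] data, long[] split) {
        return new String(data, (int) split[0], (int) split[1],
                          StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) {
        byte[] data = "first line\nsecond line\nthird line\n"
                .getBytes(StandardCharsets.US_ASCII);
        // Print each split with newlines made visible as "\n".
        for (long[] s : byteSplits(data.length, 16)) {
            System.out.println("[" + s[0] + ", " + (s[0] + s[1]) + "): "
                    + slice(data, s).replace("\n", "\\n"));
        }
    }
}
```

With this input the first range ends in the middle of "second line" (it
stops at "secon") and the second range starts with "d line", so neither
range contains that record in full.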


How does Hadoop handle this problem?

Yu zhong
2008/07/15


