Yes, but, my point is that it doesn't quite make sense to do such a thing in MapReduce. Only one mapper will see the header, but, presumably all mappers need that info. If it's a bit of metadata, pass it in the Configuration object as a String. If it's a lot, put it in the DistributedCache (or on HDFS and pass the location for mappers to read).
On Wed, Jul 6, 2011 at 11:23 AM, Xiaobo Gu <[email protected]> wrote: > Hi Sean, > > Thanks for your reply first, so we must wirte specific code to > handle the CSV header if we have it in the file, right? > > Xiaobu Gu > >
