Yes, but, my point is that it doesn't quite make sense to do such a thing in
MapReduce. Only one mapper will see the header, but, presumably all mappers
need that info. If it's a bit of metadata, pass it in the Configuration
object as a String. If it's a lot, put it in the DistributedCache (or on
HDFS and pass the location for mappers to read).

On Wed, Jul 6, 2011 at 11:23 AM, Xiaobo Gu <[email protected]> wrote:

> Hi Sean,
>
>     Thanks for your reply first, so we must wirte specific code to
> handle the CSV header if we have it in the file, right?
>
> Xiaobu Gu
>
>

Reply via email to