You could also set your input split size to Long.MAX_VALUE.

Goel, Ankur wrote:
Nope, But if the intent is so then there are 2 ways of doing it.

1. Just extend the input format of your choice and override
isSplitable() method to return false.

2. Compress your text file using a compression format supported by
hadoop (e.g gzip). This will ensure that one map task processes 1 file
since compressed files are not split between processes.


-----Original Message-----
From: Qiong Zhang [mailto:[EMAIL PROTECTED] Sent: Tuesday, July 01, 2008 9:54 PM
To: [email protected]
Subject: one input file per map
Hi,

Is there an existing input format/split which supports one input file
(e.g. plain text) per map task?

Thanks,

James

Reply via email to