Won't setting the input split size to the size of the file achieve this?
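
A rough sketch of why that might work, assuming the split size is chosen as max(minSize, min(maxSize, blockSize)) and that a trailing remainder of up to 10% is folded into the last split, as FileInputFormat does. This is plain Java with no Hadoop dependency; the class SplitSketch and its method names are made up for illustration and are not part of Hadoop. The splittable flag stands in for what isSplitable() returns.

```java
public class SplitSketch {
    // Slack factor: the last split is allowed to be up to 10% larger than splitSize.
    static final double SPLIT_SLOP = 1.1;

    // Split size formula: max(minSize, min(maxSize, blockSize)).
    public static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    // Counts how many splits (and therefore map tasks) one non-empty file produces.
    public static int countSplits(long fileLength, long splitSize, boolean splittable) {
        if (fileLength <= 0 || splitSize <= 0) {
            throw new IllegalArgumentException("fileLength and splitSize must be positive");
        }
        if (!splittable) {
            return 1; // a non-splittable file always becomes a single split
        }
        int splits = 0;
        long remaining = fileLength;
        while ((double) remaining / splitSize > SPLIT_SLOP) {
            splits++;
            remaining -= splitSize;
        }
        if (remaining != 0) {
            splits++; // whatever is left forms the final split
        }
        return splits;
    }

    public static void main(String[] args) {
        long block = 64L * 1024 * 1024;   // hypothetical 64 MB block size
        long file = 200L * 1024 * 1024;   // hypothetical 200 MB input file

        long defaultSize = computeSplitSize(block, 1L, Long.MAX_VALUE);
        long raisedSize = computeSplitSize(block, file, Long.MAX_VALUE);

        System.out.println("default min split size: " + countSplits(file, defaultSize, true));
        System.out.println("min split size = file size: " + countSplits(file, raisedSize, true));
        System.out.println("isSplitable() == false: " + countSplits(file, defaultSize, false));
    }
}
```

Note that the minimum split size (mapred.min.split.size in 0.20) applies to the whole job, so it would have to be at least as large as the biggest input file. Returning false from isSplitable() avoids that dependency on file sizes. Either way, a line-oriented format still hands the mapper one line per record; only the number of map tasks per file changes.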


On Mar 11, 2010, at 1:15 AM, HypOo <[email protected]> wrote:


I have the same problem: I need to assign a whole file to each map, but I don't
know how to do that.
I've tried creating a new WholeFileFormat class and overriding the
isSplitable() method, but it doesn't seem to work.
Have you managed to do this?
I'm using Hadoop 0.20.2.



stolikp wrote:

I've got some text files in my input directory, and I want to pass each text file (the whole file, not just a line) to a map (one file per map). How can I do this? TextInputFormat splits the text into lines, and I do not want this to happen.
I tried:
http://hadoop.apache.org/common/docs/r0.20./streaming.html#How+do+I+process+files%2C+one+per+map%3F
but it doesn't work for me: the compiler doesn't know what
NonSplitableTextInputFormat.class is.
I'm using Hadoop 0.20.1.


--
View this message in context: 
http://old.nabble.com/Passing-whole-text-file-to-a-single-map-tp27287649p27860526.html
Sent from the Hadoop core-user mailing list archive at Nabble.com.
