Hi, Can Hadoop run Map/Reduce directly on files in a local file system and would this make sense?
Seems like there is a tradeoff to be made when you have to process lots and lots of little files. The tradeoff is the average size of the files. If they are small (under 10k in size) and there are thousands of them, would it make sense to process the files directly from the local file system via Map/Reduce? Is there a mode in Hadoop to do this? Does Hadoop make sense to use in this case? Thanks -- View this message in context: http://www.nabble.com/Using-Map-Reduce-without-HDFS--tf4331338.html#a12335598 Sent from the Hadoop Users mailing list archive at Nabble.com.
