Sean Bigdatafun Sat, 12 Feb 2011 23:36:35 -0800
Can I directly use files on S3 as input of my mapreduce program? Or should I discp the files on S3 to my HDFS first? -- I am asking data crunching on a Hadoop cluster installed on Amazon EC2.
Thanks, --Sean