For the second problem, just start Hadoop MapReduce before running distcp:

    /root/ephemeral-hadoop/bin/start-all.sh
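
A minimal sketch of the whole sequence, in case it helps. The `start-all.sh` path is the one given above; the `hadoop` binary is assumed to sit in the same `bin` directory. The bucket name, the source prefix, and the HDFS destination are hypothetical placeholders, and the S3 credentials are assumed to be configured already (for example through `fs.s3n.awsAccessKeyId` and `fs.s3n.awsSecretAccessKey` in `core-site.xml`). The script only prints the commands unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Sketch: start MapReduce on the ephemeral Hadoop install, then run distcp.
HADOOP_HOME=/root/ephemeral-hadoop   # install path taken from the command above

# Print the command when DRY_RUN is 1 (the default here), otherwise execute it.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$@"
    else
        "$@"
    fi
}

# copy_from_s3 SRC DST: start the Hadoop daemons, then copy SRC to DST.
copy_from_s3() {
    run "$HADOOP_HOME/bin/start-all.sh" &&
    run "$HADOOP_HOME/bin/hadoop" distcp "$1" "$2"
}

# Hypothetical bucket and destination; replace with real values.
copy_from_s3 "s3n://my-bucket/input" "hdfs:///input"
```

Run it once as-is to check the printed commands, then again with `DRY_RUN=0` on the master node to perform the copy.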

On Sat, Jan 4, 2014 at 12:54 PM, Guillaume Pitel <[email protected]> wrote:

> Hi,
>
> I'm taking my first steps on EC2 (using the 0.8.1 binaries for CDH4) and
> have run into some problems. The first is that once the cluster is created,
> the script cannot find it again for login, destroy, and so on. Not a big
> deal, since I can do that manually, but it's annoying.
>
> The second problem is not really related to Spark but to HDFS/MapReduce. I
> want to run a hadoop distcp from S3 to the local ephemeral HDFS. The
> distcp fails because there's no MapReduce running.
>
> Questions:
>
> - Does anyone have advice about a better way to copy from S3 to HDFS, or a
>   way to make distcp work?
> - Any idea why spark-ec2 cannot find the clusters again?
>
> Thanks in advance for any experience and advice!
>
> Guillaume
> --
> *Guillaume PITEL, Président*
> +33(0)6 25 48 86 80 / +33(0)9 70 44 67 53
>
> eXenSa S.A.S. <http://www.exensa.com/>
> 41, rue Périer - 92120 Montrouge - FRANCE
> Tel +33(0)1 84 16 36 77 / Fax +33(0)9 72 28 37 05
