That's strange! Are you using a proxy to connect to the internet? Is anyone filtering SSH connections?
Are you able to connect over SSH to Amazon machines? You can find more log messages in whirr.log.

On Tue, May 8, 2012 at 6:27 PM, Light Reader <[email protected]> wrote:

> Latest version. I downloaded this last night. 0.7.1
>
> Subsequent runs are even worse. Here is the output from a subsequent run
> (with reduced nodes). No clear reason why it fails, just that my cluster
> starts and is then destroyed!!
>
> [root@ip-10-118-190-121 whirr-0.7.1]# bin/whirr launch-cluster
> --cluster-name MyCluster1 --config hadoop.properties
> Bootstrapping cluster
> Configuring template
> Configuring template
> Starting 1 node(s) with roles [hadoop-jobtracker, hadoop-namenode]
> Starting 2 node(s) with roles [hadoop-datanode, hadoop-tasktracker]
> Starting 2 node(s) with roles [hadoop-datanode, hadoop-tasktracker]
> Nodes started: [[id=us-east-1/i-3912c15f, providerId=i-3912c15f,
> group=MyCluster1, name=MyCluster1-3912c15f, location=[id=us-east-1a,
> scope=ZONE, description=us-east-1a, parent=us-east-1, iso3166Codes=[US-VA],
> metadata={}], uri=null, imageId=us-east-1/ami-ab36fbc2, os=[name=null,
> family=ubuntu, version=10.04, arch=paravirtual, is64Bit=false,
> description=099720109477/ebs/ubuntu-images/ubuntu-lucid-10.04-i386-server-20110930],
> state=RUNNING, loginPort=22, hostname=ip-10-190-50-29,
> privateAddresses=[10.190.50.29], publicAddresses=[23.20.11.228],
> hardware=[id=m1.small, providerId=m1.small, name=null,
> processors=[[cores=1.0, speed=1.0]], ram=1740, volumes=[[id=null,
> type=LOCAL, size=150.0, device=/dev/sda2, durable=false,
> isBootDevice=false], [id=vol-4ccdeb23, type=SAN, size=null,
> device=/dev/sda1, durable=true, isBootDevice=true]],
> supportsImage=And(ALWAYS_TRUE,Or(isWindows(),requiresVirtualizationType(paravirtual)),ALWAYS_TRUE,Not(is64Bit())),
> tags=[]], loginUser=ubuntu, userMetadata={Name=MyCluster1-3912c15f},
> tags=[]]]
> Destroying failed nodes [us-east-1/i-3d10c35b, us-east-1/i-0712c161,
> us-east-1/i-0512c163]
> Destroyed failed nodes [us-east-1/i-3d10c35b, us-east-1/i-0712c161,
> us-east-1/i-0512c163]
> Unable to start the cluster. Terminating all nodes.
> java.io.IOException: java.util.concurrent.ExecutionException:
> java.io.IOException: Too many instance failed while bootstrapping! 1
> successfully started instances while 3 instances failed
>     at org.apache.whirr.actions.BootstrapClusterAction.doAction(BootstrapClusterAction.java:129)
>     at org.apache.whirr.actions.ScriptBasedClusterAction.execute(ScriptBasedClusterAction.java:107)
>     at org.apache.whirr.ClusterController.launchCluster(ClusterController.java:106)
>     at org.apache.whirr.cli.command.LaunchClusterCommand.run(LaunchClusterCommand.java:63)
>     at org.apache.whirr.cli.Main.run(Main.java:64)
>     at org.apache.whirr.cli.Main.main(Main.java:97)
> Caused by: java.util.concurrent.ExecutionException: java.io.IOException:
> Too many instance failed while bootstrapping! 1 successfully started
> instances while 3 instances failed
>     at java.util.concurrent.FutureTask$Sync.innerGet(FutureTask.java:222)
>     at java.util.concurrent.FutureTask.get(FutureTask.java:83)
>     at org.apache.whirr.actions.BootstrapClusterAction.doAction(BootstrapClusterAction.java:124)
>     ... 5 more
> Caused by: java.io.IOException: Too many instance failed while
> bootstrapping! 1 successfully started instances while 3 instances failed
>     at org.apache.whirr.compute.StartupProcess.call(StartupProcess.java:92)
>     at org.apache.whirr.compute.StartupProcess.call(StartupProcess.java:40)
>     at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:138)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>     at java.lang.Thread.run(Thread.java:662)
> Unable to load cluster state, assuming it has no running nodes.
> java.io.FileNotFoundException: /root/.whirr/MyCluster1/instances (No such
> file or directory)
>     at java.io.FileInputStream.open(Native Method)
>     at java.io.FileInputStream.<init>(FileInputStream.java:120)
>     at com.google.common.io.Files$1.getInput(Files.java:100)
>     at com.google.common.io.Files$1.getInput(Files.java:97)
>     at com.google.common.io.CharStreams$2.getInput(CharStreams.java:91)
>     at com.google.common.io.CharStreams$2.getInput(CharStreams.java:88)
>     at com.google.common.io.CharStreams.readLines(CharStreams.java:306)
>     at com.google.common.io.Files.readLines(Files.java:580)
>     at org.apache.whirr.state.FileClusterStateStore.load(FileClusterStateStore.java:54)
>     at org.apache.whirr.state.ClusterStateStore.tryLoadOrEmpty(ClusterStateStore.java:58)
>     at org.apache.whirr.ClusterController.destroyCluster(ClusterController.java:143)
>     at org.apache.whirr.ClusterController.launchCluster(ClusterController.java:118)
>     at org.apache.whirr.cli.command.LaunchClusterCommand.run(LaunchClusterCommand.java:63)
>     at org.apache.whirr.cli.Main.run(Main.java:64)
>     at org.apache.whirr.cli.Main.main(Main.java:97)
> Starting to run scripts on cluster for phase destroyinstances:
> Starting to run scripts on cluster for phase destroyinstances:
> Finished running destroy phase scripts on all cluster instances
> Destroying MyCluster1 cluster
> Cluster MyCluster1 destroyed
> Exception in thread "main" java.lang.RuntimeException:
> java.io.IOException: java.util.concurrent.ExecutionException:
> java.io.IOException: Too many instance failed while bootstrapping! 1
> successfully started instances while 3 instances failed
>     at org.apache.whirr.ClusterController.launchCluster(ClusterController.java:125)
>     at org.apache.whirr.cli.command.LaunchClusterCommand.run(LaunchClusterCommand.java:63)
>     at org.apache.whirr.cli.Main.run(Main.java:64)
>     at org.apache.whirr.cli.Main.main(Main.java:97)
> Caused by: java.io.IOException: java.util.concurrent.ExecutionException:
> java.io.IOException: Too many instance failed while bootstrapping! 1
> successfully started instances while 3 instances failed
>     at org.apache.whirr.actions.BootstrapClusterAction.doAction(BootstrapClusterAction.java:129)
>     at org.apache.whirr.actions.ScriptBasedClusterAction.execute(ScriptBasedClusterAction.java:107)
>     at org.apache.whirr.ClusterController.launchCluster(ClusterController.java:106)
>     ... 3 more
> Caused by: java.util.concurrent.ExecutionException: java.io.IOException:
> Too many instance failed while bootstrapping! 1 successfully started
> instances while 3 instances failed
>     at java.util.concurrent.FutureTask$Sync.innerGet(FutureTask.java:222)
>     at java.util.concurrent.FutureTask.get(FutureTask.java:83)
>     at org.apache.whirr.actions.BootstrapClusterAction.doAction(BootstrapClusterAction.java:124)
>     ... 5 more
> Caused by: java.io.IOException: Too many instance failed while
> bootstrapping! 1 successfully started instances while 3 instances failed
>     at org.apache.whirr.compute.StartupProcess.call(StartupProcess.java:92)
>     at org.apache.whirr.compute.StartupProcess.call(StartupProcess.java:40)
>     at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:138)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>     at java.lang.Thread.run(Thread.java:662)
>
>
> On Tue, May 8, 2012 at 11:05 AM, Andrei Savu <[email protected]> wrote:
>
>> What version of Whirr are you using? Are you seeing this error every time
>> you start a cluster?
>>
>> You can get the latest release from:
>> https://www.apache.org/dyn/closer.cgi/whirr/
>>
>> Cheers,
>>
>> -- Andrei Savu / axemblr.com / Tools for Clouds
>>
>>
>> On Tue, May 8, 2012 at 5:06 PM, Light Reader <[email protected]> wrote:
>>
>>> Hi,
>>> Newbie alert.
>>>
>>> I am trying to launch a 5 node cluster with Whirr. I get the following
>>> message during a launch:
>>>
>>> Unexpected error while starting 4 nodes, minimum 4 nodes for
>>> [hadoop-datanode, hadoop-tasktracker] of cluster HCluster1
>>> *java.util.concurrent.ExecutionException:
>>> java.lang.IllegalArgumentException: no private key configured for:
>>> [region=us-east-1, name=jclouds#HCluster1#us-east-1#26]; please use
>>> options.overrideLoginCredentialWith(rsa_private_text)*
>>>     at java.util.concurrent.FutureTask$Sync.innerGet(FutureTask.java:222)
>>>     at java.util.concurrent.FutureTask.get(FutureTask.java:83)
>>>     at org.apache.whirr.compute.StartupProcess.waitForOutcomes(StartupProcess.java:129)
>>>     at org.apache.whirr.compute.StartupProcess.call(StartupProcess.java:82)
>>>     at org.apache.whirr.compute.StartupProcess.call(StartupProcess.java:40)
>>>     at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
>>>     at java.util.concurrent.FutureTask.run(FutureTask.java:138)
>>>     at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
>>>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
>>>     at java.lang.Thread.run(Thread.java:662)
>>>
>>> While the cluster eventually does start up, I am unable to do a hadoop fs
>>> -ls. Before I debug that further, I am wondering if the error message above
>>> is harmless or maybe the root cause of some of my issues.
>>>
>>> My hadoop.properties: (I did create the two env variables and also did
>>> the RSA key generation)
>>>
>>> whirr.cluster-name=HCluster
>>> whirr.instance-templates=1 hadoop-jobtracker+hadoop-namenode,4 hadoop-datanode+hadoop-tasktracker
>>> whirr.provider=aws-ec2
>>> whirr.identity=${env:AWS_ACCESS_KEY_ID}
>>> whirr.credential=${env:AWS_SECRET_ACCESS_KEY}
>>> whirr.private-key-file=${sys:user.home}/.ssh/id_rsa
>>> whirr.public-key-file=${sys:user.home}/.ssh/id_rsa.pub
>>> whirr.cluster-user=huser
>>>
>>> Any help is appreciated.
>>>
>>> Thx,
>>> LR
>>>
>>
>>
>
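
One way to check whether SSH traffic is being filtered, as asked at the top of this message, is to test whether TCP port 22 on a node's public address is reachable from the machine that runs Whirr. The following is only a minimal sketch, assuming Python 3 is available on that machine; the function name `ssh_port_reachable` and the command-line usage are illustrative and not part of Whirr. A successful TCP connection only shows that the port is not blocked; it does not verify that key-based login works.

```python
import socket
import sys


def ssh_port_reachable(host, port=22, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    This checks network reachability only (proxy/firewall filtering),
    not SSH authentication.
    """
    try:
        # create_connection resolves the host and attempts a TCP connect.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution errors.
        return False


if __name__ == "__main__":
    # Hosts are taken from the command line (hypothetical usage);
    # pass the public address of a running node.
    for target in sys.argv[1:]:
        status = "reachable" if ssh_port_reachable(target) else "NOT reachable"
        print(f"{target}:22 is {status}")
```

Saved under a name of your choosing, it can be run with the public address of a node while the cluster is still up. If the port is reported as not reachable, a proxy, a local firewall, or the EC2 security group is a likely place to look before digging further into whirr.log.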
