Github user JoshRosen commented on the pull request:
https://github.com/apache/spark/pull/3737#issuecomment-67663937
Hmm, I tried running
```bash
./spark-ec2 \
-t m3.xlarge \
-s 2 \
-k joshrosen \
-i /Users/joshrosen/.ssh/joshrosen.pem \
--ebs-vol-size 10 \
--ebs-vol-num 2 \
-r us-west-2 \
--zone us-west-2a \
--spark-version 1.1.0 \
--swap 2048 \
--vpc-id vpc-0778a362 \
--subnet-id subnet-ebcb768e \
launch josh-benchmarking3
```
Looks like it hit some sort of race condition:
```
Setting up security groups...
Creating security group josh-benchmarking3-master
Creating security group josh-benchmarking3-slaves
Searching for existing cluster josh-benchmarking3...
Spark AMI: ami-ae6e0d9e
Launching instances...
Launched 2 slaves in us-west-2a, regid = r-1f9f8914
Launched master in us-west-2a, regid = r-d49187df
Waiting for cluster to enter 'ssh-ready' state.ERROR:boto:400 Bad Request
ERROR:boto:<?xml version="1.0" encoding="UTF-8"?>
<Response><Errors><Error><Code>InvalidInstanceID.NotFound</Code><Message>The instance ID 'i-92214398' does not exist</Message></Error></Errors><RequestID>22b657b9-f270-4795-a268-7a6bb3453947</RequestID></Response>
Traceback (most recent call last):
  File "./spark_ec2.py", line 1173, in <module>
    main()
  File "./spark_ec2.py", line 1165, in main
    real_main()
  File "./spark_ec2.py", line 1019, in real_main
    cluster_state='ssh-ready'
  File "./spark_ec2.py", line 714, in wait_for_cluster_state
    i.update()
  File "/Users/joshrosen/Documents/spark/ec2/lib/boto-2.34.0/boto/ec2/instance.py", line 413, in update
    rs = self.connection.get_all_reservations([self.id], dry_run=dry_run)
  File "/Users/joshrosen/Documents/spark/ec2/lib/boto-2.34.0/boto/ec2/connection.py", line 682, in get_all_reservations
    [('item', Reservation)], verb='POST')
  File "/Users/joshrosen/Documents/spark/ec2/lib/boto-2.34.0/boto/connection.py", line 1182, in get_list
    raise self.ResponseError(response.status, response.reason, body)
EC2ResponseError: EC2ResponseError: 400 Bad Request
<?xml version="1.0" encoding="UTF-8"?>
<Response><Errors><Error><Code>InvalidInstanceID.NotFound</Code><Message>The instance ID 'i-92214398' does not exist</Message></Error></Errors><RequestID>22b657b9-f270-4795-a268-7a6bb3453947</RequestID></Response>
```
I tried passing the `--resume` flag and the launch seems to be
proceeding, so maybe this was just a transient error that's unrelated to this
patch. I don't know for sure, though. Have you seen this one before?
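
If it does turn out to be a race between launching the instances and the first `i.update()` call, one possible mitigation is to retry while EC2 still reports the instance as unknown. The following is only a rough sketch, not what `spark_ec2.py` does: `update_with_retry` and its parameters are hypothetical names, and it assumes the raised exception exposes an `error_code` attribute the way boto's `EC2ResponseError` does.

```python
import time


def update_with_retry(update, attempts=5, delay=1.0, sleep=time.sleep):
    """Call update(), retrying while EC2 says the instance does not exist yet.

    `update` is any zero-argument callable, for example `i.update` on a
    boto instance object. All names here are illustrative.
    """
    for attempt in range(1, attempts + 1):
        try:
            return update()
        except Exception as exc:
            # Assumption: the exception carries an `error_code` attribute,
            # as boto's EC2ResponseError does.
            code = getattr(exc, "error_code", None)
            if code != "InvalidInstanceID.NotFound" or attempt == attempts:
                raise  # unrelated error, or out of retries
            sleep(delay * attempt)  # linear backoff before the next try
```

Inside the wait loop this would be used as `update_with_retry(i.update)` in place of the bare `i.update()`; any other error, or a `NotFound` that persists past the last attempt, is still raised unchanged.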