Github user JoshRosen commented on the pull request:

    https://github.com/apache/spark/pull/3737#issuecomment-67663937
  
    Hmm, I tried running
    
    ```bash
    ./spark-ec2 \
      -t m3.xlarge \
      -s 2 \ 
      -k joshrosen \ 
      -i /Users/joshrosen/.ssh/joshrosen.pem \ 
      --ebs-vol-size 10 \
      --ebs-vol-num 2 \
      -r us-west-2 \
      --zone us-west-2a \
      --spark-version 1.1.0 \
      --swap 2048 \
      --vpc-id vpc-0778a362 \ 
      --subnet-id subnet-ebcb768e \
      launch josh-benchmarking3
    ```
    
    Looks like it hit some sort of race-condition:
    
    ```
    Setting up security groups...
    Creating security group josh-benchmarking3-master
    Creating security group josh-benchmarking3-slaves
    Searching for existing cluster josh-benchmarking3...
    Spark AMI: ami-ae6e0d9e
    Launching instances...
    Launched 2 slaves in us-west-2a, regid = r-1f9f8914
    Launched master in us-west-2a, regid = r-d49187df
    Waiting for cluster to enter 'ssh-ready' state.ERROR:boto:400 Bad Request
    ERROR:boto:<?xml version="1.0" encoding="UTF-8"?>
    
<Response><Errors><Error><Code>InvalidInstanceID.NotFound</Code><Message>The 
instance ID 'i-92214398' does not 
exist</Message></Error></Errors><RequestID>22b657b9-f270-4795-a268-7a6bb3453947</RequestID></Response>
    Traceback (most recent call last):
      File "./spark_ec2.py", line 1173, in <module>
        main()
      File "./spark_ec2.py", line 1165, in main
        real_main()
      File "./spark_ec2.py", line 1019, in real_main
        cluster_state='ssh-ready'
      File "./spark_ec2.py", line 714, in wait_for_cluster_state
        i.update()
      File 
"/Users/joshrosen/Documents/spark/ec2/lib/boto-2.34.0/boto/ec2/instance.py", 
line 413, in update
        rs = self.connection.get_all_reservations([self.id], dry_run=dry_run)
      File 
"/Users/joshrosen/Documents/spark/ec2/lib/boto-2.34.0/boto/ec2/connection.py", 
line 682, in get_all_reservations
        [('item', Reservation)], verb='POST')
      File 
"/Users/joshrosen/Documents/spark/ec2/lib/boto-2.34.0/boto/connection.py", line 
1182, in get_list
        raise self.ResponseError(response.status, response.reason, body)
    EC2ResponseError: EC2ResponseError: 400 Bad Request
    <?xml version="1.0" encoding="UTF-8"?>
    
<Response><Errors><Error><Code>InvalidInstanceID.NotFound</Code><Message>The 
instance ID 'i-92214398' does not 
exist</Message></Error></Errors><RequestID>22b657b9-f270-4795-a268-7a6bb3453947</RequestID></Response>
    ```
    
    I've tried passing the `--resume` flag and the launch seems to be 
proceeding, so maybe this was just a transient error that's not related to this 
patch.  Don't know for sure, though.  Have you see this one before?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to