Check this presentation:
http://www.oscon.com/oscon2011/public/schedule/detail/19214

The ZooKeeper service should be easy to understand.

Cheers,

-- Andrei Savu / andreisavu.ro

On Thu, Aug 25, 2011 at 12:41 AM, Joris Poort <gpo...@gmail.com> wrote:
> Does anyone know a good simple whirr service code to use as a base to
> learn to write your own?  Kind of new to this so would really
> appreciate it.
>
> Cheers,
>
> Joris
>
> On Thu, Aug 25, 2011 at 12:16 AM, Joris Poort <gpo...@gmail.com> wrote:
>> Thanks for the recommendation on whirr services code, I think that
>> will work - looking into how to do it now.
>>
>> Thanks again Andrei - really appreciate your help on all this.
>>
>> Cheers,
>>
>> Joris
>>
>> On Wed, Aug 24, 2011 at 11:49 PM, Andrei Savu <savu.and...@gmail.com> wrote:
>>> It seems like for your custom ami the user scripts are not executed as 
>>> expected.
>>>
>>> How about adding your own software later as Whirr services or using
>>> bash scripts and start from a vanilla Ubuntu ami?
>>>
>>> -- Andrei Savu / andreisavu.ro
>>>
>>> On Wed, Aug 24, 2011 at 11:41 PM, Joris Poort <gpo...@gmail.com> wrote:
>>>> Thanks for offering your help Guillaume.  Unfortunately I tried that
>>>> and still have the same issues.
>>>>
>>>> On Wed, Aug 24, 2011 at 11:34 PM, tog <guillaume.all...@gmail.com> wrote:
>>>>> I don't know if this can help but here are the 2 lines that I added to
>>>>> my cluster property file in order to be able to log on the cluster:
>>>>> whirr.private-key-file=${sys:user.home}/.ssh/id_rsa
>>>>> whirr.public-key-file=${sys:user.home}/.ssh/id_rsa.pub
>>>>>
>>>>> then I was able to connect my local userid and not ubuntu.
>>>>>
>>>>> HTH
>>>>> Guillaume
>>>>>
>>>>>
>>>>> On Thu, Aug 25, 2011 at 11:34 AM, Joris Poort <gpo...@gmail.com> wrote:
>>>>>> I also tried to regenerate the ssh keys, but still no luck...
>>>>>>
>>>>>> On Wed, Aug 24, 2011 at 10:13 PM, Joris Poort <gpo...@gmail.com> wrote:
>>>>>>> Interestingly, as mentioned before I can ssh in using my keypair used
>>>>>>> to create the custom AMI.  So I'm thinking that it still has to do
>>>>>>> with whirr not being able to get the right keypair access with the
>>>>>>> local id_rsa.
>>>>>>>
>>>>>>> Thanks again for any help,
>>>>>>>
>>>>>>> Joris
>>>>>>>
>>>>>>> On Wed, Aug 24, 2011 at 10:11 PM, Joris Poort <gpo...@gmail.com> wrote:
>>>>>>>> Thanks guys.  I guess I was using "ubuntu" as user incorrectly, now
>>>>>>>> adjusted to local-user "gpoort" I'm still not getting through.  See
>>>>>>>> output below from -vv (sorry about the long email).
>>>>>>>>
>>>>>>>> gpoort@jvb:~$ ssh -vv -i .ssh/id_rsa
>>>>>>>> ubu...@ec2-184-72-86-108.compute-1.amazonaws.com
>>>>>>>> OpenSSH_5.8p1 Debian-1ubuntu3, OpenSSL 0.9.8o 01 Jun 2010
>>>>>>>> debug1: Reading configuration data /etc/ssh/ssh_config
>>>>>>>> debug1: Applying options for *
>>>>>>>> debug2: ssh_connect: needpriv 0
>>>>>>>> debug1: Connecting to ec2-184-72-86-108.compute-1.amazonaws.com
>>>>>>>> [184.72.86.108] port 22.
>>>>>>>> debug1: Connection established.
>>>>>>>> debug2: key_type_from_name: unknown key type '-----BEGIN'
>>>>>>>> debug2: key_type_from_name: unknown key type '-----END'
>>>>>>>> debug1: identity file .ssh/id_rsa type 1
>>>>>>>> debug1: Checking blacklist file /usr/share/ssh/blacklist.RSA-2048
>>>>>>>> debug1: Checking blacklist file /etc/ssh/blacklist.RSA-2048
>>>>>>>> debug1: identity file .ssh/id_rsa-cert type -1
>>>>>>>> debug1: Remote protocol version 2.0, remote software version
>>>>>>>> OpenSSH_5.3p1 Debian-3ubuntu7
>>>>>>>> debug1: match: OpenSSH_5.3p1 Debian-3ubuntu7 pat OpenSSH*
>>>>>>>> debug1: Enabling compatibility mode for protocol 2.0
>>>>>>>> debug1: Local version string SSH-2.0-OpenSSH_5.8p1 Debian-1ubuntu3
>>>>>>>> debug2: fd 3 setting O_NONBLOCK
>>>>>>>> debug1: SSH2_MSG_KEXINIT sent
>>>>>>>> debug1: SSH2_MSG_KEXINIT received
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> ecdh-sha2-nistp256,ecdh-sha2-nistp384,ecdh-sha2-nistp521,diffie-hellman-group-exchange-sha256,diffie-hellman-group-exchange-sha1,diffie-hellman-group14-sha1,diffie-hellman-group1-sha1
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> ssh-rsa-cert-...@openssh.com,ssh-rsa-cert-...@openssh.com,ssh-rsa,ecdsa-sha2-nistp256-cert-...@openssh.com,ecdsa-sha2-nistp384-cert-...@openssh.com,ecdsa-sha2-nistp521-cert-...@openssh.com,ssh-dss-cert-...@openssh.com,ssh-dss-cert-...@openssh.com,ecdsa-sha2-nistp256,ecdsa-sha2-nistp384,ecdsa-sha2-nistp521,ssh-dss
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> aes128-ctr,aes192-ctr,aes256-ctr,arcfour256,arcfour128,aes128-cbc,3des-cbc,blowfish-cbc,cast128-cbc,aes192-cbc,aes256-cbc,arcfour,rijndael-...@lysator.liu.se
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> aes128-ctr,aes192-ctr,aes256-ctr,arcfour256,arcfour128,aes128-cbc,3des-cbc,blowfish-cbc,cast128-cbc,aes192-cbc,aes256-cbc,arcfour,rijndael-...@lysator.liu.se
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> hmac-md5,hmac-sha1,umac...@openssh.com,hmac-ripemd160,hmac-ripemd...@openssh.com,hmac-sha1-96,hmac-md5-96
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> hmac-md5,hmac-sha1,umac...@openssh.com,hmac-ripemd160,hmac-ripemd...@openssh.com,hmac-sha1-96,hmac-md5-96
>>>>>>>> debug2: kex_parse_kexinit: none,z...@openssh.com,zlib
>>>>>>>> debug2: kex_parse_kexinit: none,z...@openssh.com,zlib
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> debug2: kex_parse_kexinit: first_kex_follows 0
>>>>>>>> debug2: kex_parse_kexinit: reserved 0
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> diffie-hellman-group-exchange-sha256,diffie-hellman-group-exchange-sha1,diffie-hellman-group14-sha1,diffie-hellman-group1-sha1
>>>>>>>> debug2: kex_parse_kexinit: ssh-rsa,ssh-dss
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> aes128-ctr,aes192-ctr,aes256-ctr,arcfour256,arcfour128,aes128-cbc,3des-cbc,blowfish-cbc,cast128-cbc,aes192-cbc,aes256-cbc,arcfour,rijndael-...@lysator.liu.se
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> aes128-ctr,aes192-ctr,aes256-ctr,arcfour256,arcfour128,aes128-cbc,3des-cbc,blowfish-cbc,cast128-cbc,aes192-cbc,aes256-cbc,arcfour,rijndael-...@lysator.liu.se
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> hmac-md5,hmac-sha1,umac...@openssh.com,hmac-ripemd160,hmac-ripemd...@openssh.com,hmac-sha1-96,hmac-md5-96
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> hmac-md5,hmac-sha1,umac...@openssh.com,hmac-ripemd160,hmac-ripemd...@openssh.com,hmac-sha1-96,hmac-md5-96
>>>>>>>> debug2: kex_parse_kexinit: none,z...@openssh.com
>>>>>>>> debug2: kex_parse_kexinit: none,z...@openssh.com
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> debug2: kex_parse_kexinit: first_kex_follows 0
>>>>>>>> debug2: kex_parse_kexinit: reserved 0
>>>>>>>> debug2: mac_setup: found hmac-md5
>>>>>>>> debug1: kex: server->client aes128-ctr hmac-md5 none
>>>>>>>> debug2: mac_setup: found hmac-md5
>>>>>>>> debug1: kex: client->server aes128-ctr hmac-md5 none
>>>>>>>> debug1: SSH2_MSG_KEX_DH_GEX_REQUEST(1024<1024<8192) sent
>>>>>>>> debug1: expecting SSH2_MSG_KEX_DH_GEX_GROUP
>>>>>>>> debug2: dh_gen_key: priv key bits set: 138/256
>>>>>>>> debug2: bits set: 502/1024
>>>>>>>> debug1: SSH2_MSG_KEX_DH_GEX_INIT sent
>>>>>>>> debug1: expecting SSH2_MSG_KEX_DH_GEX_REPLY
>>>>>>>> debug1: Server host key: RSA 
>>>>>>>> 77:b6:b5:95:c6:0d:3d:82:94:91:d5:7d:70:8f:b8:1b
>>>>>>>> debug1: Host 'ec2-184-72-86-108.compute-1.amazonaws.com' is known and
>>>>>>>> matches the RSA host key.
>>>>>>>> debug1: Found key in /home/gpoort/.ssh/known_hosts:7
>>>>>>>> debug2: bits set: 497/1024
>>>>>>>> debug1: ssh_rsa_verify: signature correct
>>>>>>>> debug2: kex_derive_keys
>>>>>>>> debug2: set_newkeys: mode 1
>>>>>>>> debug1: SSH2_MSG_NEWKEYS sent
>>>>>>>> debug1: expecting SSH2_MSG_NEWKEYS
>>>>>>>> debug2: set_newkeys: mode 0
>>>>>>>> debug1: SSH2_MSG_NEWKEYS received
>>>>>>>> debug1: Roaming not allowed by server
>>>>>>>> debug1: SSH2_MSG_SERVICE_REQUEST sent
>>>>>>>> debug2: service_accept: ssh-userauth
>>>>>>>> debug1: SSH2_MSG_SERVICE_ACCEPT received
>>>>>>>> debug2: key: .ssh/id_rsa (0x7f4ea3913cd0)
>>>>>>>> debug1: Authentications that can continue: publickey
>>>>>>>> debug1: Next authentication method: publickey
>>>>>>>> debug1: Offering RSA public key: .ssh/id_rsa
>>>>>>>> debug2: we sent a publickey packet, wait for reply
>>>>>>>> debug1: Authentications that can continue: publickey
>>>>>>>> debug2: we did not send a packet, disable method
>>>>>>>> debug1: No more authentication methods to try.
>>>>>>>> Permission denied (publickey).
>>>>>>>> gpoort@jvb:~$ ssh -vv -i .ssh/id_rsa
>>>>>>>> gpo...@ec2-184-72-86-108.compute-1.amazonaws.com
>>>>>>>> OpenSSH_5.8p1 Debian-1ubuntu3, OpenSSL 0.9.8o 01 Jun 2010
>>>>>>>> debug1: Reading configuration data /etc/ssh/ssh_config
>>>>>>>> debug1: Applying options for *
>>>>>>>> debug2: ssh_connect: needpriv 0
>>>>>>>> debug1: Connecting to ec2-184-72-86-108.compute-1.amazonaws.com
>>>>>>>> [184.72.86.108] port 22.
>>>>>>>> debug1: Connection established.
>>>>>>>> debug2: key_type_from_name: unknown key type '-----BEGIN'
>>>>>>>> debug2: key_type_from_name: unknown key type '-----END'
>>>>>>>> debug1: identity file .ssh/id_rsa type 1
>>>>>>>> debug1: Checking blacklist file /usr/share/ssh/blacklist.RSA-2048
>>>>>>>> debug1: Checking blacklist file /etc/ssh/blacklist.RSA-2048
>>>>>>>> debug1: identity file .ssh/id_rsa-cert type -1
>>>>>>>> debug1: Remote protocol version 2.0, remote software version
>>>>>>>> OpenSSH_5.3p1 Debian-3ubuntu7
>>>>>>>> debug1: match: OpenSSH_5.3p1 Debian-3ubuntu7 pat OpenSSH*
>>>>>>>> debug1: Enabling compatibility mode for protocol 2.0
>>>>>>>> debug1: Local version string SSH-2.0-OpenSSH_5.8p1 Debian-1ubuntu3
>>>>>>>> debug2: fd 3 setting O_NONBLOCK
>>>>>>>> debug1: SSH2_MSG_KEXINIT sent
>>>>>>>> debug1: SSH2_MSG_KEXINIT received
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> ecdh-sha2-nistp256,ecdh-sha2-nistp384,ecdh-sha2-nistp521,diffie-hellman-group-exchange-sha256,diffie-hellman-group-exchange-sha1,diffie-hellman-group14-sha1,diffie-hellman-group1-sha1
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> ssh-rsa-cert-...@openssh.com,ssh-rsa-cert-...@openssh.com,ssh-rsa,ecdsa-sha2-nistp256-cert-...@openssh.com,ecdsa-sha2-nistp384-cert-...@openssh.com,ecdsa-sha2-nistp521-cert-...@openssh.com,ssh-dss-cert-...@openssh.com,ssh-dss-cert-...@openssh.com,ecdsa-sha2-nistp256,ecdsa-sha2-nistp384,ecdsa-sha2-nistp521,ssh-dss
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> aes128-ctr,aes192-ctr,aes256-ctr,arcfour256,arcfour128,aes128-cbc,3des-cbc,blowfish-cbc,cast128-cbc,aes192-cbc,aes256-cbc,arcfour,rijndael-...@lysator.liu.se
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> aes128-ctr,aes192-ctr,aes256-ctr,arcfour256,arcfour128,aes128-cbc,3des-cbc,blowfish-cbc,cast128-cbc,aes192-cbc,aes256-cbc,arcfour,rijndael-...@lysator.liu.se
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> hmac-md5,hmac-sha1,umac...@openssh.com,hmac-ripemd160,hmac-ripemd...@openssh.com,hmac-sha1-96,hmac-md5-96
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> hmac-md5,hmac-sha1,umac...@openssh.com,hmac-ripemd160,hmac-ripemd...@openssh.com,hmac-sha1-96,hmac-md5-96
>>>>>>>> debug2: kex_parse_kexinit: none,z...@openssh.com,zlib
>>>>>>>> debug2: kex_parse_kexinit: none,z...@openssh.com,zlib
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> debug2: kex_parse_kexinit: first_kex_follows 0
>>>>>>>> debug2: kex_parse_kexinit: reserved 0
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> diffie-hellman-group-exchange-sha256,diffie-hellman-group-exchange-sha1,diffie-hellman-group14-sha1,diffie-hellman-group1-sha1
>>>>>>>> debug2: kex_parse_kexinit: ssh-rsa,ssh-dss
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> aes128-ctr,aes192-ctr,aes256-ctr,arcfour256,arcfour128,aes128-cbc,3des-cbc,blowfish-cbc,cast128-cbc,aes192-cbc,aes256-cbc,arcfour,rijndael-...@lysator.liu.se
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> aes128-ctr,aes192-ctr,aes256-ctr,arcfour256,arcfour128,aes128-cbc,3des-cbc,blowfish-cbc,cast128-cbc,aes192-cbc,aes256-cbc,arcfour,rijndael-...@lysator.liu.se
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> hmac-md5,hmac-sha1,umac...@openssh.com,hmac-ripemd160,hmac-ripemd...@openssh.com,hmac-sha1-96,hmac-md5-96
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> hmac-md5,hmac-sha1,umac...@openssh.com,hmac-ripemd160,hmac-ripemd...@openssh.com,hmac-sha1-96,hmac-md5-96
>>>>>>>> debug2: kex_parse_kexinit: none,z...@openssh.com
>>>>>>>> debug2: kex_parse_kexinit: none,z...@openssh.com
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> debug2: kex_parse_kexinit:
>>>>>>>> debug2: kex_parse_kexinit: first_kex_follows 0
>>>>>>>> debug2: kex_parse_kexinit: reserved 0
>>>>>>>> debug2: mac_setup: found hmac-md5
>>>>>>>> debug1: kex: server->client aes128-ctr hmac-md5 none
>>>>>>>> debug2: mac_setup: found hmac-md5
>>>>>>>> debug1: kex: client->server aes128-ctr hmac-md5 none
>>>>>>>> debug1: SSH2_MSG_KEX_DH_GEX_REQUEST(1024<1024<8192) sent
>>>>>>>> debug1: expecting SSH2_MSG_KEX_DH_GEX_GROUP
>>>>>>>> debug2: dh_gen_key: priv key bits set: 145/256
>>>>>>>> debug2: bits set: 505/1024
>>>>>>>> debug1: SSH2_MSG_KEX_DH_GEX_INIT sent
>>>>>>>> debug1: expecting SSH2_MSG_KEX_DH_GEX_REPLY
>>>>>>>> debug1: Server host key: RSA 
>>>>>>>> 77:b6:b5:95:c6:0d:3d:82:94:91:d5:7d:70:8f:b8:1b
>>>>>>>> debug1: Host 'ec2-184-72-86-108.compute-1.amazonaws.com' is known and
>>>>>>>> matches the RSA host key.
>>>>>>>> debug1: Found key in /home/gpoort/.ssh/known_hosts:7
>>>>>>>> debug2: bits set: 495/1024
>>>>>>>> debug1: ssh_rsa_verify: signature correct
>>>>>>>> debug2: kex_derive_keys
>>>>>>>> debug2: set_newkeys: mode 1
>>>>>>>> debug1: SSH2_MSG_NEWKEYS sent
>>>>>>>> debug1: expecting SSH2_MSG_NEWKEYS
>>>>>>>> debug2: set_newkeys: mode 0
>>>>>>>> debug1: SSH2_MSG_NEWKEYS received
>>>>>>>> debug1: Roaming not allowed by server
>>>>>>>> debug1: SSH2_MSG_SERVICE_REQUEST sent
>>>>>>>> debug2: service_accept: ssh-userauth
>>>>>>>> debug1: SSH2_MSG_SERVICE_ACCEPT received
>>>>>>>> debug2: key: .ssh/id_rsa (0x7fc32b462cd0)
>>>>>>>> debug1: Authentications that can continue: publickey
>>>>>>>> debug1: Next authentication method: publickey
>>>>>>>> debug1: Offering RSA public key: .ssh/id_rsa
>>>>>>>> debug2: we sent a publickey packet, wait for reply
>>>>>>>> debug1: Authentications that can continue: publickey
>>>>>>>> debug2: we did not send a packet, disable method
>>>>>>>> debug1: No more authentication methods to try.
>>>>>>>> Permission denied (publickey).
>>>>>>>>
>>>>>>>> On Wed, Aug 24, 2011 at 9:48 PM, Andrei Savu <savu.and...@gmail.com> 
>>>>>>>> wrote:
>>>>>>>>> You should be able to login user the same user that is running Whirr.
>>>>>>>>>
>>>>>>>>> ssh -i ~/.ssh/id_rsa local-user@server
>>>>>>>>>
>>>>>>>>> Permission denied most of the time means invalid key, user pair.
>>>>>>>>>
>>>>>>>>> It should be ok to use the keys generate by default.
>>>>>>>>>
>>>>>>>>> -- Andrei Savu / andreisavu.ro
>>>>>>>>>
>>>>>>>>> On Wed, Aug 24, 2011 at 9:01 PM, Joris Poort <gpo...@gmail.com> wrote:
>>>>>>>>>> I'm just using the defaults.  But you may be onto the problem here,
>>>>>>>>>> when I try to ssh into the node using:
>>>>>>>>>> ssh -i ~/.ssh/id_rsa ubuntu@<public dns>
>>>>>>>>>>
>>>>>>>>>> I get a permission denied.
>>>>>>>>>>
>>>>>>>>>> Any idea how to fix this?  Should I create my own set of SSH key 
>>>>>>>>>> pairs?
>>>>>>>>>>
>>>>>>>>>> Thanks again,
>>>>>>>>>>
>>>>>>>>>> Joris
>>>>>>>>>>
>>>>>>>>>> On Wed, Aug 24, 2011 at 8:42 PM, Andrei Savu <savu.and...@gmail.com> 
>>>>>>>>>> wrote:
>>>>>>>>>>> Are you specifying a new set of SSH key pairs in the Whirr 
>>>>>>>>>>> properties file?
>>>>>>>>>>>
>>>>>>>>>>> If not by default Whirr will use the keys found in ~/.ssh - id_rsa 
>>>>>>>>>>> & id_rsa.pub.
>>>>>>>>>>>
>>>>>>>>>>> -- Andrei Savu / andreisavu.ro
>>>>>>>>>>>
>>>>>>>>>>> On Wed, Aug 24, 2011 at 8:02 PM, Joris Poort <gpo...@gmail.com> 
>>>>>>>>>>> wrote:
>>>>>>>>>>>> I think you're probably right its an auth issue - although I was
>>>>>>>>>>>> expecting a more direct/clear error message if the keypair wasn't
>>>>>>>>>>>> working.
>>>>>>>>>>>>
>>>>>>>>>>>> I created the AMI by taking an EBS snapshot then converting to
>>>>>>>>>>>> instance-store.  I've tried both the ebs back ami and 
>>>>>>>>>>>> instance-store
>>>>>>>>>>>> with the same results.  My understanding is that the keypair used 
>>>>>>>>>>>> to
>>>>>>>>>>>> create the AMI is generally one of the accepted keys in addition to
>>>>>>>>>>>> the key pair used to launch the instance created by jclouds.  I'm 
>>>>>>>>>>>> not
>>>>>>>>>>>> sure how to confirm this for sure - is the jclouds keypair stored
>>>>>>>>>>>> anywhere that can be used to test this?
>>>>>>>>>>>>
>>>>>>>>>>>> Thanks again for your help,
>>>>>>>>>>>>
>>>>>>>>>>>> Joris
>>>>>>>>>>>>
>>>>>>>>>>>> On Wed, Aug 24, 2011 at 7:49 PM, Andrei Savu 
>>>>>>>>>>>> <savu.and...@gmail.com> wrote:
>>>>>>>>>>>>> I'm not sure but it looks like an auth issue to me. Whirr creates 
>>>>>>>>>>>>> it's
>>>>>>>>>>>>> own key pair using the local SSH keys as specified in the 
>>>>>>>>>>>>> properties
>>>>>>>>>>>>> file.
>>>>>>>>>>>>>
>>>>>>>>>>>>> You've created the custom ami by taking an EBS snapshot? Can you 
>>>>>>>>>>>>> use
>>>>>>>>>>>>> that custom ami with a different key pair?
>>>>>>>>>>>>>
>>>>>>>>>>>>> -- Andrei Savu / andreisavu.ro
>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Wed, Aug 24, 2011 at 7:31 PM, Joris Poort <gpo...@gmail.com> 
>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>> Andrei - thanks for the response!
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> I logged into the custom AMI using ssh and a key pair on my local
>>>>>>>>>>>>>> machine (I'm executing whirr via ubuntu virtual machine).  I've 
>>>>>>>>>>>>>> tried
>>>>>>>>>>>>>> both spot instances and regular instances and am getting the same
>>>>>>>>>>>>>> behavior.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Full output on terminal looks like (lines between "Starting 1 
>>>>>>>>>>>>>> node"
>>>>>>>>>>>>>> and "Dying because" are not always there):
>>>>>>>>>>>>>> Starting 1 node(s) with roles [hadoop-datanode, 
>>>>>>>>>>>>>> hadoop-tasktracker]
>>>>>>>>>>>>>> Starting 1 node(s) with roles [hadoop-namenode, 
>>>>>>>>>>>>>> hadoop-jobtracker]
>>>>>>>>>>>>>> Dying because - net.schmizz.sshj.transport.TransportException: 
>>>>>>>>>>>>>> Broken
>>>>>>>>>>>>>> transport; encountered EOF
>>>>>>>>>>>>>> Dying because - net.schmizz.sshj.transport.TransportException: 
>>>>>>>>>>>>>> Broken
>>>>>>>>>>>>>> transport; encountered EOF
>>>>>>>>>>>>>> <<kex done>> woke to: 
>>>>>>>>>>>>>> net.schmizz.sshj.transport.TransportException:
>>>>>>>>>>>>>> Broken transport; encountered EOF
>>>>>>>>>>>>>> << (root@174.129.128.120:22) error acquiring
>>>>>>>>>>>>>> SSHClient(root@174.129.128.120:22): Broken transport; 
>>>>>>>>>>>>>> encountered EOF
>>>>>>>>>>>>>> net.schmizz.sshj.transport.TransportException: Broken transport; 
>>>>>>>>>>>>>> encountered EOF
>>>>>>>>>>>>>>        at net.schmizz.sshj.transport.Reader.run(Reader.java:70)
>>>>>>>>>>>>>> Dying because - java.net.SocketTimeoutException: Read timed out
>>>>>>>>>>>>>> Dying because - java.net.SocketTimeoutException: Read timed out
>>>>>>>>>>>>>> Dying because - java.net.SocketTimeoutException: Read timed out
>>>>>>>>>>>>>> Dying because - java.net.SocketTimeoutException: Read timed out
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Last few entries on whirr.log:
>>>>>>>>>>>>>> 2011-08-24 19:20:05,428 DEBUG [jclouds.compute] 
>>>>>>>>>>>>>> (pool-3-thread-2) >>
>>>>>>>>>>>>>> requesting 1 spot instances region(us-east-1) price(0.250000)
>>>>>>>>>>>>>> spec([instanceType=m1.large, imageId=ami-d1e525b8, kernelId=null,
>>>>>>>>>>>>>> ramdiskId=null, availabilityZone=null,
>>>>>>>>>>>>>> keyName=jclouds#hadoop_custom_spot_1#us-east-1#45,
>>>>>>>>>>>>>> securityGroupIdToNames={}, blockDeviceMappings=[],
>>>>>>>>>>>>>> securityGroupIds=[],
>>>>>>>>>>>>>> securityGroupNames=[jclouds#hadoop_custom_spot_1#us-east-1],
>>>>>>>>>>>>>> monitoringEnabled=null, userData=null]) 
>>>>>>>>>>>>>> options([formParameters={}])
>>>>>>>>>>>>>> 2011-08-24 19:20:05,642 DEBUG [jclouds.compute] 
>>>>>>>>>>>>>> (pool-3-thread-4) <<
>>>>>>>>>>>>>> started instances([region=us-east-1, name=sir-4f589c11])
>>>>>>>>>>>>>> 2011-08-24 19:20:05,682 DEBUG [jclouds.compute] 
>>>>>>>>>>>>>> (pool-3-thread-2) <<
>>>>>>>>>>>>>> started instances([region=us-east-1, name=sir-59cec612])
>>>>>>>>>>>>>> 2011-08-24 19:20:05,864 DEBUG [jclouds.compute] 
>>>>>>>>>>>>>> (pool-3-thread-4) <<
>>>>>>>>>>>>>> present instances([region=us-east-1, name=sir-4f589c11])
>>>>>>>>>>>>>> 2011-08-24 19:20:05,917 DEBUG [jclouds.compute] 
>>>>>>>>>>>>>> (pool-3-thread-2) <<
>>>>>>>>>>>>>> present instances([region=us-east-1, name=sir-59cec612])
>>>>>>>>>>>>>> 2011-08-24 19:27:18,150 DEBUG [jclouds.compute] (user thread 8) 
>>>>>>>>>>>>>> >>
>>>>>>>>>>>>>> blocking on socket [address=50.17.135.8, port=22] for 600000 
>>>>>>>>>>>>>> seconds
>>>>>>>>>>>>>> 2011-08-24 19:27:21,132 DEBUG [jclouds.compute] (user thread 7) 
>>>>>>>>>>>>>> >>
>>>>>>>>>>>>>> blocking on socket [address=174.129.128.120, port=22] for 600000
>>>>>>>>>>>>>> seconds
>>>>>>>>>>>>>> 2011-08-24 19:27:24,222 DEBUG [jclouds.compute] (user thread 7) 
>>>>>>>>>>>>>> <<
>>>>>>>>>>>>>> socket [address=174.129.128.120, port=22] opened
>>>>>>>>>>>>>> 2011-08-24 19:27:32,255 DEBUG [jclouds.compute] (user thread 8) 
>>>>>>>>>>>>>> <<
>>>>>>>>>>>>>> socket [address=50.17.135.8, port=22] opened
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> After ssh onto node,  didn't find any logs in /tmp.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Thanks again for any help on this!
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Joris
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> On Wed, Aug 24, 2011 at 7:12 PM, Andrei Savu 
>>>>>>>>>>>>>> <savu.and...@gmail.com> wrote:
>>>>>>>>>>>>>>> I suspect this is an authentication issue. How do you login to 
>>>>>>>>>>>>>>> the custom AMI?
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Also check whirr.log for more details and on the remote 
>>>>>>>>>>>>>>> machines look
>>>>>>>>>>>>>>> in /tmp for jclouds script execution logs.
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> I know from IRC that you are using spot instances. Are you 
>>>>>>>>>>>>>>> seeing the
>>>>>>>>>>>>>>> same behavior with regular ones?
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> -- Andrei Savu / andreisavu.ro
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> On Wed, Aug 24, 2011 at 7:07 PM, Joris Poort <gpo...@gmail.com> 
>>>>>>>>>>>>>>> wrote:
>>>>>>>>>>>>>>>> Hi,
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> I'm new to whirr and I'm running custom AMI configuration 
>>>>>>>>>>>>>>>> (application
>>>>>>>>>>>>>>>> installed on working canonical image).  Executing with whirr 
>>>>>>>>>>>>>>>> 0.6.0
>>>>>>>>>>>>>>>> everything executes fine until I get the following error:
>>>>>>>>>>>>>>>> "Dying because - java.net.SocketTimeoutException: Read timed 
>>>>>>>>>>>>>>>> out"
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> The instances are running fine, I can ssh into them, but the 
>>>>>>>>>>>>>>>> whirr
>>>>>>>>>>>>>>>> code stalls and I get the above error (2x number of 
>>>>>>>>>>>>>>>> instances), no
>>>>>>>>>>>>>>>> proxy shell is created.  If I run the exact same code with 
>>>>>>>>>>>>>>>> vanilla
>>>>>>>>>>>>>>>> canonical images I don't have any issues.
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Anyone have any ideas on things to test, debug or work around 
>>>>>>>>>>>>>>>> this?
>>>>>>>>>>>>>>>> Would really appreciate it!
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Cheers,
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>> Joris
>>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>>
>>>>>>>>>>>>>>
>>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> PGP KeyID: 2048R/EA31CFC9  subkeys.pgp.net
>>>>>
>>>>
>>>
>>
>

Reply via email to