Hi,

I've been tinkering and have what I think is a correct setup of Chapel on an EMR (Elastic MapReduce) cluster, but programs don't execute properly, so I suspect either the SSH or the GASNet settings. I use agent forwarding to SSH into the cluster, so I can SSH from there into my slave nodes (well, node).
Once I have Chapel built and I've compiled an example, I still have problems with execution on "multiple locales" (again, in my case, 1 remote server). Does Chapel assume that CHPL_HOME is a shared file system accessible throughout the cluster?

[hadoop@ip-172-31-19-157 examples]$ export GASNET_SSH_SERVERS=$(cat ~/slaveHosts)
[hadoop@ip-172-31-19-157 examples]$ export SSH_OPTIONS="-x -v"
[hadoop@ip-172-31-19-157 examples]$ ./hello -nl 1 -v
/home/hadoop/chapel-1.8.0/third-party/gasnet/install/linux64-gnu/seg-everything/nodbg/bin/amudprun -np 1 ./hello_real -nl 1 -v
OpenSSH_6.2p2, OpenSSL 1.0.0-fips 29 Mar 2010
debug1: Reading configuration data /etc/ssh/ssh_config
debug1: /etc/ssh/ssh_config line 50: Applying options for *
debug1: Connecting to 172.31.20.123 [172.31.20.123] port 22.
debug1: Connection established.
debug1: identity file /home/hadoop/.ssh/id_rsa type -1
debug1: identity file /home/hadoop/.ssh/id_rsa-cert type -1
debug1: identity file /home/hadoop/.ssh/id_dsa type -1
debug1: identity file /home/hadoop/.ssh/id_dsa-cert type -1
debug1: Enabling compatibility mode for protocol 2.0
debug1: Local version string SSH-2.0-OpenSSH_6.2
debug1: Remote protocol version 2.0, remote software version OpenSSH_6.2
debug1: match: OpenSSH_6.2 pat OpenSSH*
debug1: SSH2_MSG_KEXINIT sent
debug1: SSH2_MSG_KEXINIT received
debug1: kex: server->client aes128-ctr [email protected] none
debug1: kex: client->server aes128-ctr [email protected] none
debug1: SSH2_MSG_KEX_DH_GEX_REQUEST(1024<1024<8192) sent
debug1: expecting SSH2_MSG_KEX_DH_GEX_GROUP
debug1: SSH2_MSG_KEX_DH_GEX_INIT sent
debug1: expecting SSH2_MSG_KEX_DH_GEX_REPLY
debug1: Server host key: RSA 2b:5e:6c:27:b2:77:65:38:f9:a1:79:c5:87:0e:be:c7
debug1: Host '172.31.20.123' is known and matches the RSA host key.
debug1: Found key in /home/hadoop/.ssh/known_hosts:1
debug1: ssh_rsa_verify: signature correct
debug1: SSH2_MSG_NEWKEYS sent
debug1: expecting SSH2_MSG_NEWKEYS
debug1: SSH2_MSG_NEWKEYS received
debug1: Roaming not allowed by server
debug1: SSH2_MSG_SERVICE_REQUEST sent
debug1: SSH2_MSG_SERVICE_ACCEPT received
debug1: Authentications that can continue: publickey
debug1: Next authentication method: publickey
debug1: Offering RSA public key: /Users/foscraig/.keys/craigf.pem
debug1: Server accepts key: pkalg ssh-rsa blen 279
debug1: Authentication succeeded (publickey).
Authenticated to 172.31.20.123 ([172.31.20.123]:22).
debug1: channel 0: new [client-session]
debug1: Requesting [email protected]
debug1: forking to background
debug1: Entering interactive session.
debug1: Sending environment.
debug1: Sending env LANG = en_US.UTF-8
debug1: Sending command: cd '/home/hadoop/chapel-1.8.0/examples' ; './hello_real' '__AMUDP_SLAVE_PROCESS__' '172,31,19,157,158,33' '-nl' '1' '-v'
bash: line 0: cd: /home/hadoop/chapel-1.8.0/examples: No such file or directory
bash: ./hello_real: No such file or directory
debug1: client_input_channel_req: channel 0 rtype exit-status reply 0
debug1: channel 0: free: client-session, nchannels 1
Transferred: sent 3244, received 3072 bytes, in 0.0 seconds
Bytes per second: sent 81275.0, received 76965.7
debug1: Exit status 127
[kicked out of my cluster after a time]

FYI, I have the GASNET and CHPL variables defined:

[hadoop@ip-172-31-19-157 chapel-1.8.0]$ printenv |grep "GASNET*"
GASNET_SSH_SERVERS=172.31.20.123
GASNET_SPAWNFN=S
[hadoop@ip-172-31-19-157 doc]$ printenv | grep "CHPL*"
CHPL_COMM=gasnet
CHPL_HOST_PLATFORM=linux64
CHPL_HOME=/home/hadoop/chapel-1.8.0

Any help is appreciated. Thanks!
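P.S. The two "No such file or directory" lines in the log suggest the remote node simply doesn't have the examples directory or the hello_real binary at the same path. Below is a rough sketch of how that could be checked for every host before launching. The function name check_remote_binary and the SSH_CMD override are made up for illustration and are not part of Chapel or GASNet. It also assumes GASNET_SSH_SERVERS is a whitespace-separated host list, as in my export above, and that the paths contain no single quotes.

```shell
#!/bin/sh
# Sketch only: report whether each host has the directory and binary that the
# spawner will try to use. check_remote_binary and SSH_CMD are illustrative
# names, not part of Chapel or GASNet.

# check_remote_binary DIR BINARY HOST...
# Prints "HOST: ok" or "HOST: missing DIR/BINARY" for each host.
check_remote_binary() {
    dir=$1
    bin=$2
    shift 2
    for host in "$@"; do
        # Mirror the failing remote command: cd into the directory, then
        # check that the binary exists there and is executable.
        if ${SSH_CMD:-ssh} "$host" "cd '$dir' && test -x './$bin'" </dev/null; then
            echo "$host: ok"
        else
            echo "$host: missing $dir/$bin"
        fi
    done
}

# Example invocation from the examples directory (not run automatically):
#   check_remote_binary "$PWD" hello_real $GASNET_SSH_SERVERS
```

If a host reports "missing", I presume the next thing to try is copying hello_real to the same path on that node, or putting the directory on a file system both nodes share.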
