Hello, I am returning to elasticluster after about a year and found that my configuration fails. I think my configuration is pretty simple: I have asked elasticluster to create a frontend and a compute node using Ubuntu 14.04 LTS, and I want Grid Engine installed. Here is my test config.
```
[cloud/amazon-us-west-1]
provider=ec2_boto
ec2_url=https://ec2.us-west-1.amazonaws.com
ec2_access_key=...omit ...
ec2_secret_key=....omit ...
ec2_region=us-west-1

[login/seeker]
image_user=ubuntu
image_user_sudo=root
image_sudo=True
user_key_name=elasticluster
user_key_private=~/.ssh/id_rsa
user_key_public=~/.ssh/id_rsa.pub

[setup/ansible-gridengine]
provider=ansible
frontend_groups=gridengine_master
compute_groups=gridengine_worker

[cluster/accuragen]
cloud=amazon-us-west-1
login=seeker
setup=ansible-gridengine
ssh_to=frontend
security_group=default
image_id=ami-debaf8be
flavor=m4.large
frontend_nodes=1
compute_nodes=1
```

Unfortunately, it fails.

```
TASK [gridengine-master : Set GridEngine master variables (Debian/Ubuntu)] *****
ok: [frontend001]

TASK [gridengine-master : Install GridEngine master packages] ******************
changed: [frontend001] => (item=[u'gridengine-client', u'gridengine-common', u'gridengine-master', u'gridengine-qmon'])

TASK [gridengine-master : deploy qmaster installer config (RHEL/CentOS)] *******
skipping: [frontend001]

TASK [gridengine-master : qmaster installation (RHEL/CentOS)] ******************
skipping: [frontend001]

TASK [gridengine-master : Load GridEngine settings at login (RHEL/CentOS)] *****
skipping: [frontend001] => (item=csh)
skipping: [frontend001] => (item=sh)

TASK [gridengine-master : Create GridEngine service file (RHEL/CentOS 6.x)] ****
skipping: [frontend001]

TASK [gridengine-master : Create GridEngine service file (RHEL/CentOS 7.x)] ****
skipping: [frontend001]

TASK [gridengine-master : Reload systemd service units (RHEL/CentOS 7.x)] ******
skipping: [frontend001]

TASK [gridengine-master : upload configuration file for host] ******************
changed: [frontend001] => (item=compute001)

TASK [gridengine-master : add admin hosts] *************************************
failed: [frontend001] (item=frontend001) => {"changed": false, "cmd": "bash -lc 'qconf -ah \"frontend001\"'", "delta": "0:00:00.060668", "end": "2017-05-24 21:12:41.315334", "failed": true, "item": "frontend001", "rc": 1, "start": "2017-05-24 21:12:41.254666", "stderr": "stdin: is not a tty\nerror: unable to read qmaster name: qmaster hostname in \"/var/lib/gridengine/default/common/act_qmaster\" has zero length", "stdout": "", "stdout_lines": [], "warnings": []}
failed: [frontend001] (item=compute001) => {"changed": false, "cmd": "bash -lc 'qconf -ah \"compute001\"'", "delta": "0:00:00.007756", "end": "2017-05-24 21:12:41.434374", "failed": true, "item": "compute001", "rc": 1, "start": "2017-05-24 21:12:41.426618", "stderr": "stdin: is not a tty\nerror: unable to read qmaster name: qmaster hostname in \"/var/lib/gridengine/default/common/act_qmaster\" has zero length", "stdout": "", "stdout_lines": [], "warnings": []}
	to retry, use: --limit @/home/jkern/src/elasticluster/elasticluster/share/playbooks/site.retry

PLAY RECAP *********************************************************************
compute001 : ok=40 changed=27 unreachable=0 failed=0
frontend001 : ok=63 changed=36 unreachable=0 failed=1

2017-05-24 14:12:41 ubuntu gc3.elasticluster[87990] ERROR Command `ansible-playbook /home/jkern/src/elasticluster/elasticluster/share/playbooks/site.yml --inventory=/home/jkern/.elasticluster/storage/meadow.inventory --become --become-user=root` failed with exit code 2.
2017-05-24 14:12:41 ubuntu gc3.elasticluster[87990] ERROR Check the output lines above for additional information on this error.
2017-05-24 14:12:41 ubuntu gc3.elasticluster[87990] ERROR The cluster has likely *not* been configured correctly. You may need to re-run `elasticluster setup` or fix the playbooks.
2017-05-24 14:12:41 ubuntu gc3.elasticluster[87990] WARNING Cluster `meadow` not yet configured. Please, re-run `elasticluster setup meadow` and/or check your configuration
WARNING: YOUR CLUSTER IS NOT READY YET!
```

The `act_qmaster` file is indeed empty. When I install the packages manually with apt-get, there is no explicit step to populate this file.
Any suggestions on how to debug it? Below are the dependencies based on pip freeze. Thanks for your time and consideration.

```
adal==0.4.5
ansible==2.3.0.0
apache-libcloud==2.0.0
appdirs==1.4.3
asn1crypto==0.22.0
azure==2.0.0
azure-batch==3.0.0
azure-common==1.1.6
azure-datalake-store==0.0.9
azure-graphrbac==0.30.0
azure-keyvault==0.3.3
azure-mgmt==1.0.0
azure-mgmt-authorization==0.30.0
azure-mgmt-batch==4.0.0
azure-mgmt-cdn==0.30.3
azure-mgmt-cognitiveservices==1.0.0
azure-mgmt-compute==1.0.0
azure-mgmt-containerregistry==0.2.1
azure-mgmt-datalake-analytics==0.1.4
azure-mgmt-datalake-nspkg==2.0.0
azure-mgmt-datalake-store==0.1.4
azure-mgmt-devtestlabs==2.0.0
azure-mgmt-dns==1.0.1
azure-mgmt-documentdb==0.1.3
azure-mgmt-iothub==0.2.2
azure-mgmt-keyvault==0.31.0
azure-mgmt-logic==2.1.0
azure-mgmt-monitor==0.2.1
azure-mgmt-network==1.0.0
azure-mgmt-nspkg==2.0.0
azure-mgmt-rdbms==0.1.0
azure-mgmt-redis==4.1.0
azure-mgmt-resource==1.1.0
azure-mgmt-scheduler==1.1.2
azure-mgmt-sql==0.5.1
azure-mgmt-storage==1.0.0
azure-mgmt-trafficmanager==0.30.0
azure-mgmt-web==0.32.0
azure-nspkg==2.0.0
azure-servicebus==0.21.1
azure-servicefabric==5.6.130
azure-servicemanagement-legacy==0.20.6
azure-storage==0.34.2
Babel==2.3.4
boto==2.46.1
certifi==2017.4.17
cffi==1.10.0
click==6.7
cliff==2.7.0
cmd2==0.7.2
coloredlogs==7.0
cryptography==1.8.1
debtcollector==1.13.0
-e git+https://github.com/gc3-uzh-ch/elasticluster.git@56f4cc209065457d92dd66ac065328af02b2d95f#egg=elasticluster
enum34==1.1.6
funcsigs==1.0.2
functools32==3.2.3.post2
futures==3.1.1
google-api-python-client==1.6.2
google-compute-engine==2.3.7
httplib2==0.10.3
humanfriendly==3.2
idna==2.5
ipaddress==1.0.18
iso8601==0.1.11
isodate==0.5.4
Jinja2==2.9.6
jsonpatch==1.15
jsonpointer==1.10
jsonschema==2.6.0
keyring==10.3.2
keystoneauth1==2.20.0
MarkupSafe==1.0
monotonic==1.3
msgpack-python==0.4.8
msrest==0.4.8
msrestazure==0.4.7
netaddr==0.7.19
netifaces==0.10.5
oauth2client==4.1.0
oauthlib==2.0.2
os-client-config==1.27.0
osc-lib==1.6.0
oslo.config==4.1.1
oslo.i18n==3.15.3
oslo.serialization==2.18.0
oslo.utils==3.25.1
packaging==16.8
paramiko==2.1.2
pathlib2==2.2.1
pbr==3.0.1
positional==1.1.1
prettytable==0.7.2
pyasn1==0.2.3
pyasn1-modules==0.0.8
pyCLI==2.0.3
pycparser==2.17
pycrypto==2.6.1
PyJWT==1.5.0
pyparsing==2.2.0
python-cinderclient==2.0.1
python-dateutil==2.6.0
python-gflags==3.1.1
python-glanceclient==2.6.0
python-keystoneclient==3.10.0
python-neutronclient==6.2.0
python-novaclient==8.0.0
pytz==2017.2
PyYAML==3.12
requests==2.14.2
requests-oauthlib==0.8.0
requestsexceptions==1.2.0
rfc3986==1.0.0
rsa==3.4.2
scandir==1.5
schema==0.6.6
SecretStorage==2.3.1
simplejson==3.10.0
six==1.10.0
stevedore==1.21.0
unicodecsv==0.14.1
uritemplate==3.0.0
warlock==1.2.0
wrapt==1.10.10
```
