Alex and Riccardo,

I've updated the branch on the bobd00 fork containing the PR that adds Azure support. See 
https://github.com/bobd00/elasticluster/blob/azure_support_part2/README-AZURE.rst

Still testing the Slurm issue noted below, but I wanted to get this out 
there for testing in the meantime. 
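
In case it helps anyone reproduce the Slurm issue, here is a minimal sketch (Python, standard library only) that lists which slurmdbd unit files actually exist under a playbook roles directory. The function name `find_unit_files` and the glob pattern are illustrative assumptions, not part of ElastiCluster; the directory to pass in would be the `.../share/playbooks/roles` path from the error quoted below.

```python
from pathlib import Path


def find_unit_files(roles_dir, pattern="*slurmdbd*.service"):
    """Return every regular file under roles_dir whose name matches pattern.

    The default pattern is an assumption chosen to match both
    'slurmdbd.service' and 'slurm-llnl-slurmdbd.service'.
    """
    root = Path(roles_dir)
    # rglob walks the tree recursively; keep regular files only.
    return sorted(p for p in root.rglob(pattern) if p.is_file())
```

Calling `find_unit_files(...)` on the roles directory and printing the result shows at a glance whether the file the task asks for exists anywhere in the tree, or only under a different role or name.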

Best,
Dave

On Friday, September 23, 2016 at 10:54:57 AM UTC-6, Dave Steinkraus wrote:
>
> Hi Riccardo,
>
> Thanks for your suggestions. It didn't make sense to me either; I've never 
> had that problem before. I was working around it yesterday by including 
> this in the [setup/ansible-slurm] section of the config file:
>
> ansible_private_key_file=/home/dave/.ssh/myazcert.pem
>
> --and when I did so, I was able to confirm in the debug output that 
> Ansible was seeing it, and the ssh command was using it. Without that line, 
> there was no "-o IdentityFile" in the ssh command, which failed.
>
> Long story short, I think it was an artifact of my setup on that particular 
> machine: I had cloned the Ansible repo and was running Ansible branch 
> stable-2.0.0.1 from source. (The reason I did that was so that I could run 
> it in the debugger to try to understand yet another weird problem, which 
> was that Ansible wasn't interpreting expressions like 
> '{{ansible_os_family}}' in playbooks - it was treating them as literal 
> text.)
>
> I have had problems with every version of Ansible other than the one that 
> Elasticluster installs, so I'm going to avoid other versions. This morning 
> I'm doing a clean run on a new Linux VM, and I'm not seeing these issues. 
> Everything runs fine until, far into the Ansible provisioning, I hit this:
>
> TASK [slurm-master : Replace systemd unit file for SLURM services] *************
> task path: /home/dave/.virtualenvs/elasticluster/elasticluster/elasticluster/share/playbooks/roles/slurm-master/tasks/install-slurmdbd.yml:27
> fatal: [frontend001]: FAILED! => {"failed": true, "msg": "the file_name '/home/dave/.virtualenvs/elasticluster/elasticluster/elasticluster/share/playbooks/roles/usr/lib/systemd/system/slurm-llnl-slurmdbd.service' does not exist, or is not readable"}
>
>
> So that's my next challenge. (Neither the file nor the path it's looking 
> for exists. What does exist is this: 
> '/home/dave/.virtualenvs/elasticluster/elasticluster/elasticluster/share/playbooks/roles/slurm-master/files/usr/lib/systemd/system/slurmdbd.service'.)
>
> Thanks,
> Dave
>
> On Friday, September 23, 2016 at 7:22:39 AM UTC-6, Riccardo Murri wrote:
>>
>> Hi Dave, 
>>
>> > The caveat: I can start and stop clusters, but Ansible provisioning is not 
>> > working for me at the moment. I think this is minor -- Ansible is not able 
>> > to make an ssh connection to the nodes, because it's not trying the right 
>> > private key. If I extract the ssh command and add "-i mykey", the command 
>> > works. 
>>
>> I'm a bit surprised here -- the code in ElastiCluster that invokes 
>> Ansible is pretty simple: as long as the correct file name is in 
>> `cluster.user_key_private` there should be no possibility of error... 
>>
>> Can you please try to do the following with ElastiCluster 1.3.dev: 
>>
>> - Create a cluster but prevent Ansible from running:: 
>>
>>     elasticluster start --no-setup mycluster 
>>
>> - Run Ansible setup with maximum debug:: 
>>
>>     elasticluster -vv setup mycluster -- -vvv 
>>
>>   You can stop it with Ctrl+C as soon as the "TASK [setup]" part is 
>>   done. 
>>
>> The output should show: 
>>
>> (a) All environment variables that ElastiCluster sets for running 
>>     `ansible-playbook` (`ANSIBLE_PRIVATE_KEY_FILE` is the relevant one) 
>>
>> (b) The command-line options that Ansible passes to the slave SSH 
>>     (here `-o IdentityFile=...` is important) 
>>
>>
>> > Meanwhile, I'm starting work on updating the PR on the bobd00 fork so that 
>> > the changes will be ready to merge. Or maybe it would be faster to just 
>> > make a new fork and create the PR there. 
>>
>> Whichever is best/faster for you; I'm not strict on process. 
>>
>> Thanks again, 
>> Riccardo 
>>
>> -- 
>> Riccardo Murri, Schwerzenbacherstrasse 2, CH-8606 Nänikon, Switzerland 
>>
>
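
The check described in the thread above (whether the ssh command that Ansible builds carries an identity file) can be sketched in Python as follows. This is a rough sketch under assumptions: the helper name `identity_files` is made up, the sample command line is hypothetical rather than copied from real debug output, and only the `-o IdentityFile=...` and `-i` forms are handled.

```python
import shlex


def identity_files(ssh_command):
    """Return the identity-file paths named in an ssh command line.

    Recognizes '-o IdentityFile=PATH', '-oIdentityFile=PATH' and '-i PATH'.
    An empty list means ssh would fall back to its default keys.
    """
    tokens = shlex.split(ssh_command)
    found = []
    for i, tok in enumerate(tokens):
        nxt = tokens[i + 1] if i + 1 < len(tokens) else None
        if tok == "-o" and nxt and nxt.lower().startswith("identityfile="):
            value = nxt.split("=", 1)[1]
        elif tok.lower().startswith("-oidentityfile="):
            value = tok.split("=", 1)[1]
        elif tok == "-i" and nxt:
            value = nxt
        else:
            continue
        # Drop quotes that may still wrap the path after shell splitting.
        found.append(value.strip("\"'"))
    return found


# Hypothetical command line, for illustration only.
sample = "ssh -C -o ControlMaster=auto -o IdentityFile=/home/dave/.ssh/myazcert.pem frontend001"
print(identity_files(sample))
```

An empty result for a command extracted from the `-vvv` output would match the failure described above, where no `-o IdentityFile` option was passed.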
