Dear All,

after bugging this list long enough with questions about how to get Julia 
running in parallel on a Torque/PBS managed cluster I thought I'd share my 
experience with the list. I realise that by julia standards this is a 
rather modest achievement, but I'd been happy to come across something like 
this post a while ago. :-)

So, there's nothing special about the cluster being PBS managed (rather 
than SGE or whatever), I just found that each system is as idiosyncratic as 
the sysadmin person who set it up (starting from the format of nodenames to 
scheduler options to how environment variables get forwarded into a node, 
etc etc), so it always takes a fair amount of hacking to get something 
running. It's pretty low quality hacking I would say, but it's painful on a 
cluster. You basically need to adapt the functions in iridis_launcher.jl in 
the below repo to your system. Very few of the issues actually had anything 
to do with Julia itself, so I tried to explain as much about the 
environment as possible. Again, just sharing this in the hope someone out 
there is trying to achieve something similar may find this useful:

https://github.com/floswald/parallelTest/tree/master/julia/iridis


Reply via email to