Dear All, after bugging this list long enough with questions about how to get Julia running in parallel on a Torque/PBS managed cluster, I thought I'd share my experience with the list. I realise that by Julia standards this is a rather modest achievement, but I would have been happy to come across something like this post a while ago. :-)
So, there's nothing special about the cluster being PBS managed (rather than SGE or whatever). I just found that each system is as idiosyncratic as the sysadmin who set it up (from the format of node names to scheduler options to how environment variables get forwarded to a node, etc.), so it always takes a fair amount of hacking to get something running. It's pretty low-quality hacking, I would say, but it's painful on a cluster. You basically need to adapt the functions in iridis_launcher.jl in the repo linked below to your system. Very few of the issues actually had anything to do with Julia itself, so I tried to explain as much about the environment as possible. Again, I'm just sharing this in the hope that someone out there who is trying to achieve something similar finds it useful: https://github.com/floswald/parallelTest/tree/master/julia/iridis
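
To give a flavour of the kind of adaptation involved, here is a minimal sketch. It is not taken from iridis_launcher.jl; `machines_from_nodefile` is a hypothetical helper name, and the sketch assumes that the scheduler sets `PBS_NODEFILE` to a file listing one hostname per allocated core (the usual Torque behaviour) and that the node names in it are reachable over passwordless SSH. On your system the node name format may well need extra massaging.

```julia
using Distributed

# Hypothetical helper (an assumption for illustration, not the repo's code):
# collapse a nodefile with one hostname per allocated core into
# (hostname, count) pairs, keeping the order in which hosts first appear.
function machines_from_nodefile(path::AbstractString)
    hosts = String[]
    counts = Dict{String,Int}()
    for line in eachline(path)
        host = String(strip(line))
        isempty(host) && continue        # skip blank lines
        if !haskey(counts, host)
            push!(hosts, host)
            counts[host] = 0
        end
        counts[host] += 1
    end
    return [(h, counts[h]) for h in hosts]
end

# Only try to launch workers when we are actually inside a PBS job.
if haskey(ENV, "PBS_NODEFILE")
    machines = machines_from_nodefile(ENV["PBS_NODEFILE"])
    addprocs(machines)                   # starts `count` workers per host over SSH
    println("workers started: ", nworkers())
end
```

Outside a PBS job the last block does nothing, so the parsing function can be tried on any text file with one hostname per line before submitting a job.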