Hi Angel,
On 5/1/23 11:28, Angel de Vicente wrote:
Ole Holm Nielsen <ole.h.niel...@fysik.dtu.dk> writes:
If I read Brian's comments correctly, he's saying that Slurm already has a
well-tested and documented solution for multi-cluster sites: Federated clusters.
Thanks Ole. Don't get me wrong, I have nothing against using Federated
clusters, and I guess I will probably end up going for it, but my
question keeps just the same (as far as I understand nothing changes in
that respect with multi-cluster or federated setting?): whether I should
just run one slurmdbd daemon or several.
As Brian wrote:
On a technical note: slurm keeps the detailed accounting data for each cluster in separate TABLES within a single database.
In the Federation page https://slurm.schedmd.com/federation.html it is
implicitly assumed that the sacctmgr command talks only to a single
slurmdbd instance. It is not, however, explicitly stated as an answer to
your question.
You can see in another presentation that there is only a *single* slurmdbd
in a federated multi-cluster scenario:
https://slurm.schedmd.com/SLUG18/slurm_overview.pdf
Look at slide 28 "Typical Enterprise Architecture".
/Ole