[slurm-dev] Re: Regards Postgres Plugin for SLURM

Rémi Palancher Sat, 09 Apr 2016 01:46:11 -0700


Hi there,

Sorry for reviving this old thread, but I'd like to share our own experience, FWIW.

We also agree that PostgreSQL is better suited to handling terabytes of job data nowadays. But instead of writing a new dedicated accounting storage plugin (a quick look at the mysql plugin code is enough to be convinced that it would be painful), we took another approach.

We consider the Slurm database to be just a temporary, application-specific storage backend used only for accounting purposes, and we simply live with it. We then enable slurmdbd automatic purging (to keep the database from growing forever). With MariaDB, this has worked pretty well so far.

But since we do care about job metadata over the lifetime of our supercomputers, we have developed software that crawls the Slurm database to incrementally fill a PostgreSQL database:


http://edf-hpc.github.io/hpcstats/ [*]

This software can also pull data from monitoring software, LDAP directories, and so on. This way, we have all our precious data in PostgreSQL for reporting and statistics purposes. This has the following advantages:

- It's a separate DB, so running complex queries does not disturb slurmdbd.
- It's a mashup of various data sources, so we can extract metrics with advanced correlations.
- It's generic and not tied to any technology, so we keep all the flexibility to change whenever we want.
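
To give a rough idea of what "incrementally filling" a second database means, here is a minimal sketch. It is not HPCStats code: the `jobs` table, its columns, and the `sync_new_jobs` helper are all hypothetical, and SQLite stands in for both the Slurm database and PostgreSQL so that the example runs on its own. It only copies rows whose id is above the highest id already imported, so it does not handle jobs that change after being copied (for example, a running job that later completes).

```python
import sqlite3


def sync_new_jobs(src, dst):
    """Copy jobs not yet present in dst; return the number of rows copied.

    src and dst are DB-API connections. The 'jobs' table and its columns
    are hypothetical stand-ins, not the real Slurm or HPCStats schema.
    """
    dst.execute(
        "CREATE TABLE IF NOT EXISTS jobs ("
        "job_id INTEGER PRIMARY KEY, username TEXT, elapsed INTEGER)"
    )
    # High-water mark: the largest job id already imported (0 if none).
    (last_id,) = dst.execute(
        "SELECT COALESCE(MAX(job_id), 0) FROM jobs"
    ).fetchone()
    # Fetch only the rows that appeared in the source since the last run.
    rows = src.execute(
        "SELECT job_id, username, elapsed FROM jobs "
        "WHERE job_id > ? ORDER BY job_id",
        (last_id,),
    ).fetchall()
    # Insert the batch in a single transaction.
    with dst:
        dst.executemany("INSERT INTO jobs VALUES (?, ?, ?)", rows)
    return len(rows)
```

Because the destination keeps its own copy, rows purged from the source later on stay available for reporting.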


We are happy with this approach so far :)

[*] The software is open source, but it may be hard to make it work in your IS without a significant integration effort. It is designed as a generic framework with plugins, but the current plugins are quite specific to our needs. Feel free to contact me if you feel brave and would like any help, though :)


Best,
Rémi
