[slurm-users] Running Containerized Slurmctld and Slurmdb in Production?

Howdy,

 

Just wondering if any sites are running containerized Slurmctld and Slurmdbd in production?

 

We are in the process of planning migrating from a single host running slurmctld, slurmdbd, and MySQL (and other HPC services) to separate OpenStack VMs. Our site averages less than 1000’s running / pending jobs at any given time. Like
many HPC sites, our jobs are a mix of long running, large arrays, very short…

 

I ran across this Github project “Slurm Docker Cluster”
github.com/giovtorres/slurm-docker-cluster and got me thinking that this method might be great for simpler upgrades, ease of reproducing the cluster in development, etc…

 

How about it, anyone running containerized Slurm server processes in production?

 

Thanks, Mike

Read more here: Source link