
Using SLURM on MedBioNode for parallel computing

Slurm is an open-source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters.

It provides three key functions:

  1. allocating exclusive and/or non-exclusive access to resources (nodes) to users for some duration of time so they can perform work,
  2. providing a framework for starting, executing, and monitoring work (typically a parallel job such as MPI) on a set of allocated nodes, and
  3. arbitrating contention for resources by managing a queue of pending jobs.
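As a rough illustration of how these three functions appear in practice, the sketch below uses Python to generate the text of a small Slurm batch script. The `#SBATCH` lines request resources (function 1), `srun` launches the work on the allocated nodes (function 2), and submitting the file with `sbatch` places the job in the queue (function 3). The helper name `render_batch_script` and all values (job name, node count, time limit, command) are hypothetical and not taken from the MedBioNode configuration; the cluster may need further options, such as a partition or account.

```python
def render_batch_script(job_name, nodes, ntasks, time_limit, command):
    """Return the text of a minimal Slurm batch script (illustrative only)."""
    lines = [
        "#!/bin/bash",
        # Resource request: Slurm allocates nodes and tasks for a limited time.
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --nodes={nodes}",
        f"#SBATCH --ntasks={ntasks}",
        f"#SBATCH --time={time_limit}",
        # %x expands to the job name and %j to the job ID in the output file name.
        "#SBATCH --output=%x-%j.out",
        "",
        # srun starts the command as tasks inside the allocation.
        f"srun {command}",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical example values; adjust to the resources you actually need.
    print(render_batch_script("hello", 2, 4, "00:10:00", "hostname"), end="")
```

Saving the printed text to a file, for example `hello.sh`, and running `sbatch hello.sh` on a login node would submit it; `squeue` then lists the job while it is pending or running.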

This course covers the basic principles of using the Slurm Workload Manager on the MedBioNode cluster. It includes some short hands-on exercises, so as a prerequisite attendees should know how to work in Linux and how to log on to the cluster.

For registration please fill in the registration form.