Revisions to Slurm Configuration v2.1.0 on Caviness
This document summarizes alterations to the Slurm job scheduler configuration on the Caviness cluster.
Issues
Caviness Expansion 2.1
A new rack (r04) has been added to the Caviness cluster. Nodes in the new rack must be integrated into the Slurm configuration for job scheduling.
Implementation
- The Slurm
nodes.conffile will be modified to include r04. - The Slurm
partitions.conffile will be modified to:- Adjust node assignments for existing workgroups who purchased node(s) in r04
- Add new workgroups who purchased node(s) in r04
- The Slurm
topology.conffile will be modified to include OPA switches/HFIs in r04- The
/opt/shared/slurm/add-ons/bin/opa2slurmutility (written by IT-RCI staff) will be used to automatically map the OPA network
Impact
No downtime is expected to be required.
Timeline
| Date | Time | Goal/Description |
|---|---|---|
| 2020-10-01 | Authoring of this document | |
| 2020-10-02 | 14:30 | Implementation |