HPC cluster Rocket updates are scheduled for July 2025 that will improve the cluster's performance and capabilities.
1) Login Node Updates
We'll be performing system updates on both login nodes this month:
Login1: July 15th
Login2: July 22nd
To minimize disruption, we'll close new SSH connections one week before each update, allowing existing connections to naturally expire. One of the login nodes will remain available at all times, so you won't experience any service downtime.
2) Slurm Update
On July 22nd, starting at 15:00, we'll be upgrading Slurm from version 23.02 to 23.11. Your running jobs won't be affected, and you'll be able to submit new jobs during the update. However, commands like sacct, sacctmgr, and related tools will be unavailable during the update. The process should take about two hours but may run longer. We
After July 22nd, the compute nodes will be updated in a rolling fashion. This means some nodes will be temporarily drained until all updates are complete, which may result in longer queue times depending on cluster usage.