UTHPC - HPC Cluster upgrade – Maintenance details

HPC Cluster upgrade

Completed
Scheduled for 15. July 2025 at 6:00:00 – 6:00:00

Affects

rocket.hpc.ut.ee

Under maintenance from 6:00 AM to 6:00 AM

Updates
  • Update
    22. July 2025 at 6:00:00
    Maintenance is now in progress.
  • Update
    16. July 2025 at 13:11:18

    We will be directing SSH connections to login1 today. The login2 internal route will remain open until the 22nd, when we will perform maintenance and reboot the machine.

  • In progress
    15. July 2025 at 8:41:38
    Maintenance is now in progress.
  • Planned
    15. July 2025 at 6:00:00

    Updates to the Rocket HPC cluster are scheduled for July 2025. They will improve the cluster's performance and capabilities.

    1) Login Node Updates

    We'll be performing system updates on both login nodes this month:

    • Login1: July 15th

    • Login2: July 22nd

    To minimize disruption, we'll stop accepting new SSH connections to each login node one week before its update, allowing existing connections to expire naturally. One of the login nodes will remain available at all times, so you won't experience any service downtime.

    2) Slurm Update

    On July 22nd, starting at 15:00, we'll be upgrading Slurm from version 23.02 to 23.11. Your running jobs won't be affected, and you'll be able to submit new jobs during the update. However, commands such as sacct and sacctmgr, along with related tools, will be unavailable during the update. The process should take about two hours but may run longer.
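
    If your scripts call sacct while it is unavailable, one option is to retry with a pause instead of failing outright. The following is a minimal sketch, not an official UTHPC tool: the function name run_with_retry, the attempt count, and the delay are assumptions chosen for illustration.

```python
import subprocess
import time


def run_with_retry(cmd, attempts=5, delay=60.0,
                   runner=subprocess.run, sleep=time.sleep):
    """Run cmd, retrying on failure, and return stdout of the first success.

    runner and sleep are injectable so the logic can be tested without
    a real Slurm installation. Raises RuntimeError if every attempt fails.
    """
    last_error = ""
    for attempt in range(1, attempts + 1):
        try:
            result = runner(cmd, capture_output=True, text=True)
        except OSError as exc:
            # The command could not be started at all (e.g. not installed).
            last_error = str(exc)
        else:
            if result.returncode == 0:
                return result.stdout
            last_error = result.stderr.strip()
        # Wait before the next attempt, but not after the final one.
        if attempt < attempts:
            sleep(delay)
    raise RuntimeError(
        f"{cmd[0]} failed after {attempts} attempts: {last_error}"
    )
```

    For example, run_with_retry(["sacct", "-j", "12345", "--format=JobID,State"]) would query a hypothetical job 12345 and retry up to five times, one minute apart. Since the update may run longer than two hours, a longer delay or simply waiting until the update is announced as complete may be the better choice.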

    After July 22nd, the compute nodes will be updated in a rolling fashion. This means some nodes will be temporarily drained until all updates are complete, which may result in longer queue times depending on cluster usage.
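
    To get a rough idea of how many nodes are drained at a given moment, you can summarize the output of sinfo. The sketch below is illustrative only: the helper names are hypothetical, and it assumes sinfo is called with the format string "%T %D" (node state and node count). Nodes that belong to several partitions may be counted more than once.

```python
import subprocess
from collections import Counter


def count_nodes_by_state(sinfo_output):
    """Sum node counts per state from lines of the form '<state> <count>'."""
    counts = Counter()
    for line in sinfo_output.splitlines():
        parts = line.split()
        if len(parts) != 2 or not parts[1].isdigit():
            continue  # skip blank or unexpected lines
        state, nodes = parts
        # Slurm may append flag characters such as '*' to the state name.
        counts[state.rstrip("*~#$@+")] += int(nodes)
    return counts


def unavailable_nodes(counts):
    """Count nodes that are draining, drained, down, or in maintenance."""
    prefixes = ("drain", "down", "maint")
    return sum(n for state, n in counts.items() if state.startswith(prefixes))


def query_sinfo():
    """Call sinfo without a header, printing state and node count per line."""
    result = subprocess.run(
        ["sinfo", "-h", "-o", "%T %D"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

    On a login node, unavailable_nodes(count_nodes_by_state(query_sinfo())) gives a single number to watch while the rolling update is under way. A high value suggests queue times may be longer than usual.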

  • Completed
    15. July 2025 at 6:00:00
    Maintenance has completed successfully.