Skip to main content
Have a request for an upcoming news/science story? Submit a Request

Unscheduled Anvil outage - Partial

  • Outages
  • Anvil

Link to update at February 27, 2024 5:05pm EST UPDATE:

Engineers have isolated the underlying issue.

Resolving the outage.

Link to original posting ORIGINAL:

The Anvil cluster began experiencing issues with Slurm Scheduling this past week. Engineers are currently diagnosing the root cause and are working to identify a fix.

Scheduling is still enabled at this time.

You may experience periodic SLURM outage where command will be unable to connect to the slurm controller. This can cause jobs to take longer than normal, and in some instanes fail.

In addition, Open Ondemand relies on Slurm to run applications. When these issues with slurm occur, the menu in OOD may appear empty or non functional.

We will provide another update by 5PM EST today

Originally posted: