Outages and Maintenance
-
Anvil and Negishi cluster maintenance
The Anvil and Negishi clusters will be unavailable beginning Tuesday, March 12th, 2024 at 8:00am for scheduled maintenance. The clusters will return to full production by Tuesday, March 12th at 5:00pm. During this time, Anvil and Negishi will have rack...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, March 6, 2024 from 8:00am - 12:00pm EST for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any transfer...
-
The Negishi cluster began experiencing issues around 2:00PM, and engineers isolated the problem with Negishi. To resolve the issue, the scheduler had to be restarted. If you had jobs running on Negishi, please check them to ensure they are stil...
-
Unscheduled Anvil outage - Partial
The Anvil cluster began experiencing issues with Slurm scheduling this past week. Engineers are currently diagnosing the root cause and working to identify a fix. Scheduling is still enabled at this time. You may experience periodic Slurm outage...
-
The Gilbreth cluster will be unavailable beginning Tuesday, February 27th, 2024 at 8:00am for scheduled maintenance. The cluster will return to full production by Wednesday, February 28th at 5:00pm. During this time, Gilbreth will have minor operat...
-
The Geddes cluster began experiencing networking issues around 9:00am EST. Engineers have identified the issue and are working to bring Geddes back to service. Deployments may be in a degraded state during this time. We will provide an update by 3:00...
-
Anvil Scheduled Maintenance Wednesday, February 7th, 2024
The Anvil system will be unavailable Wednesday, February 7th, 2024 from 8:00am - 6:00pm EST for scheduled maintenance. Any Slurm jobs whose requested walltime would take them past Wednesday, February 7th, 2024 at 8:00am EST will not start and w...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, February 7, 2024 from 8:00am - 12:00pm EST for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any trans...
-
Weber Cluster Emergency Maintenance
The Weber cluster will be unavailable beginning Tuesday, February 6th, 2024 at 1:30PM EST. The cluster will return to full production by Tuesday, February 6th, 2024 at 5:00PM. During this time, a failed disk will be replaced on Weber. Any jobs which r...
-
Scratch on Gilbreth will undergo maintenance on Wednesday, January 24th from 8:00 AM EST until 5:00 PM EST. During this time, scratch will be unavailable on Gilbreth. Job scheduling on Gilbreth will be paused while storage engineers perform maintenanc...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, January 3, 2024 from 8:00am - 12:00pm EST for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any transf...
-
The Negishi cluster will be unavailable beginning Wednesday, December 20th, 2023 at 8:00am for scheduled maintenance. The cluster will return to full production by Wednesday, December 20th, 2023 at 5:00pm. During this time, Negishi will have minor...
-
The Globus software on the Gilbreth cluster will be upgraded Thursday, December 7th, 2023 from 8:00am - 5:00pm EST during scheduled maintenance. The cluster will return to full production by Thursday, December 7th, 2023 at 5:00pm EST. During this time, Gilbret...
-
The Bell cluster will be unavailable beginning Wednesday, December 6th, 2023 at 8:00am for scheduled maintenance. The cluster will return to full production by Wednesday, December 6th, 2023 at 5:00pm. During this time, Bell will have a configuratio...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, December 6, 2023 from 8:00am - 11:30am EST for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any trans...
-
Gilbreth is experiencing scheduling issues, and jobs have been paused while RCAC works to resolve them. Running jobs have also been impacted, so if your job was running, you will need to resubmit it once job scheduling resumes. We will provid...
-
The Weber cluster will be unavailable beginning Tuesday, November 7th, 2023 at 8:00am for scheduled maintenance. The cluster will return to full production by Wednesday, November 8th, 2023 at 5:00pm. During this time, Weber will be moved. No servic...
-
Fortress began experiencing issues with its tape library around 5:00PM. Engineers are currently diagnosing the issue and are working to identify a fix. Vendor support has been contacted and diagnostics have been uploaded. During this time tape access...
-
Bell Cluster partial outage (scratch fast storage tier)
The Bell cluster scratch filesystem (fast tier) is experiencing issues, and that portion of scratch has been turned off. Scratch is still available, but will run at a slower speed until the situation is resolved. RCAC engineers are working wi...
-
The Bell cluster will be unavailable beginning Tuesday, October 31st, 2023 at 8:00am for scheduled maintenance. The cluster will return to full production by Tuesday, October 31st, 2023 at 5:00pm. During this time, Bell will have minor operating sy...