Outages and Maintenance
-
RCAC Whole-Floor Downtime and Power Work
The majority of the Research Computing computational resources will be unavailable July 30, 2021 7:00am - August 1, 2021 12:00pm EDT for a whole-floor downtime due to electrical power work in MATH and POD data centers. Along with a required preven...
-
The Brown cluster began experiencing issues with cooling around 9:00pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will provide an update...
-
Scheduling Paused on Multiple Clusters
At about 4:00 pm today (Wednesday, 21 July, 2021) System Engineers found an issue with the schedulers on the Bell, Brown, Gilbreth, Halstead, and Scholar clusters. Job scheduling has been paused while this is being investigated. Symptoms of this pro...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, July 7, 2021 from 8:00am - 12:00pm EDT for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates, as well as se...
-
The Gilbreth cluster began experiencing issues with its scratch file system around 5:00pm EDT on Thursday, July 1st, 2021. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue...
-
The Bell cluster began experiencing issues with its home and scratch directories filesystem around 12:40pm EDT. Problems manifest as hanging new logins and unresponsive established sessions. Engineers are currently diagnosing the issue and are workin...
-
Intermittent Access Failures on Data Depot
As of Thursday, June 17th, 2021 at 11:00am EDT, users of community clusters may experience intermittent "permission denied" errors while trying to access their files on Data Depot. Errors may come and go, and may appear on both login and c...
-
The Gilbreth cluster will be unavailable Tuesday, June 8, 2021 at 8:00am EDT for scheduled maintenance. The cluster will return to full production by %enddatetime%. During this time, Gilbreth will have the operating system patched and new hardware ad...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, June 2, 2021 at 8:00am EDT for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any transfers which reque...
-
[Cancelled] Scholar Cluster Maintenance
The Scholar cluster will be unavailable Friday, May 14, 2021 at 8:00am EDT for scheduled maintenance. The cluster will return to full production by %enddatetime%. During this time, Scholar will have the operating system patched and several software u...
-
Data Depot Hardware Replacement and Migration
On Tuesday, May 11, 2021 at 5:00pm EDT, the Data Depot storage service will be unavailable while it will be transitioned to new hardware. All Depot access methods (SCP/SFTP, Windows network drives, Globus, NFS exports, direct mounts on Research Compu...
-
Whole-Floor Cluster Maintenance
The majority of Research Computing computational resources (Bell, Brown, Gilbreth, Halstead, Hammer, Scholar, WCERES, Workbench, and WSC Hadoop clusters) will be unavailable Tuesday, May 11, 2021 at 5:00pm EDT for Data Depot migration work. The clust...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, May 5, 2021 from 8:30am - 12:00pm EDT for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any transfers...
-
The Fortress tape archive began experiencing load-induced issues around 1:00pm EDT. Problems manifest as various errors and timeouts while trying to access Fortress or transfer data. Engineers are currently diagnosing the issue and are working to ide...
-
Unscheduled outage on multiple clusters
Due to problems with cooling system in the MATH datacenter, the CMS, Bell, Brown, Gilbreth, Halstead, WCERES, and WSC Hadoop clusters began experiencing issues around 4:00pm EDT. Multiple front-end, compute and storage services are affected. Engineer...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, April 7, 2021 from 8:30am - 12:00pm EDT for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any transfer...
-
ANSYS Fluent software unavailable on Bell
We have received multiple reports about ANSYS Fluent software on Bell cluster being unavailable. We are currently diagnosing the issue and are working to identify a fix. We will provide an update by 6pm tonight.
-
The Weber cluster will be taken down for regular maintenance and upgrades beginning on Tuesday, March 16th, 2021 at 8:00am EDT. During this time, Weber will have operating system updates applied. Users will be unable to log in or use Weber for inter...
-
The Workbench cluster began experiencing issues with its network uplink around 6:30pm EST. Engineers are currently diagnosing the issue and are working to identify a fix. We will provide an update by 10 pm.
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, March 3, 2021 from 8:30am - 12:00pm EST for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any transfer...