Outages

Sort By:

Featured Newest to Oldest Oldest to Newest Recently Published

Unscheduled Brown scratch outage
- May 5, 2020 4:30pm - 8:50pm EDT Last updated: May 5, 2020 8:50pm EDT
The Brown cluster began experiencing issues with its scratch filesystem around 4:30pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will pro...
Unscheduled Data Depot outage
- May 4, 2020 10:30am - 2:50pm EDT Last updated: May 4, 2020 2:50pm EDT
The Data Depot storage system began experiencing issues with No space left on device error messages around 10:30am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling on community clusters has been paus...
Unscheduled Weber outage
- May 4, 2020 10:00am - 12:00pm EDT Last updated: May 4, 2020 12:21pm EDT
The Weber cluster began experiencing issues around 10:00am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. We will provide an update by 2:00pm.
ECN license server outage
- April 24, 2020 9:30am - 2:30pm EDT
Engineering Computing Network (ECN) has reported an outage on the software license servers for ITaP Research Computing systems that are hosted by ECN. ITaP Research Computing cluster job scheduling is not affected by the outage, but licenses for soft...
Unscheduled Data Depot windows network drive outage
- April 21, 2020 9:00am - April 22, 2020 12:00pm EDT Last updated: April 22, 2020 11:56am EDT
Since Friday, April 17, the Research Data Depot filesystem has been unavailable on community cluster systems, but remained available through other means of access (such as Windows Network Drive). Around 9:00am EDT on Tuesday, April 21st, 2020, the D...
Running Jobs on Community Clusters While Data Depot is Unavailable
- April 20, 2020 9:00pm - April 22, 2020 3:40pm EDT
Since Friday, April 17, the Research Data Depot filesystem has been unavailable on community cluster systems due to an ongoing filesystem verification. While we don't believe there is any danger of data loss, the filesystem verification will continu...
Unscheduled Data Depot outage on the clusters
- April 17, 2020 5:00pm - April 22, 2020 3:40pm EDT Last updated: April 22, 2020 3:40pm EDT
The Brown, Gilbreth, Halstead, Hammer, Rice, Scholar, Snyder, and Workbench clusters began experiencing issues with connection to Data Depot filesystem around 5:00pm EDT on Friday, April 17th, 2020. Engineers are currently diagnosing the issue and ar...
Unscheduled Outage to Multiple Systems
- February 14, 2020 10:00am - 4:30pm EST Last updated: February 14, 2020 4:26pm EST
Hammer, Scholar, Snyder, WCERES, WSC Hadoop, and Data Depot began experiencing issues with networking around 10:00am EST. Engineers are currently diagnosing the issue and are working to identify a fix. We will provide an update by 1:00pm if the issue...
Unscheduled Gilbreth outage
- January 3, 2020 11:30am - January 5, 2020 8:35pm EST Last updated: January 5, 2020 8:35pm EST
The Gilbreth cluster began experiencing issues with its scratch filesystem around 11:30am EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will...
Emergency Fortress Outage
- October 10, 2019 5:00pm - 6:20pm EDT Last updated: October 10, 2019 6:22pm EDT
The Fortress Archive began experiencing issues with an internal database around 4:30pm. Engineers are currently working to remove the affected database from the system to mitigate the issue. Fortress should return to normal function once this is remo...
Unscheduled Depot Outage
- October 8, 2019 5:15pm - 8:45pm EDT Last updated: October 8, 2019 8:50pm EDT
Data Depot suffered a system failure around 5:15pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will provide an update by 9pm or earlier as...
Unscheduled Rice scratch outage
- September 4, 2019 4:40pm - September 5, 2019 11:20am EDT Last updated: September 5, 2019 11:20am EDT
The Rice cluster began experiencing issues with the scratch filesystem around 4:40pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will prov...
Unscheduled power outage affecting Brown and Hammer
- August 12, 2019 6:00am - 11:30am EDT Last updated: August 12, 2019 11:36am EDT
The Brown and Hammer clusters experienced a partial power outage overnight which caused them to operate at a reduced capacity. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this...
Unscheduled research homes outage affecting all clusters
- August 7, 2019 12:30pm - 2:45pm EDT Last updated: August 7, 2019 2:49pm EDT
All clusters began experiencing issues with the home file system around 12:30pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will provide a...
Scheduling of Jobs Paused on Brown
- July 19, 2019 2:45pm - 4:30pm EDT Last updated: July 20, 2019 7:20am EDT
Starting at 2:45pm EDT, new jobs are not being scheduled on the Brown cluster in order to lighten the load. The current hot weather outside is overtaxing Brown cooling systems. Existing jobs will continue to run as planned. As soon as the cooling sys...
Update - Extend Hammer Outage
- June 28, 2019 12:05pm - 3:00pm EDT
Work continues on bringing Hammer back to normal operation. Engineers have identified the source of the problem and are currently working to find a solution. We will provide another update by 3pm today.
Unscheduled Hammer outage - Closed
- June 28, 2019 10:10am - 1:30pm EDT
Update: The issues with the Hammer cluster has been resolved and the cluster is back in production. This outage is closed. Original: The Hammer cluster began experiencing issues around 10:10am EDT. Engineers are currently diagnosing the issue and are...
Unscheduled data center outage
- June 25, 2019 10:00am - 12:05pm EDT Last updated: June 25, 2019 12:04pm EDT
Several clusters have experienced network connectivity and/or power issues around 10:00am EDT. Engineers are working on assessing and analyzing the situation. Job scheduling on affected clusters has been paused while this issue is being addressed. We...
Rice scheduling temporarily stopped
- June 13, 2019 12:15pm - 5:30pm EDT Last updated: June 13, 2019 5:31pm EDT
The Rice cluster is currently experiencing a vendor bug in its scratch filesystem. To prevent filesystem instability, job scheduling has been paused while engineers are diagnosing the issue. Currently running jobs will not be affected (only starting...
Unscheduled Github outage
- May 3, 2019 10:00am - 1:00pm EDT Last updated: May 3, 2019 3:31pm EDT
Github is currently offline and is not responding. Engineers are currently working on bringing Github back up. We will provide another update later today or as soon as it is back online.

Results 161-180 of 319

Outages

Follow Us