Outages

Sort By:

Featured Newest to Oldest Oldest to Newest Recently Published

Unscheduled Brown outage
- November 8, 2020 4:00pm - November 9, 2020 8:30am EST Last updated: November 9, 2020 9:19am EST
The Brown cluster began experiencing issues with its job scheduler around 4:00pm EST. The problem manifests itself as Slurm-related commands (slist, squeue, sinteractive, sbatch, etc) being slow, unresponsive or timing out. Queue selection dialogs in...
Unscheduled RCAC GitHub outage
- October 16, 2020 7:00pm - 7:30pm EDT Last updated: October 16, 2020 7:36pm EDT
The github.rcac server will be briefly unavailable Friday, October 16, 2020 from 7:00pm – 11:59pm for an emergency maintenance. During this time, the server will undergo maintenance tasks that can not be completed with the server in production. Opera...
Halstead Scratch Issues
- October 11, 2020 1:30pm - 4:15pm EDT Last updated: October 11, 2020 5:01pm EDT
The Halstead cluster began experiencing issues with its scratch filesystem around 1:15 pm, Sunday 11 Oct. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being address...
Unscheduled Halstead outage
- October 8, 2020 9:00pm - October 10, 2020 8:00am EDT Last updated: October 10, 2020 8:17am EDT
The Halstead cluster began experiencing issues with its scratch filesystem around 9:00pm. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will prov...
Halstead Scratch Issues
- September 27, 2020 10:00am - 11:00pm EDT Last updated: September 27, 2020 9:54pm EDT
Halstead's scratch began experiencing issues this morning (Sunday 27 Sep). Job scheduling has been paused while engineers and the system vendor investigate the issue. We will have an update by tomorrow morning (Monday 28 Sep) at 10:00 am.
Halstead Scratch Issues
- September 11, 2020 9:50am - 4:00pm EDT Last updated: September 11, 2020 3:39pm EDT
Halstead's scratch began experiencing issues at approximately 2:00 AM. Some users have reported that they are unable to read or index files within their personal scratch directories when logged in from certain front-ends. Job scheduling has been pa...
Unscheduled Fortress outage
- September 1, 2020 12:30am - September 3, 2020 5:00pm EDT Last updated: September 3, 2020 4:59pm EDT
The Fortress tape archive began experiencing issues with its disk cache subsystem being full on Tuesday, September 1st, 2020 around 12:30am EDT. The problems manifest themselves as intermittent Error -1, Error -28, and No space left on device error m...
Unscheduled Fortress outage
- August 27, 2020 9:00pm - August 29, 2020 10:00am EDT Last updated: August 29, 2020 10:01am EDT
The Fortress tape archive began experiencing issues with its disk cache subsystem on Thursday, August 27th, 2020 around 9:00pm EDT. The problems manifest themselves as intermittent Error -1, Error -28, and No space left on device error messages in HS...
Unscheduled Fortress outage
- August 25, 2020 9:00pm - August 27, 2020 4:30pm EDT Last updated: August 27, 2020 4:40pm EDT
The Fortress tape archive began experiencing issues with Error -1, Error -28, and No space left on device error messages in HSI and Globus around 9:00pm EDT on Tuesday, August 25th, 2020. Engineers are currently diagnosing the issue and are working...
Central Authentication Service (CAS) Outage
- August 24, 2020 10:43am - August 25, 2020 10:43am EDT
This morning, BoilerKey authentication for all community clusters and user facing services (such as the RCAC website, Rstudio Server) is unavailable due to a Central Authentication Service (CAS) outage. All the clusters are under normal operations an...
Unscheduled Cluster Outage
- August 14, 2020 8:00am - 12:30pm EDT
As of 12:30pm EDT all the clusters are back in production. If your job crashed during the outage, please resubmit it. We are currently experiencing an outage across the community clusters (Brown, Gilbreth, Halstead, Hammer, Rice, Scholar, Snyder, WC...
Unscheduled RCAC GitHub outage
- July 30, 2020 8:00am - 11:00am EDT Last updated: July 30, 2020 11:02am EDT
The Research Computing GitHub service (github.rcac.purdue.edu) is currently down. Engineers are currently diagnosing the issue and are working to identify a fix. We will provide an update by 12:00pm.
Unscheduled Brown scratch outage
- June 11, 2020 12:00pm - June 19, 2020 8:30pm EDT Last updated: June 19, 2020 8:30pm EDT
The Brown cluster began experiencing issues with its scratch filesystem around 12:00pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will pr...
Unscheduled Brown scratch outage
- May 15, 2020 12:30pm - 5:10pm EDT Last updated: May 15, 2020 5:13pm EDT
The Brown cluster began experiencing issues with its scratch filesystem around 12:30pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed.
Unscheduled data.rcac Transfer Node Outage
- May 11, 2020 3:00pm - May 12, 2020 12:30pm EDT Last updated: May 12, 2020 3:31pm EDT
The data.rcac.purdue.edu data transfer node began experiencing issues and was taken down at 3:00pm EDT. Engineers are currently diagnosing the issue. Data may be transferred to/from other clusters using those clusters' login nodes, and for Data Depot...
Unscheduled Home Directory Outage
- May 8, 2020 2:30pm - May 9, 2020 9:00pm EDT Last updated: May 9, 2020 9:03pm EDT
The Brown, Gilbreth, Halstead, Rice, Scholar, Snyder, and Workbench clusters began experiencing issues with intermittently slow home directories access around 2:30pm EDT. The issue has been traced to a high load on one of the filesystem's back-end se...
Unscheduled Brown scratch outage
- May 5, 2020 4:30pm - 8:50pm EDT Last updated: May 5, 2020 8:50pm EDT
The Brown cluster began experiencing issues with its scratch filesystem around 4:30pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will pro...
Unscheduled Data Depot outage
- May 4, 2020 10:30am - 2:50pm EDT Last updated: May 4, 2020 2:50pm EDT
The Data Depot storage system began experiencing issues with No space left on device error messages around 10:30am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling on community clusters has been paus...
Unscheduled Weber outage
- May 4, 2020 10:00am - 12:00pm EDT Last updated: May 4, 2020 12:21pm EDT
The Weber cluster began experiencing issues around 10:00am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. We will provide an update by 2:00pm.
ECN license server outage
- April 24, 2020 9:30am - 2:30pm EDT
Engineering Computing Network (ECN) has reported an outage on the software license servers for ITaP Research Computing systems that are hosted by ECN. ITaP Research Computing cluster job scheduling is not affected by the outage, but licenses for soft...

Results 141-160 of 315

Outages

Follow Us