Outages

Unscheduled Scholar partial outage
- March 24, 2023 8:30am - 1:30pm EDT
The Scholar cluster began experiencing issues with its ThinLinc remote desktop (desktop.scholar.rcac.purdue.edu) and its RStudio Server (rstudio.scholar.rcac.purdue.edu) around 8:30am EDT. Engineers are currently diagnosing the issue and are working...
Unscheduled Gilbreth outage
- March 18, 2023 10:30pm - March 19, 2023 1:30pm EDT
The Gilbreth cluster began experiencing issues with its scheduler spool filesystem around 10:30pm EDT on Saturday, March 18th, 2023. The problem manifests as an I/O error during new batch job submissions and in Open OnDemand gateway applications. Int...
Unscheduled Bell outage
- March 15, 2023 7:50pm - 10:50pm EDT
The Bell cluster began experiencing issues with its scratch filesystem around 7:50pm EDT. File access operations (e.g. ls) may appear hanging. Logins to the Open OnDemand gateway (gateway.bell.rcac.purdue.edu) may appear sluggish or hanging as well....
Unscheduled Bell outage
- March 8, 2023 12:55pm - 8:30pm EST
The Bell cluster began experiencing issues with its scratch filesystem around 12:55pm EST. File access operations (e.g. ls) may appear hanging. Logins to the Open OnDemand gateway (gateway.bell.rcac.purdue.edu) may appear sluggish or hanging as wel...
Unscheduled Anvil outage
- February 24, 2023 10:00am - February 25, 2023 11:50pm EST
The Anvil cluster began experiencing issues with its scratch and project file system around 10:00am EST. Access to scratch and project directories may be slow or hang. Engineers are currently diagnosing the issue and are working with the vendor to id...
Unscheduled Bell outage
- December 9, 2022 12:30pm - 4:00pm EST
The Bell cluster began experiencing issues with its Lustre scratch filesystem around 12:30pm EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We w...
Data Depot Degraded Performance
- November 7, 2022 10:00am - 12:20pm EST
The Data Depot began experiencing issues around 9:50am EST. While engineers work to diagnose and fix this issue, users may notice degraded performance in the form of sluggish I/O operations performed on Data Depot. This may also cause slow logins for...
Scheduling paused on multiple clusters
- November 7, 2022 9:50am - 12:15pm EST
The Bell, Brown, Gilbreth, Halstead, and Scholar clusters began experiencing issues with their Data Depot mounts around 9:50am EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while...
Scheduling Paused on Brown, Gilbreth, Halstead, and Hammer
- November 4, 2022 11:30am - 1:08pm EDT Last updated: November 4, 2022 12:01pm EDT
As of 11:30am EDT, the Brown, Gilbreth, Halstead, and Hammer clusters began experiencing issues with their filesystems which may cause login failures. Engineers are currently investigating the root cause, and in the interim, job scheduling has been p...
Unscheduled Scholar Gateway outage
- October 16, 2022 9:00pm - October 17, 2022 4:00pm EDT
The Scholar cluster began experiencing issues with its OnDemand Gateway around Sunday, October 16th, 2022 at 9:00pm EDT. The issue manifests as connection to gateway.scholar.purdue.edu timing out. Engineers are currently diagnosing the issue with the...
Bell Gateway Outage
- October 7, 2022 8:00am - October 10, 2022 5:10pm EDT
The Bell Gateway began experiencing issues following the Oct 4-6th Bell Maintenance. In particular, gateway applications have been observed to fail upon attempting to connect to the application after launching the job. Our engineers are investigating...
Bell Degraded Capacity
- September 28, 2022 12:00am - September 6, 2023 11:26am EDT
The Bell cluster continues to experience issues with hardware. Engineers are currently diagnosing the issues and are working with vendors to schedule and perform repairs as quickly as possible. Job scheduling continues, but you may experience longer...
Unscheduled Bell outage
- September 27, 2022 2:30pm - 3:45pm EDT
A section of the Bell cluster compute nodes began experiencing issues with power feed and cooling around 2:30pm EDT. Engineers have powered down affected nodes and are working to identify a fix. Some jobs may have ended up terminated or requeued. Jo...
Unscheduled Brown outage
- August 6, 2022 11:20am - August 11, 2022 1:20pm EDT
The Brown cluster began experiencing issues with its scratch filesystem around 11:20am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will pr...
MATH data center cooling outage
- August 3, 2022 1:40pm - 9:30pm EDT
The Math building data center began experiencing issues with its cooling system around 1:40pm EDT. To minimize thermal load on the cooling infrastructure, job scheduling has been paused and all idle compute nodes on Anvil, Bell, Geddes, Gilbreth, and...
Bell Scratch Degraded Storage (Returned to Service)
- June 29, 2022 - July 5, 2022
Bell Scratch is near capacity and performance is degraded. As of this morning, Bell Scratch was 94% full. This afternoon we paused scheduling as scratch was not responding consistently. We have more drives on order, but with global supply chain issue...
Unscheduled Halstead outage
- June 23, 2022 8:00am - June 24, 2022 8:15am EDT
The Halstead cluster began experiencing issues with its scratch file system around 8:00am EDT. The problem manifests as various I/O errors or hangs when reading, writing or listing scratch directories. Engineers are currently diagnosing the issue and...
Scheduling Paused on Brown and Hammer
- June 13, 2022 2:00pm - June 14, 2022 8:00pm EDT Last updated: June 13, 2022 3:45pm EDT
Beginning around 2:00pm EDT, the ailing cooling systems for Brown and Hammer began experiencing issues. To reduce the thermal load on the systems, scheduling of new jobs has been paused on Brown and Hammer and will not be resumed until after tomorrow...
Unscheduled campus power outage
- April 30, 2022 7:00pm - May 1, 2022 9:00am EDT
Several Research Computing resources were affected by a campus power outage around 7:00pm EDT. Multiple login and compute nodes may have powered down, causing jobs to fail and/or requeue with a NODE_FAIL or similar status. Engineers are currently d...
Unscheduled Bell outage
- April 9, 2022 9:00pm - April 10, 2022 10:15am EDT
The Bell cluster began experiencing issues with its scratch filesystem around 9:00pm EDT on Saturday, April 9th, 2022. Access to files in scratch may appear severely delayed or frozen. Engineers are currently diagnosing the issue and are working to...