Article #3810: Unscheduled Brown and Hammer outage
The Brown and Hammer clusters began experiencing issues with cooling due to problems at the Physical Facilities' chiller plant around 4:40pm EDT. To a...
The Brown and Hammer clusters began experiencing issues with cooling due to problems at the Physical Facilities' chiller plant around 4:40pm EDT. To a...
The Bell, Brown, Gilbreth, Halstead, Hammer, Scholar, Workbench clusters and Data Depot began experiencing issues with intermittent high load on the D...
The Bell cluster began experiencing issues with high load and sluggish performance on the scratch filesystem around 1:20pm EDT. Engineers are currentl...
The Weber cluster began experiencing issues with expired VPN certificate around 10:00am EST. Engineers are currently diagnosing the issue and are work...
The Bell cluster began experiencing issues with its scratch filesystem around 6:30pm EST. Engineers are currently diagnosing the issue and are working...
The Weber cluster began experiencing issues with weber-sftp subsystem around 2:00pm EST. The problem affects ingress/egress path to the cluster. Eng...
The Bell cluster began experiencing issues with scheduler database around 11:35am EST. The problem manifests as freezing and/or "socket timed out...
The Gilbreth cluster began experiencing issues with its Data Depot mounts around 9:00am EST. The /depot filesystem is not visible on some of the login...
As of 8:00pm EST on Friday, February 11th, 2022 the Data Depot filesystem outage has been resolved and scheduling has been resumed on all clusters....
The Math building data center began experience issues with its cooling system around 11:40am EST. As one of manifestations, users may experience issu...
The Math building data center began experience issues with its cooling system around 11:40am EDT. As one of manifestations, users may experience issu...
As of 9:00am EDT, users of community clusters may experience slowness while trying to access Data Depot (including loading modules, starting applicati...
The Gilbreth cluster began experiencing issues with its scratch filesystem around 7:00pm EDT. Engineers are currently diagnosing the issue and are wor...
Following last night's scratch outage, the Gilbreth scratch filesystem is currently functional but operates with partially degraded performance. Engi...
The Weber cluster's data transfer server (weber-sftp.rcac.purdue.edu) suffered a cooling fan failure around 8:30pm EDT on Saturday, April 9th, 2022. T...
The Bell cluster began experiencing issues with its scratch filesystem around 9:00pm EDT on Saturday, April 9th, 2022. Access to files in scratch may...
Several Research Computing resources became affected by a campus power outage around 7:00pm EDT. Multiple login and compute nodes may have powered dow...
Beginning around 2:00pm EDT, the ailing cooling systems for Brown and Hammer began experiencing issues. To reduce the thermal load on the systems, sch...
The Halstead cluster began experiencing issues with its scratch file system around 8:00am EDT. The problem manifests as various I/O errors or hangs wh...
Bell Scratch is near capacity and performance is degraded. As of this morning, Bell Scratch was 94% full. This afternoon we paused scheduling as scrat...