Article #5695: Unscheduled Bell outage
The Bell cluster began experiencing issues with its scratch filesystem around 7:50pm EDT. File access operations (e.g. ls) may appear to hang. Logins...
The Bell cluster began experiencing issues with its scratch filesystem around 12:55pm EST. File access operations (e.g. ls) may appear to hang. Login...
The Anvil cluster began experiencing issues with its scratch and project file system around 10:00am EST. Access to scratch and project directories may...
The Bell cluster began experiencing issues with its Lustre scratch filesystem around 12:30pm EST. Engineers are currently diagnosing the issue and are...
The Data Depot began experiencing issues around 9:50am EST. While engineers work to diagnose and fix this issue, users may notice degraded performance...
The Bell, Brown, Gilbreth, Halstead, and Scholar clusters began experiencing issues with their Data Depot mounts around 9:50am EST. Engineers are cur...
As of 11:30am EDT, the Brown, Gilbreth, Halstead, and Hammer clusters began experiencing issues with their filesystems which may cause login failures....
The Scholar cluster began experiencing issues with its OnDemand Gateway around Sunday, October 16th, 2022 at 9:00pm EDT. The issue manifests as connec...
The Bell Gateway began experiencing issues following the Oct 4-6th Bell Maintenance. In particular, gateway applications have been observed to fail up...
The Bell cluster continues to experience hardware issues. Engineers are currently diagnosing the issues and are working with vendors to schedule...
A section of the Bell cluster compute nodes began experiencing issues with power feed and cooling around 2:30pm EDT. Engineers have powered down affec...
The Brown cluster began experiencing issues with its scratch filesystem around 11:20am EDT. Engineers are currently diagnosing the issue and are worki...
The Math building data center began experiencing issues with its cooling system around 1:40pm EDT. To minimize thermal load on the cooling infrastructu...
Bell Scratch is near capacity and performance is degraded. As of this morning, Bell Scratch was 94% full. This afternoon we paused scheduling as scrat...
The Halstead cluster began experiencing issues with its scratch file system around 8:00am EDT. The problem manifests as various I/O errors or hangs wh...
Beginning around 2:00pm EDT, the ailing cooling systems for Brown and Hammer began experiencing issues. To reduce the thermal load on the systems, sch...
Several Research Computing resources became affected by a campus power outage around 7:00pm EDT. Multiple login and compute nodes may have powered dow...
The Bell cluster began experiencing issues with its scratch filesystem around 9:00pm EDT on Saturday, April 9th, 2022. Access to files in scratch may...
The Weber cluster's data transfer server (weber-sftp.rcac.purdue.edu) suffered a cooling fan failure around 8:30pm EDT on Saturday, April 9th, 2022. T...
Following last night's scratch outage, the Gilbreth scratch filesystem is currently functional but operates with partially degraded performance. Engi...