Article #7080: Unscheduled Bell Outage
At around 11:00am, Bell's scratch filesystem began to show signs of a severe performance degradation. We have paused job scheduling on Bell while eng...
At around 11:00am, Bell's scratch filesystem began to show signs of a severe performance degradation. We have paused job scheduling on Bell while eng...
The Gautschi cluster began experiencing issues with its power feed around 06:45am. Engineers are currently diagnosing the issue and are working to ide...
We have noticed a discrepancy in the allocation usage after the outage, so you may see incorrect usage for your allocation(s) from mybalance. Our engi...
The Gautschi cluster began experiencing issues with internal fabrics around 02:30 2025-02-13. Engineers are currently diagnosing the issue and are wor...
The Fortress storage system began experiencing issues earlier today related to one of its adminstrative servers. This results in access being denied t...
Update: Tuesday, January 21st, 2025 at 3:02pm EST: The situation has been corrected and job scheduling is running again on Negishi. The Negishi clust...
The Anvil cluster began experiencing issues with electrical power around 2:30 PM EST. RCAC engineers are working with Purdue electricians to safely re...
We are currently experiencing network connectivity problems with the Gautschi community cluster. Engineers are investigating and will provide an upda...
Data Depot and other group-restricted spaces began experiencing issues with file permissions around 5pm. Users will notice a "permission denied&q...
Data Depot began experiencing issues with file permissions around 8pm this evening that has since resolved itself. Users will have noticed a "per...
The Negishi Scratch file system began experiencing issues around 11am EST today (December 16). The issue manifests as a "No space left on device&...
Shortly before 9:00AM eastern time, many RCAC clusters experienced a power interruption which interrupted some work due to a campus power outage. UPDA...
Starting around 3:00PM eastern time, Bell's scratch file system began to experience service interruptions and periods of unresponsiveness to user requ...
The Bell cluster began experiencing issues with the /scratch filesystem. Engineers are currently diagnosing the issue and are working to identify a fi...
Fortress began experiencing issues with communications around 3:00AM. Engineers are currently diagnosing the issue and are working to identify a fix....
The Globus connection began experiencing issues on Data Depot endpoint around 11:00am EDT. Engineers are currently diagnosing the issue and are workin...
The Bell cluster began experiencing issues with the /scratch filesystem at around 5pm today. Engineers are currently diagnosing the issue and are work...
The Scholar cluster began experiencing issues with adding new students from the courses onto the cluster. Engineers are currently diagnosing the issue...
The Negishi cluster began experiencing issues with SLURM accounting around 8:00am EDT. Engineers are currently diagnosing the issue and are working to...
The Scholar custer down from approximately 4:30PM onwards on Wednesday, March 27, 2024. Engineers have solved the issue and Scholar should return to s...