Bell
-
Bell scratch purging policy change
As announced during the recent Bell outage, some temporary austerity measures need to be implemented to prevent the scratch file system from filling up and causing more outages. Hardware to expand the scratch file system has been on order for some ti...
-
MATH data center cooling outage
The Math building data center began experience issues with its cooling system around 1:40pm EDT. To minimize thermal load on the cooling infrastructure, job scheduling has been paused and all idle compute nodes on Anvil, Bell, Geddes, Gilbreth, and...
-
A section of the Bell cluster compute nodes began experiencing issues with power feed and cooling around 2:30pm EDT. Engineers have powered down affected nodes and are working to identify a fix. Some jobs may have ended up terminated or requeued. Jo...
-
The Bell cluster continues to experience issues with Hardware. Engineers are currently diagnosing the issues and are working with vendors to schedule and perform repairs as quickly as possible. Job scheduling continues, but you may experience longer...
-
The Bell cluster will be unavailable for use October 4, 2022 8:00am - October 5, 2022 11:59pm EDT while we perform scheduled maintenance to expand Bell's scratch storage capabilities and make upgrades to Bell's AMD GPU drivers and other system soft...
-
The Bell Gateway began experiencing issues following the Oct 4-6th Bell Maintenance. In particular, gateway applications have been observed to fail upon attempting to connect to the application after launching the job. Our engineers are investigating...
-
Scheduling paused on multiple clusters
The Bell, Brown, Gilbreth, Halstead, and Scholar clusters began experiencing issues with their Data Depot mounts around 9:50am EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while...
-
The Bell cluster began experiencing issues with its Lustre scratch filesystem around 12:30pm EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We w...
-
Research Computing Holiday Break
Research Computing personnel will observe the university winter break from 5:00pm EST on Thursday, December 22nd, 2022, and will resume normal business hours on Tuesday, January 3rd, 2023. During this time, Research Computing services will continue...
-
The Bell cluster began experiencing issues with its scratch filesystem around 12:55pm EST. File access operations (e.g. ls) may appear hanging. Logins to the Open OnDemand gateway ( gateway.bell.rcac.purdue.edu) may appear sluggish or hanging as wel...
-
The Bell cluster began experiencing issues with its scratch filesystem around 7:50pm EDT. File access operations (e.g. ls) may appear hanging. Logins to the Open OnDemand gateway (gateway.bell.rcac.purdue.edu) may appear sluggish or hanging as well....
-
BoilerKey Transition to Purdue Login
Overnight on June 26-27th (Monday-Tuesday), all Purdue systems which use BoilerKey, including RCAC clusters and other systems, will switch to the new Purdue Login. For more information about this change, please see the following documentation: https:...
-
Hello, PurdueIT is replacing our ticketing system, Footprints, with a new product called Team Dynamix (TDX). Email to rcac-help@purdue.edu will begin to forward to the TDX environment effective Wednesday, July 12, 2023. You may notice a difference in...
-
Data Depot degraded performance on RCAC clusters
Users of Data Depot on RCAC clusters are currently experiencing significant performance degradation. The symptoms manifest as delays in listing or accessing files in /depot, significant lags in terminal sessions (especially if you have Data Depot in...
-
Update: As of 3:45pm, the Bell cluster has returned to production status. Scheduling is still paused on the Negishi cluster, and we will have an update by 5:00pm EDT The Bell and Negishi clusters began experiencing issues with power around 1:00pm EDT...
-
Update to Coffee-Hour Consultation Schedule
A few announcements for the RCAC community regarding our regular Coffee-Hour Consultation services. Support staff will be unavailable both today (Monday) and tomorrow (Tuesday) because of other scheduled events. Notably, tomorrow is our Annual Cyberi...
-
Multiple clusters have been powered off in MATH G109 datacenter due to a water issue in the building. Affected systems are Bell, Brown, Geddes, Gilbreth and Negishi. We will provide an update by 5:00 PM today.
-
The Bell cluster will be unavailable beginning Tuesday, October 31st, 2023 at 8:00am for a scheduled maintenance. The cluster will return to full production by Tuesday, October 31st, 2023 at 5:00pm. During this time, Bell will have minor operating sy...
-
Potential Ticket Response Delays
As the holiday season approaches, we want to inform you about potential delays in our ticket response times until the end of this year. Our support team is currently working with reduced staff, impacting our ability to respond promptly. Additionally...
-
The Bell cluster will be unavailable beginning Wednesday, December 6th, 2023 at 8:00am for a scheduled maintenance. The cluster will return to full production by Wednesday, December 6th, 2023 at 5:00pm. During this time, Bell will have a configuratio...