Article #957: Emergency Carter Cluster Maintenance
Update: Owner queues on Carter have been restarted. While Carter is currently deemed stable, performance is still impacted. Engineers are closely moni...
Update: Owner queues on Carter have been restarted. While Carter is currently deemed stable, performance is still impacted. Engineers are closely moni...
UPDATE: As of 11:45a, the Scholar cluster maintenance was completed. Cluster is back in service. The Scholar cluster will be unavailable beginning o...
UPDATE: At this time, the maintenance has been completed and is back in service. The Thinlinc cluster will be unavailable starting at 5pm on March 14t...
The Fortress archival storage system is currently experiencing intermittent connectivity. We expect the situation to be resolved by approximately 1pm....
The scratch filesystems serving Carter, Hammer, Rice, Scholar, and Snyder started behaving abnormally this morning. This may have affected some jobs,...
The Research Data Depot has been restored to service. A portion of the systems serving the Research Data Depot have suffered a failure. Some systems u...
The Conte and Hathi clusters have been updated and returned to full production. This is a gentle reminder that the Conte and Hathi clusters will be u...
The scratch filesystem serving Hammer, Rice, and Snyder is currently unavailable. Both currently running jobs and attempts to access files in scratch...
Following the security updates on Halstead, an issue was discovered that prevented multi-node MPI jobs from running properly. Scheduling on Halstead h...
Due to a recent security vulnerability, the Carter, Halstead, Hammer, Radon, Rice, Scholar, and Snyder clusters will have their operating system upgra...
The scratch filesystem serving Conte is currently unavailable. Both currently running jobs and attempts to access files in scratch will block until th...
System monitoring has revealed intermittent issues connecting to the Research Data Depot on Thursday January 19. When this issue occurs, users will ex...
Conte is back in production, and jobs have started running. Thank you for your patience. ===== Because of additional work required to fix a configura...
Patching has been completed and github.rcac.purdue.edu service is back in full production mode. Original message Tonight, Thursday, January 12, 2017,...
The maintenance work was completed successfully and Halstead has been returned to normal operations as of Wednesday, January 11, 2017 at 12:00pm. Orig...
The maintenance for Carter cluster was cancelled and will be rescheduled at a later date. The cluster has remained in service. Original Notice The Ca...
The Scholar cluster will be unavailable beginning at Thursday, January 5th, 2017 at 8:00am EST, for scheduled maintenance. The cluster will return to...
The Halstead cluster will be unavailable beginning at Wednesday, January 4th, 2017 at 10:00am EST, for scheduled early-access maintenance (see Halstea...
The Halstead cluster is back online as of 4:50 PM after scheduled early-access maintenance. Unfortunately, queued jobs were lost due to complications...
Following the restoration of power to the affected building, the EXRC cluster has been returned to service on Thursday, December 22nd, 2016 at 2:45pm...