News: Outages and Maintenance (Ended): P...

Outages and Maintenance

Halstead MPI problem, scheduling paused
- February 3, 2017 4:30pm - 11:59pm EST
Following the security updates on Halstead, an issue was discovered that prevented multi-node MPI jobs from running properly. Scheduling on Halstead has been stopped, and systems engineers are working on fixing the issue. We will provide further stat...
Emergency Security Patching of RCAC Clusters
- February 2, 2017 5:00pm - March 2, 2017 5:00pm EST
Due to a recent security vulnerability, the Carter, Halstead, Hammer, Radon, Rice, Scholar, and Snyder clusters will have their operating system upgraded to a newer version during February 2, 2017 5:00pm - March 2, 2017 5:00pm EST. Unlike other cl...
Unscheduled scratch outage on Conte
- January 30, 2017 4:30pm - 5:00pm EST Last updated: January 31, 2017 2:36pm EST
The scratch filesystem serving Conte is currently unavailable. Both currently running jobs and attempts to access files in scratch will block until the filesystem is back online. Job scheduling on Conte has been paused while storage engineers addres...
Connectivity issues to Research Data Depot
- January 19, 2017 10:00am - 11:00pm EST
System monitoring has revealed intermittent issues connecting to the Research Data Depot on Thursday January 19. When this issue occurs, users will experience pauses when working in a UNIX shell on community cluster systems, or as interrupted or drop...
Conte Cluster Maintenance
- January 19, 2017 8:00am - 11:59pm EST Last updated: January 20, 2017 1:44am EST
Conte is back in production, and jobs have started running. Thank you for your patience. ===== Because of additional work required to fix a configuration problem, this maintenance is running past the scheduled end time. We are extending the outage...
Emergency maintenance for GitHub
- January 12, 2017 9:00pm - 10:00pm EST Last updated: January 12, 2017 10:19pm EST
Patching has been completed and github.rcac.purdue.edu service is back in full production mode. Original message Tonight, Thursday, January 12, 2017, at 9:00pm – 10:00pm EST github.rcac.purdue.edu will be taken down for brief emergency maintenance. G...
Halstead Cluster Maintenance
- January 11, 2017 10:00am - 4:00pm EST
The maintenance work was completed successfully and Halstead has been returned to normal operations as of Wednesday, January 11, 2017 at 12:00pm. Original Message The Halstead cluster will be unavailable beginning at Wednesday, January 11th, 2017 at...
Carter Cluster Maintenance
- January 10, 2017 8:00am - 8:00pm EST Last updated: January 10, 2017 12:07pm EST
The maintenance for Carter cluster was cancelled and will be rescheduled at a later date. The cluster has remained in service. Original Notice The Carter cluster will be unavailable beginning at Tuesday, January 10th, 2017 at 8:00am EST, for emergen...
Scholar Cluster Maintenance
- January 5, 2017 8:00am - 5:00pm EST Last updated: January 5, 2017 4:58pm EST
The Scholar cluster will be unavailable beginning at Thursday, January 5th, 2017 at 8:00am EST, for scheduled maintenance. The cluster will return to full production by Thursday, January 5th, 2017 at 5:00pm EST. This work is being done during the se...
Halstead Cluster Maintenance
- January 4, 2017 10:00am - 4:00pm EST
The Halstead cluster will be unavailable beginning at Wednesday, January 4th, 2017 at 10:00am EST, for scheduled early-access maintenance (see Halstead Cluster Early Access Policies). The cluster will return to full production by Wednesday, January 4...
Halstead Cluster Maintenance
- December 21, 2016 10:00am - 4:50pm EST Last updated: December 21, 2016 4:57pm EST
The Halstead cluster is back online as of 4:50 PM after scheduled early-access maintenance. Unfortunately, queued jobs were lost due to complications during maintenance. If you had any jobs queued and waiting before maintenance started, you will need...
Unscheduled Outage for EXRC Cluster
- December 20, 2016 12:00pm - December 22, 2016 2:45pm EST Last updated: December 25, 2016 12:24am EST
Following the restoration of power to the affected building, the EXRC cluster has been returned to service on Thursday, December 22nd, 2016 at 2:45pm EST. Original article As of Tuesday, December 20th, 2016 at 12:00pm EST, EXRC is unavailable due to...
EXRC Scheduling Issue
- December 14, 2016 4:30pm - 8:00pm EST Last updated: December 14, 2016 7:57pm EST
UPDATE As of 7:50 pm, Wednesday, 14 December 2016, this issue is completely resolved. UPDATE As of about 6:00 pm another problem has been found in the EXRC scheduler code. We will update this news item once we have more details. Original Item The EXR...
Halstead Maintenance
- December 14, 2016 7:00am - 10:00am EST Last updated: December 14, 2016 10:32am EST
The maintenance work was completed successfully and Halstead has been returned to normal operations as of Wednesday December 14, 2016 at 10:00am. Original message: The Halstead cluster will be unavailable beginning at Wednesday, December 14th, 2016 a...
Halstead Maintenance
- December 7, 2016 1:00pm - 3:00pm EST
The Halstead cluster will be unavailable beginning at Wednesday, December 7th, 2016 at 1:00pm EST, for scheduled early-access maintenance (see Halstead Cluster Early Access Policies). The cluster will return to full production by Wednesday, December...
Unscheduled Data Depot Outage
- December 5, 2016 9:30pm - December 6, 2016 12:40am EST Last updated: December 6, 2016 12:39am EST
Update: Engineers were able to isolate the problem and restart the necessary systems. The Data Depot should be available again. Halstead users should double check their running work. A portion of the systems serving the Research Data Depot have suffe...
Job scheduling paused on Radon
- November 14, 2016 6:00pm - 7:00pm EST
Job scheduling was paused on Radon between 6 pm and 7 pm this evening. Node monitoring processes marked most nodes offline around 6 pm, preventing new jobs from starting. System engineers cleared the fault in the node monitoring, and nodes came back...
Emergency Cluster Maintenance
- November 5, 2016 8:00am - November 7, 2016 10:45pm EST Last updated: November 7, 2016 11:20pm EST
The Carter Cluster was returned to production at 10:45pm on November 7. We apologize for this extended outage. Update: November 7, 2016 6:01pm Work on reinstalling the Carter nodes continues. All other systems have returned normal operations. We...
Emergency Depot Maintenance
- November 5, 2016 8:00am - November 6, 2016 1:30pm EST Last updated: November 6, 2016 1:35pm EST
The Data Depot has concluded successfully, and it been returned to normal operations as of Sunday, November 6th, 2016 at 1:30pm EST. Original Message: The Data Depot will be taken down for emergency maintenance beginning at Saturday, November 5th, 20...
Emergency Firebox Virtual Server Maintenance
- November 5, 2016 8:00am - 11:30pm EDT Last updated: November 5, 2016 11:31pm EDT
The Firebox maintenance work concluded successfully as of Saturday, November 5th, 2016 at 11:30pm EDT. Original Message: All Firebox Virtual Servers will be taken down for emergency maintenance beginning at Saturday, November 5th, 2016 at 8:00am EDT....

Results 541-560 of 734

Outages and Maintenance

Follow Us