Outages and Maintenance

Sort By:

Featured Newest to Oldest Oldest to Newest Recently Published

Slowdown of Data Depot
- May 15, 2017 4:15pm - 8:45pm EDT Last updated: May 15, 2017 8:52pm EDT
As of 8:48pm the issue has been resolved. Original message The Research Data Depot is experiencing a system-wide slow down. Engineers have isolated the systems which are at the core of this phenomenon and are taking steps to restore normal service....
Scratch system failure on Rice, Snyder, Hammer
- May 12, 2017 3:30pm - 7:15pm EDT Last updated: May 12, 2017 7:06pm EDT
*** Update *** As of 7:00 pm, the problem on the scratch system has been corrected, and scheduling has resumed on all three affected clusters - Rice, Snyder, and Hammer. Update Storage engineers are working with the system vendor to evaluate a propos...
Unscheduled outage on Conte
- May 11, 2017 8:00am - 2:40pm EDT Last updated: May 11, 2017 2:42pm EDT
As of 2:35 pm, Conte cluster is returned to service. Scheduling is resumed in all queues. Update The source of the problem has been identified and the fix is underway. We anticipate returning Conte to service by 3pm today. Original message The Conte...
Emergency Maintenance on Rice, Snyder, Hammer
- April 26, 2017 2:40pm - April 27, 2017 12:00pm EDT Last updated: April 26, 2017 7:36pm EDT
As of 7:15pm, all queues on these clusters have resumed scheduling. Nodes will continue to be upgraded as they finish current jobs and become available. In the interim, the clusters will run in a degraded state, but will continue to start new jobs...
Unscheduled Data Depot Outage
- April 15, 2017 9:00am - 11:00am EDT
The Data Depot file system was sporadically available for 2 hours today. Some jobs running on the Community Clusters paused during the instability but have resumed. We expect no job loss to have occurred. This issue is now resolved.
EXRC Cluster Maintenance
- April 13, 2017 8:00am - 5:00pm EDT
Update: April 13, 2017 5:02pm The EXRC cluster has been returned to service. Original Message: The EXRC cluster will be unavailable beginning at Thursday, April 13th, 2017 at 8:00am EDT, for scheduled maintenance. The cluster will return to full prod...
Rice, Snyder, Hammer, Scholar Maintenance
- April 11, 2017 8:00am - April 12, 2017 3:00am EDT Last updated: April 12, 2017 2:59am EDT
The Hammer, Rice, Scholar, and Snyder clusters have been returned to service. Please note that Thinlinc clients and web browser access can be found at: Rice: desktop.rice.rcac.purdue.edu Hammer: desktop.hammer.rcac.purdue.edu Snyder: desktop.snyder.r...
Halstead Maintenance
- April 6, 2017 8:00am - 11:59pm EDT
The Halstead cluster will be unavailable beginning at Thursday, April 6th, 2017 at 8:00am EDT, for scheduled maintenance. The cluster will return to full production by Thursday, April 6th, 2017 at 11:59pm EDT. During this time, Halstead will have the...
Scheduler Issue on Halstead
- March 22, 2017 5:00pm - March 23, 2017 6:00pm EDT Last updated: March 23, 2017 11:33am EDT
Halstead nodes continue to come back online. While the cluster is operating normally, the total amount of available nodes is not yet at full capacity. We will update on the situation by 6:00pm. Update: Scheduling has been restarted and jobs are cur...
Emergency Carter Cluster Maintenance
- March 15, 2017 12:00pm - March 16, 2017 11:59pm EDT Last updated: March 16, 2017 5:05pm EDT
Update: Owner queues on Carter have been restarted. While Carter is currently deemed stable, performance is still impacted. Engineers are closely monitoring the situation and will take corrective action if necessary. Update: At this time, only Carter...
Planned Scholar Maintenance
- March 15, 2017 8:00am - 2:00pm EDT Last updated: March 15, 2017 11:53am EDT
UPDATE: As of 11:45a, the Scholar cluster maintenance was completed. Cluster is back in service. The Scholar cluster will be unavailable beginning on Wednesday, March 15th, 2017 at 8:00am EDT, for scheduled maintenance. The cluster will return to f...
RCAC Thinlinc Maintenance
- March 14, 2017 5:00pm - 8:30pm EDT Last updated: March 14, 2017 8:29pm EDT
UPDATE: At this time, the maintenance has been completed and is back in service. The Thinlinc cluster will be unavailable starting at 5pm on March 14th until midnight for necessary maintenance and upgrades. During this time, the remote desktop servic...
Unscheduled Fortress outage
- March 6, 2017 10:00am - 12:15pm EST Last updated: March 6, 2017 12:18pm EST
The Fortress archival storage system is currently experiencing intermittent connectivity. We expect the situation to be resolved by approximately 1pm. UPDATE: Storage engineers have resolved the connectivity problems and Fortress is back in full prod...
Partial scratch outages on Rice, Snyder, Carter, Scholar and Hammer
- March 6, 2017 9:00am - March 7, 2017 9:00am EST
The scratch filesystems serving Carter, Hammer, Rice, Scholar, and Snyder started behaving abnormally this morning. This may have affected some jobs, and anyone using one of the login nodes for these clusters may have had sessions freeze or seen dela...
Unscheduled Data Depot Outage
- February 21, 2017 1:00pm - 3:00pm EST Last updated: February 21, 2017 3:01pm EST
The Research Data Depot has been restored to service. A portion of the systems serving the Research Data Depot have suffered a failure. Some systems using Depot have been affected, particularly research clusters and users accessing the Depot over NFS...
Conte and Hathi Cluster Maintenance
- February 21, 2017 8:00am - 11:59pm EST Last updated: February 21, 2017 11:07pm EST
The Conte and Hathi clusters have been updated and returned to full production. This is a gentle reminder that the Conte and Hathi clusters will be undergoing a scheduled maintenance beginning at Tuesday, February 21st, 2017 at 8:00am EST. Please sa...
Unscheduled scratch outage on Rice, Snyder, and Hammer
- February 6, 2017 3:40pm - 6:15pm EST Last updated: February 6, 2017 6:12pm EST
The scratch filesystem serving Hammer, Rice, and Snyder is currently unavailable. Both currently running jobs and attempts to access files in scratch will block until the filesystem is back online. Job scheduling on Hammer, Rice, and Snyder has been...
Halstead MPI problem, scheduling paused
- February 3, 2017 4:30pm - 11:59pm EST
Following the security updates on Halstead, an issue was discovered that prevented multi-node MPI jobs from running properly. Scheduling on Halstead has been stopped, and systems engineers are working on fixing the issue. We will provide further stat...
Emergency Security Patching of RCAC Clusters
- February 2, 2017 5:00pm - March 2, 2017 5:00pm EST
Due to a recent security vulnerability, the Carter, Halstead, Hammer, Radon, Rice, Scholar, and Snyder clusters will have their operating system upgraded to a newer version during February 2, 2017 5:00pm - March 2, 2017 5:00pm EST. Unlike other cl...
Unscheduled scratch outage on Conte
- January 30, 2017 4:30pm - 5:00pm EST Last updated: January 31, 2017 2:36pm EST
The scratch filesystem serving Conte is currently unavailable. Both currently running jobs and attempts to access files in scratch will block until the filesystem is back online. Job scheduling on Conte has been paused while storage engineers addres...

Results 581-600 of 791

Outages and Maintenance

Follow Us