Outages

Unscheduled Scratch Outage on Rice, Snyder, Scholar
- October 24, 2017 4:00pm - 9:03pm EDT Last updated: October 24, 2017 9:03pm EDT
The scratch filesystem serving Rice, Scholar, and Snyder is currently unavailable. Both currently running jobs and attempts to access files in scratch will block until the filesystem is back online. Job scheduling on Rice, Scholar, and Snyder has bee...
Unscheduled scratch outage on Rice, Scholar and Snyder
- October 20, 2017 8:20am - 5:50pm EDT Last updated: October 20, 2017 5:52pm EDT
The scratch filesystem serving Rice, Scholar, and Snyder is currently unavailable. Both currently running jobs and attempts to access files in scratch will block until the filesystem is back online. Job scheduling on Rice, Scholar, and Snyder has be...
Unscheduled Depot Outage
- September 7, 2017 1:30pm - 3:00pm EDT Last updated: September 7, 2017 2:58pm EDT
Access to Data Depot from the Halstead, HalsteadGPU, Hathi, Rice, Scholar, and Snyder clusters has hung starting around Thursday, September 7th, 2017 at 1:30pm EDT. Engineers are currently working to restore service to these systems. Job scheduling h...
Unscheduled Outage in Math Data Center
- September 5, 2017 2:00pm - September 15, 2017 2:00pm EDT Last updated: September 15, 2017 12:58pm EDT
At approximately 2:00pm EDT on Tuesday, September 5th, 2017, the Math building data center lost some power feeds which supply the Conte, Halstead, HalsteadGPU, Hathi, and Radon clusters. Scheduling on these has been paused until we can be sure power...
Unscheduled Depot Outage
- August 5, 2017 6:30am - 9:15am EDT Last updated: August 5, 2017 9:26am EDT
A failure has occurred in the systems which serve Data Depot to the various research clusters. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused on all systems while this issue is being add...
Unscheduled outages on portions of clusters
- July 20, 2017 12:30pm - 5:15pm EDT Last updated: July 20, 2017 5:20pm EDT
Conte, Halstead, HalsteadGPU, and Hammer are back in full production. Job scheduling has been resumed on all clusters. Please let us know if you see any lingering issues at rcac-help@purdue.edu. UPDATE July 20, 2017 2:54pm Power has been restored to...
Email to "rcac-help@purdue.edu" not Working
- June 29, 2017 5:00pm - June 30, 2017 3:45pm EDT Last updated: June 30, 2017 4:00pm EDT
As of 3:45pm Friday, the rcac-help@purdue.edu address is working normally again. Original Message Beginning 5:00pm Thursday, the rcac-help@purdue.edu email address stopped accepting email. Anything sent since then has not been received. We are workin...
Email notifications from Research Computing website broken
- June 29, 2017 5:00pm - July 6, 2017 3:00pm EDT Last updated: July 6, 2017 3:09pm EDT
Email notifications are up and running again as usual. Original Message As of 5pm Thursday evening, email notifications from the Research Computing website are not working. Some people are receiving no email and others are receiving damaged emails. T...
Halstead Scheduling Outage
- May 25, 2017 10:30am - June 2, 2017 10:30am EDT Last updated: June 9, 2017 10:41am EDT
Nodes have continued to gradually reboot into the new image as jobs complete. At this point, more than 80% of Halstead has completed this process, and we have not seen any issues in them doing so. This outage is closed. Update: May 25, 2017 5:00pm...
Data Depot Outage
- May 18, 2017 1:50pm - 3:10pm EDT Last updated: May 18, 2017 3:26pm EDT
Engineers have restored the failed core servers to a functional state. Data Depot is up and running as normal and job scheduling has resumed. Should you encounter any lingering issues, please let us know at rcac-help@purdue.edu. Original Message: Some core...
Slowdown of Data Depot
- May 15, 2017 4:15pm - 8:45pm EDT Last updated: May 15, 2017 8:52pm EDT
As of 8:48pm the issue has been resolved. Original message: The Research Data Depot is experiencing a system-wide slowdown. Engineers have isolated the systems which are at the core of this phenomenon and are taking steps to restore normal service....
Scratch system failure on Rice, Snyder, Hammer
- May 12, 2017 3:30pm - 7:15pm EDT Last updated: May 12, 2017 7:06pm EDT
*** Update *** As of 7:00 pm, the problem on the scratch system has been corrected, and scheduling has resumed on all three affected clusters - Rice, Snyder, and Hammer. Update Storage engineers are working with the system vendor to evaluate a propos...
Unscheduled outage on Conte
- May 11, 2017 8:00am - 2:40pm EDT Last updated: May 11, 2017 2:42pm EDT
As of 2:35 pm, the Conte cluster has been returned to service. Scheduling has resumed in all queues. Update: The source of the problem has been identified and the fix is underway. We anticipate returning Conte to service by 3pm today. Original message: The Conte...
Unscheduled Data Depot Outage
- April 15, 2017 9:00am - 11:00am EDT
The Data Depot file system was sporadically available for 2 hours today. Some jobs running on the Community Clusters paused during the instability but have resumed. We expect no job loss to have occurred. This issue is now resolved.
Scheduler Issue on Halstead
- March 22, 2017 5:00pm - March 23, 2017 6:00pm EDT Last updated: March 23, 2017 11:33am EDT
Halstead nodes continue to come back online. While the cluster is operating normally, the total number of available nodes is not yet at full capacity. We will provide an update on the situation by 6:00pm. Update: Scheduling has been restarted and jobs are cur...
Unscheduled Fortress outage
- March 6, 2017 10:00am - 12:15pm EST Last updated: March 6, 2017 12:18pm EST
The Fortress archival storage system is currently experiencing intermittent connectivity. We expect the situation to be resolved by approximately 1pm. UPDATE: Storage engineers have resolved the connectivity problems and Fortress is back in full prod...
Partial scratch outages on Rice, Snyder, Carter, Scholar and Hammer
- March 6, 2017 9:00am - March 7, 2017 9:00am EST
The scratch filesystems serving Carter, Hammer, Rice, Scholar, and Snyder started behaving abnormally this morning. This may have affected some jobs, and anyone using one of the login nodes for these clusters may have had sessions freeze or seen dela...
Unscheduled Data Depot Outage
- February 21, 2017 1:00pm - 3:00pm EST Last updated: February 21, 2017 3:01pm EST
The Research Data Depot has been restored to service. A portion of the systems serving the Research Data Depot have suffered a failure. Some systems using Depot have been affected, particularly research clusters and users accessing the Depot over NFS...
Unscheduled scratch outage on Rice, Snyder, and Hammer
- February 6, 2017 3:40pm - 6:15pm EST Last updated: February 6, 2017 6:12pm EST
The scratch filesystem serving Hammer, Rice, and Snyder is currently unavailable. Both currently running jobs and attempts to access files in scratch will block until the filesystem is back online. Job scheduling on Hammer, Rice, and Snyder has been...
Halstead MPI problem, scheduling paused
- February 3, 2017 4:30pm - 11:59pm EST
Following the security updates on Halstead, an issue was discovered that prevented multi-node MPI jobs from running properly. Scheduling on Halstead has been stopped, and systems engineers are working on fixing the issue. We will provide further stat...
