Maintenance
-
The maintenance is complete and github.rcac is back in production, now at version 2.6.4. There are several bug fixes and security patches in this version, but no major feature updates. Additionally, an infrastructure change has been made to the way...
-
The Depot maintenance has been completed successfully. Depot is now back to normal operations. Other compute cluster outages running concurrent with this however, are still in progress. This maintenance window next week has been reduced as much as...
-
An issue was discovered shortly after Conte, Hansen, Hathi, and Radon were brought back online with the /group path on several front-ends and nodes. Any scripts or jobs that rely on the /group path may have had issues immediately following return to...
-
Carter and Scholar are back online for use as of 6:25am, though they will be operating with many nodes still offline. Staff will be working through Wednesday to steadily increase the number of nodes available. This concludes the POD cluster mainten...
-
Conte has been returned to normal operations as of Wednesday, May 18th, 2016 at 5:55am EDT. All upgrades were completed, though a small number of nodes which require more attention to be fully ready for jobs remain offline for now and will be return...
-
The scheduler issue has been resolved, and Conte has been returned to normal operations as of Wednesday, February 10th, 2016 at 9:30pm EST. Update: February 10, 2016 7:04pm There was a minor issue discovered with the newly upgraded scheduler which i...
-
Hathi & WinHPC Power Maintenance
The Hathi and WinHPC clusters will be unavailable beginning at Thursday, February 4th, 2016 at 6:00am EST, for scheduled maintenance to the power feed. Both clusters will return to full production by Thursday, February 4th, 2016 at 5:00pm EST. During...
-
Fortress will be unavailable from 8:00am to 9:00am Wednesday, 3 February, 2016 for routine maintenance.
-
Rice and Snyder Cluster Maintenance
As of 10:40pm, the Snyder cluster was returned to normal service in the POD. This concludes this maintenance. Update: February 5, 2016 8:54pm As of 8:25 pm, Friday, 5 Feb 2016, the Rice cluster maintenance has completed and the system is returning...
-
Carter has been returned to normal operation. Update: January 20, 2016 3:26pm: We are doing return to service testing now and expect Carter to return to production by 7:00pm. Update: January 20, 2016 12:00pm: Work is being wrapped up on Carter and...
-
January 7, 2016, 6pm The Fortress move has completed and has been returned to production. Original Due to a failure in the notice system, the earlier attempts to notify of this work which were sent on Dec 7th and Jan 3rd were not delivered. The Fortr...
-
Carter has been return to normal operations. All queues have been enabled. Update: December 2, 2015 12:15pm Carter is mostly ready to return to service, but the site-wide home filesystem has suffered a failure which is preventing this from being co...
-
The Fortress Archive service, Fortress, will be unavailable starting Wednesday, November 4th, 2015 at 6:00am EST for regular maintenance and will return at Wednesday, November 4th, 2015 at 8:00am EST. During this time, access via HSI, HTAR, Globus of...
-
November 3, 2015 6:15pm The maintenance for Radon is completed and the cluster has been returned to production. Original The Radon cluster will be unavailable beginning at Tuesday, November 3, 2015 from 7:00am - 7:00pm EST, for scheduled maintenance....
-
October 22, 2015 9:15pm All services have been restored and Hammer is now in production. October 22, 2015 7:00pm Engineers continue to work through issues relating to the move. Another update will be sent at 9pm. Original The Hammer cluster will be...
-
Emergency scratch maintenance on Carter and Scholar
The scratch filesystem serving Carter/Scholar underwent emergency maintenance through Friday night and well into Saturday. We expect this work to resolve the periodic hangs this filesystem has been experiencing for the last two days. Job scheduling...
-
Cluster Maintenance - Hansen/Peregrine1
Update: September 22, 2015 1pm The work affecting Hansen and Peregrine1 scratch filesystems has been completed and the clusters are back in full production. Original The Hansen and Peregrine1 cluster will be unavailable beginning at Tuesday, Septembe...
-
Update: September 23, 2015 8am Shortly after 2am, Engineers were able to complete the file transfer and return Carter back to production. Update: September 22, 2015 11pm The file transfer continues and will last well into the night. The next update...
-
As of 11:55 pm August 18, 2015, Fortress/HPSS has been brought back online. Storage engineers continue working on bringing upgraded Fortress up and deploying new software to all RCAC systems. Current estimate for return to service: 12:00 am August 1...
-
Cluster Maintenance - Peregrine1
The Peregrine1 cluster will be unavailable beginning at August 17, 2015 8:00am - August 19, 2015 6:00pm EDT, for scheduled maintenance. The cluster will return to full production by Wednesday, August 19th, 2015 at 6:00pm EDT. During this time, Pere...