Maintenance
-
Fortress HPSS Software Upgrade
Update: The issues with the HPSS upgrade have been resolved with assistance from the vendor, and the system is now in normal operation. Anyone using HSI or HTAR from their personal systems will need to upgrade their client to the latest (version 5) c...
-
Network Maintenance for Carter, Conte, and Hansen clusters
The Carter and Conte clusters will be briefly unavailable on Monday, October 13, 2014 for upgrades to the clusters' respective network routers. This upgrade will significantly improve Conte and Carter's network connectivity, as part of ITaP's 2014 re...
-
Update: The vendor has identified and confirmed a bug in the system's software. The engineers continue to assess the situation and are working on a plan of action. The outage window has been extended further. Update: The Depot maintenance ran into some pr...
-
The Rossmann cluster will be unavailable on Tuesday, December 2, 2014 for maintenance on the cluster's network interconnect and for routine operating system patches. Any PBS jobs that request a walltime that would take them past 6:00am on Tuesday,...
-
Important Operating System Updates - Hathi Hadoop Cluster
On the morning of Tuesday, February 3, 2015, Hathi login nodes will be rebooted to apply an important Red Hat Linux operating system update.
-
Important operating system updates - Community Clusters
On the morning of Thursday, February 5, 2015, Carter, Conte, Hansen, Peregrine1, Radon, and Rossmann login servers will be rebooted to apply an important Red Hat Linux operating system update. Additionally, during this time scratch storage servers w...
-
Security updates for Conte and Radon scratch
Conte scratch has returned to full production as of approximately 2:15 pm. Update: February 26, 2015 1:20pm Radon scratch has returned to full production as of approximately 10:30 am. The work on Conte scratch is progressing as planned. Original mes...
-
Research Data Depot Security Updates
As of 3:15 pm, the maintenance is complete and the Research Data Depot has returned to full production. Original message: The storage servers powering the Research Data Depot will undergo maintenance on Thursday, February 26, 2015 from 10:00am - 4:00pm EST...
-
ROSSMANN IS ALIVE Rossmann Cluster is now back online. Kudos to the maintenance team and thanks for your patience! UPDATE 17 March, 2015 Our systems engineers are hard at work to bring Rossmann Cluster back to life. We shall e-mail everyone as soon a...
-
The Radon cluster will be unavailable on Tuesday, 17 March, 2015. On that day, Radon's job scheduler will be updated, and its network connection to its scratch system reconfigured. As of 5:30pm Tuesday, March 17, Radon cluster has been returned to se...
-
UPDATE As of 9:00 pm Tuesday, 14 April, 2015, the Conte cluster is back in full production mode. During the maintenance, all nodes were checked for reliability and system software installations were checked for consistency between nodes, and issues...
-
UPDATE As of 4:45 pm Tuesday, May 19, all the work noted below has been completed and both Hansen and Peregrine-1 have been returned to full service. Thanks for your patience. =-=-= The Hansen and Peregrine-1 clusters will be unavailable beginning at...
-
Software upgrades on the Rice cluster were completed by 7:30pm. It is now open for access by early adopters. Please let us know if you see any issues with the cluster. Maintenance on Snyder, Rossmann, Hansen, Hammer, and Conte has been completed and...
-
Fortress Service Unavailable June 23
The Fortress data archiving services will be unavailable starting at 8:00AM on 23 June, 2015 due to scheduled maintenance. During this outage, our storage engineers will upgrade hardware and configure RAID on the internal servers. Users are reques...
-
The Hammer, Hathi, Radon, and Snyder clusters will be unavailable on Wednesday, July 1, 2015 from 8:00am to 12:00pm EDT for scheduled maintenance. The clusters will return to full production by Wednesday, July 1st, 2015 at 12:00pm EDT. The do...
-
Cluster Maintenance - Peregrine1
The Peregrine1 cluster will be unavailable from August 17, 2015 at 8:00am until August 19, 2015 at 6:00pm EDT for scheduled maintenance. The cluster will return to full production by Wednesday, August 19th, 2015 at 6:00pm EDT. During this time, Pere...
-
As of 11:55 pm August 18, 2015, Fortress/HPSS has been brought back online. Storage engineers continue working on bringing the upgraded Fortress up and deploying new software to all RCAC systems. Current estimate for return to service: 12:00 am August 1...
-
Update: September 23, 2015 8am Shortly after 2am, engineers were able to complete the file transfer and return Carter to production. Update: September 22, 2015 11pm The file transfer continues and will last well into the night. The next update...
-
Cluster Maintenance - Hansen/Peregrine1
Update: September 22, 2015 1pm The work affecting the Hansen and Peregrine1 scratch filesystems has been completed and the clusters are back in full production. Original: The Hansen and Peregrine1 clusters will be unavailable beginning on Tuesday, Septembe...
-
Emergency scratch maintenance on Carter and Scholar
The scratch filesystem serving Carter/Scholar underwent emergency maintenance through Friday night and well into Saturday. We expect this work to resolve the periodic hangs this filesystem has been experiencing for the last two days. Job scheduling...