Rice and Snyder Cluster Maintenance

February 1, 2016 8:00am - February 5, 2016 10:40pm EST
Maintenance
Rice, Snyder

As of 10:40pm, the Snyder cluster has been returned to normal service in the POD. This concludes the maintenance.

Update: February 5, 2016 8:54pm

As of 8:25 pm, Friday, 5 Feb 2016, the Rice cluster maintenance is complete and the system is returning to service. Jobs are running now, and logins are being re-enabled.

Reconfiguration of Snyder for its new location is also nearly complete, and the cluster will undergo final validation testing before it returns to service. We will update this item again once that testing is complete, or by 10:30 pm.

Updated

The re-cabling of the Rice and Snyder clusters in the POD is still in progress. We hope to complete it soon and begin return-to-service validation of the clusters. We will issue an update by 8:00pm tonight if they are not back in service before then.

Original Message:

The Rice and Snyder clusters will be unavailable beginning Monday, February 1st, 2016 at 8:00am EST for scheduled maintenance and relocation to our portable datacenter (POD). Both clusters will return to full production by Friday, February 5th, 2016 at 5:00pm.

During this time, Rice and Snyder will have their operating systems patched, the PBS resource management system upgraded, and maintenance performed on the Lustre scratch filesystem. They will also be physically relocated as part of the larger University datacenter management plans, which is why this maintenance will take longer than most.

Any PBS job whose requested walltime would run past Monday, February 1st, 2016 at 8:00am EST will not start and will remain in the queue until after the maintenance is completed.
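To check whether a job would still fit before the maintenance window, the rule above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not the scheduler's actual logic: the function names are hypothetical, walltimes are assumed to be given as HH:MM:SS, and a job ending exactly at 8:00am is assumed to be allowed.

```python
from datetime import datetime, timedelta, timezone

# EST is UTC-5; a fixed offset is sufficient for this February window.
EST = timezone(timedelta(hours=-5))

# Maintenance start time, taken from the announcement.
MAINTENANCE_START = datetime(2016, 2, 1, 8, 0, tzinfo=EST)


def parse_walltime(walltime: str) -> timedelta:
    """Convert an HH:MM:SS walltime string into a timedelta."""
    hours, minutes, seconds = (int(part) for part in walltime.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def can_start(now: datetime, walltime: str) -> bool:
    """True if a job starting at `now` would end by the maintenance start.

    Treating an end time exactly at the maintenance start as acceptable
    is an assumption of this sketch.
    """
    return now + parse_walltime(walltime) <= MAINTENANCE_START


def max_walltime(now: datetime) -> timedelta:
    """Longest walltime that still fits before the maintenance begins."""
    return max(MAINTENANCE_START - now, timedelta(0))


if __name__ == "__main__":
    # Example: Sunday, January 31st, 2016 at 8:00pm EST.
    now = datetime(2016, 1, 31, 20, 0, tzinfo=EST)
    print(can_start(now, "12:00:00"))
    print(can_start(now, "24:00:00"))
    print(max_walltime(now))
```

If a job is being held for this reason, requesting a walltime no longer than the value returned by max_walltime for the expected start time should let it run before the maintenance, assuming resources are otherwise available.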

Originally posted: January 7, 2016 10:36am EST
Last updated: February 5, 2016 10:43pm EST
