Article #356: RCAC system and data center maintenance
RCAC systems including the DXUL/fortress archival storage system will be unavailable beginning at 8am Tuesday, 3/17, while system and MATH data center...
RCAC systems including the DXUL/fortress archival storage system will be unavailable beginning at 8am Tuesday, 3/17, while system and MATH data center...
RCAC systems will be unavailable from 8am-6pm Friday, October 9, for electrical work in the MATH data center and system maintenance. The Coates Linux...
Network problems arose following Coates cluster maintenance Tuesday, January 5. ITaP staff are working to resolve these problems, but we are currentl...
Coates-b, -c, and -e nodes have been powered down due to a problem with a CDU (cooling distribution unit) that cools those systems. PBS jobs running...
Job scheduling on the Coates and Rossmann Linux cluster was disabled from 7:15-10:20pm Saturday, October 30, due to a partial cooling loss in the MATH...
The Lustre storage system that provides scratch storage on the Rossmann and Coates Linux clusters (via /scratch/lustreA) failed at approximately 1:30p...
What’s happening? ITaP’s research computing systems will be shut down beginning at 3 a.m. Tuesday, March 29. The Coates and Rossmann cluster supercomp...
All RCAC systems will be unavailable on Tuesday, March 29th from 3:00am – 6:00pm. The Rossmann, Coates, Radon, and Moffett clusters will remain down t...
What’s happening? ITaP’s research computing systems will be shut down beginning at 5 p.m. Friday, Aug 5, including the Rossmann, Coates, Moffett and R...
Beginning at 5:00 pm, Friday, August 5th, the Coates and Rossmann supercomputer clusters will be unavailable due to work to complete a power and cooli...
This week, ITaP engineers have been troubleshooting issues with the Coates cluster, with the most common symptom being PBS jobs that abort or restart...
The LustreA scratch filesystem, used by Rossmann and Coates, suffered an unknown failure sometime in the early morning of November 15, 2011. LustreA...
This morning, the PBS system on Coates developed an issue with the storage holding its internal state.While systems engineers are working on recoverin...
During the week of spring break, 2012, the Steele, Coates, and Rossmann clusters will each be down for maintenance for one day to install OS patches a...
Update - 6:45 pm Tuesday, 10 April 2012 ITaP engineers have found and repaired the network issue that was affecting Coates nodes type B, C and E. Job...
Update: 10:00pm Tuesday As of 8:30pm Tuesday 21 August 2012, the LustreB filesystem has been returned to full service. Our storage engineers with assi...
UPDATE: 9 October, 2012 The Coates and Rossmann Clusters have both returned to production, and their maintenance is completed, as of 11:30 am, Tuesday...
During scheduled network maintenance on network equipment connecting storage to ITaP clusters, all scheduling will be paused from 4-6pm. Running jobs...
During the New Years' weekend holiday, all ITaP HPC resources will be unavailable due to a scheduled upgrade of research home directories. While the s...
Update - 7:00pm, 1/4/2013: - All community clusters (Steele, Coates, Rossmann, Hansen, Carter, and Peregrine1) are back in production. Radon is curr...