RCAC will be performing emergency maintenance on all its Linux systems Wednesday-Friday, August 27-29. This work includes the application of patches, InfiniBand upgrades, and downtime for system reboots as described in the schedule below. Job scheduling reservations are in place to prevent longer jobs from starting prior to each cluster's reboot time. When possible, currently-executing jobs will be requeued and restarted after cluster nodes have been returned to service.
Please refer questions and comments about this maintenance to rcac-help@purdue.edu.
| Resource | Downtime | Notes | Status |
|---|---|---|---|
| Gray | Tue, 08/26, noon-5pm | Completed 5pm Wed, 8/27 | |
| CMS | Wed, 08/27, 3-8am | Completed 7:30am Wed, 8/27 | |
| Radon | Wed, 08/27, noon-5pm | Completed 5:30pm Wed, 8/27 | |
| Steele | Wed, 08/27, noon-5pm | 1 | Completed 10:20pm Wed, 8/27 |
| Venice | Wed, 08/27, noon-5pm | Completed 2:30pm Thu, 8/28 | |
| Julius/SGI Altix | Thu, 08/28, 8am-6pm | Completed 5:00pm Thu, 8/28 | |
| Pete | Fri, 08/29, 8am-5pm | 2 | Completed 12:30pm Sat, 8/30 |
| Prospero | Fri, 08/29, 8am-5pm | 1 | Completed 7:30pm Fri, 8/29 |
| Rossmann | Fri, 08/29, 8am-5pm | 1 | Completed 7:30pm Fri, 8/29 |
| 1) It is recommended that applications using InfiniBand message-passing libraries be recompiled following the maintenance | |||
| 2) Due to complications with the integration of SFS/Lustre with a new Linux kernel installed as part of the system maintenance, the contents of /scratch/lustre1 have been copied onto an RCAC BlueArc file server, where they will be available for use on the Pete cluster while RCAC works with HP to address the SFS/Lustre upgrade issues. | |||
Share this...