Emergency maintenance for RCAC Linux systems Wed-Fri, August 27-29

September 01, 2008

RCAC will be performing emergency maintenance on all its Linux systems Wednesday-Friday, August 27-29. This work includes the application of patches, InfiniBand upgrades, and downtime for system reboots as described in the schedule below.  Job scheduling reservations are in place to prevent longer jobs from starting prior to each cluster's reboot time.  When possible, currently-executing jobs will be requeued and restarted after cluster nodes have been returned to service.

Please refer questions and comments about this maintenance to rcac-help@purdue.edu.

Resource Downtime NotesStatus
Gray  Tue, 08/26, noon-5pm Completed 5pm Wed, 8/27
CMS  Wed, 08/27, 3-8am Completed 7:30am Wed, 8/27
RadonWed, 08/27, noon-5pm Completed 5:30pm Wed, 8/27
Steele Wed, 08/27, noon-5pm1Completed 10:20pm Wed, 8/27
Venice   Wed, 08/27, noon-5pm Completed 2:30pm Thu, 8/28
Julius/SGI Altix Thu, 08/28, 8am-6pm Completed 5:00pm Thu, 8/28
Pete   Fri, 08/29, 8am-5pm2
Completed 12:30pm Sat, 8/30
Prospero Fri, 08/29, 8am-5pm1Completed 7:30pm Fri, 8/29
Rossmann Fri, 08/29, 8am-5pm1Completed 7:30pm Fri, 8/29
1) It is recommended that applications using InfiniBand message-passing
libraries be recompiled following the maintenance
2) Due to complications with the integration of SFS/Lustre with a new Linux kernel installed as part of the system maintenance, the contents of /scratch/lustre1 have been copied onto an RCAC BlueArc file server, where they will be available for use on the Pete cluster while RCAC works with HP to address the SFS/Lustre upgrade issues.

 

Share this...
Close
E-mail It