Conte Cluster Maintenance

May 17, 2016  8:00am – May 18, 2016  5:55am
Conte

Conte has been returned to normal operations as of Wednesday, May 18th, 2016 at 5:55am. All upgrades were completed, though a small number of nodes which require more attention to be fully ready for jobs remain offline for now and will be returned to service as they are addressed. This concludes the maintenance.

-----

Upgrades and fixing problems with the Xeon Phi coprocessors continues through the night with several staff working through the nodes. We will post another update by at least 8:00am.

-----

The Isilon home filesystem has been restored, but in order to avoid a recurrence of the problem, we must limit the rate at which systems are returned to service. This will push back our return to service for Conte. Estimates currently have Conte back before 2:00am. We will post an update by then if it is not already back in service.

-----

Due to unavailability of the Isilon/home filesystem, Conte maintenance is extended till 10:00 PM tonight. We shall send an update no later than 10:00 PM.

-----

The update of Conte is in full swing. Due to the large number of packages that need to be installed on each machine, it is taking longer than regular maintenance. We expect the cluster to be back in production shortly. We will issue an update no later than 7 PM today.

-----

This is a friendly reminder about the Conte Cluster maintenance on Tuesday, May 17th, 2016 at 8:00am. Please save all your work before the start of maintenance. If you have processes on front-ends or data transfer operations that continue past 8:00am tomorrow, they will be terminated by the system.

-----

The Conte cluster will be unavailable beginning at Tuesday, May 17th, 2016 at 8:00am, for scheduled maintenance. The cluster will return to full production by 5 PM that day.

The downtime is necessary to upgrade the Linux Kernel along with the MPSS software on Conte which allows use of the Xeon Phi accelerators. A minor scheduler upgrade and security patches will also be applied.

Any PBS jobs which request a walltime which would take them past Tuesday, May 17th, 2016 at 8:00am will not start and will remain in the queue until after the maintenance is completed.

If you have any questions or concerns about the update please contact us at rcac-help@purdue.edu.

Originally posted: April 25, 2016  11:12am

Purdue University, 610 Purdue Mall, West Lafayette, IN 47907, (765) 494-4600

© 2017 Purdue University | An equal access/equal opportunity university | Copyright Complaints | Maintained by ITaP Research Computing

Trouble with this page? Disability-related accessibility issue? Please contact us at online@purdue.edu so we can help.