Unscheduled Outage to Conte Scratch

January 20, 2015  3:45pm – January 21, 2015  3:45pm

Update 2

Storage engineers have brought Conte scratch back to full production. Job scheduling on Conte has been restored as of approximately 2:20 pm on 1/21/2015.

Any job that was actively writing to scratch while the filesystem was in a degraded state may have experienced data corruption on write. We advise to check your output for integrity.

We apologize for the inconvenience this outage may have caused you and appreciate your patience throughout the repair process.

Update 1

At 12:00 noon on 1/21/2014, the lustre filesystem on Conte will be completely unavailable while storage engineers work to fully bring it back into production. While this work is ongoing, job scheduling is paused.

Original message:

Beginning at 3:45pm on Tuesday, January 20th, 2015, Conte scratch was unavailable due to a hardware controller issue. Storage engineers brought the affected storage server back online and scratch was returned to production by 5:30 pm. Since then, continuing problems accessing scratch have been found on some front-ends and are being investigated now, though we believe all compute nodes are working as expected.

We're sorry for any inconvenience this may be causing you in your work. Thank you for your patience while our engineers work to ensure the system is fully operational.

Originally posted: January 20, 2015  8:37pm

Purdue University, 610 Purdue Mall, West Lafayette, IN 47907, (765) 494-4600

© 2017 Purdue University | An equal access/equal opportunity university | Copyright Complaints | Maintained by ITaP Research Computing

Trouble with this page? Disability-related accessibility issue? Please contact us at online@purdue.edu so we can help.