Outages and Maintenance

  • Rossmann Cluster OS Upgrade

    During the maintenance scheduled for 3/15/2014-3/16/2014, the Rossmann cluster will be upgraded to Red Hat Enterprise Linux, version 6. Only those PBS jobs with walltimes short enough that they will finish prior to the beginning of this maintenance...

  • RCAC Fileserver Maintenance

    UPDATE - As of 7:45pm Sunday, March 16th, 2014, the fileserver maintenance has completed successfully, and cluster systems are back online. All Research Computing systems will be unavailable from 8:00am Saturday, 3/15/2014 through Sunday, 3/16/2014...

  • LustreD unavailable

    The Lustre D filesystem, serving the Conte cluster, has become unavailable as of about 8:00 pm Thursday 13 Feb, 2014. System engineers are working to bring the system back to 100% operation. Currently running jobs should be able to continue, but sch...

  • System Maintenance

    The Hansen, Coates, and Rossmann clusters will be unavailable beginning at 8:00am on Tuesday, January 7, 2014, for scheduled maintenance. The clusters will return to full production by 5:00pm, Wednesday, January 8. During this time, these systems wil...

  • Hansen and WinHPC clusters at reduced capacity

    On December 21, 2013, the Hansen and WinHPC clusters will operate at reduced capacity while datacenter power maintenance is performed on a portion of the system. In the days leading up to December 21st, this will appear as potentially increased queue...

  • Lustre D filesystem unavailable

    Update - 2:25pm, 12/16/2013 The LustreD scratch filesystem has been returned to service and both the filesystem and scheduler appear to be working properly. Conte has been returned to normal production service as of 2:20pm. Update - 10:30am, 12/16/2...

  • Maintenance completed on LustreD filesystem

    UPDATE 6:00 pm 14 Dec 2013: As of 5:45 pm, we believe this problem has been corrected and Conte has returned to normal operation. The LustreD filesystem, serving the Conte cluster, is experiencing some issues as of about 4:30 pm Saturday 14 Dec 2013. S...

  • Network Storage Outage

    All ITaP Research Computing systems are currently experiencing an issue with accessing network filesystems. A case has been opened with our vendor as ITaP engineers troubleshoot the issue. Cluster users may experience issues accessing files in /home,...

  • Most Major Clusters Stopped

    Nearly all major clusters operated by ITaP Research Computing are stopped due to issues with their storage systems relating to the power loss on the West Lafayette campus in the wake of the severe weather Sunday night. This includes: Conte, Carter,...

  • Fortress Archive Down

    The Fortress HPSS Archive is offline due to issues with its storage systems relating to the power loss on the West Lafayette campus in the wake of the severe weather Sunday night. Engineers are investigating the problem now, but until this is reso...

  • LustreC hardware issue

    Update: 11:00pm, Nov. 12, 2013 ITaP storage engineers have returned the offline hardware to production and LustreC is back in production. Queues on Hansen and Carter have been restarted as of 11:45pm. Update: 5:00pm Following consultation with vendor...

  • Partial scratch96 filesystem outage

    On the evening of 10/10/2013, the fileserver providing the "scratch96" filesystem serving some users of the Steele and Radon clusters suffered a permanent failure of its second-tier storage. This means that files on scratch96 that are older th...

  • Fortress HPSS Archive Unavailable

    Update - 10:15 am: Fortress is back in full production. Original Message: As of 8:00am, Thursday, September 19, the Fortress HPSS is temporarily unavailable due to issues communicating with its tape drives. Storage engineers are working to return...

  • LustreC Filesystem Maintenance

    The high-performance scratch file system (LustreC) supporting the Carter, Hansen, Peregrine1, and WinHPC research clusters requires mandatory maintenance work. The work should be performed as soon as possible in order to ensure full performance...

  • Carter Maintenance

    As you may be aware, on April 5, the Board of Trustees approved the purchase of the next generation of community cluster, to be named "Conte". Since that time, ITaP staff have begun preparations for installing the new system, which will arr...

  • Software Stack Changes during Carter Maintenance

    Between July 8 and July 16, Carter will be unavailable due to scheduled maintenance. On July 8, there will be changes made to the software stack on most of ITaP's community clusters. Changes will include updates to the default version of the Intel co...

  • LustreC filesystem unavailable

    Update: May 13, 2013 11:00pm: LustreC has been returned to service. Carter, Hansen, and Peregrine1 are back in production with queues enabled. Update: May 13, 2013 3:00pm: storage engineers are continuing to work with vendor support to return Lustre...

  • Unscheduled Fortress Outage

    Resolved: As of about 4:45pm ET, the connectivity issue affecting the Fortress archive has been resolved. The HPSS archive is back in full production. If you encounter any issues, please contact us at rcac-help@purdue.edu. Update: ITaP Storage Enginee...

  • Network outage affecting Peregrine1 cluster

    On April 24, 2013, network engineers will be relocating fiber optics that connect the Peregrine1 cluster to infrastructure in West Lafayette. This outage is scheduled for 12:00am through 5:00am. This will leave Peregrine1 unable to run jobs. Any PBS j...

  • Scheduling paused on Carter cluster

    Update: 8:12pm Scheduling on Carter has been resumed, and Carter is back in full production. Original Message: Beginning the morning of April 16, a number of compute nodes on the Carter cluster are experiencing a connectivity issue. While ITaP engine...