Skip to main content
Have a request for an upcoming news/science story? Submit a Request

Outages

  • Depot Access Issues from ECN Systems

    Update Working closely with ECN, RCAC engineers have deployed a new CIFS server to mitigate any incompatibilities. ECN users affected by this issue should connect to their Depot space though \\[datadepot2.rcac.purdue.edu](http://datadepot2.rcac.purdu...

  • All Clusters Outage

    All Research Computing systems suffered an unplanned outage Saturday, March 24th, 2018 at 8:15pm EDT due to a widespread power failure in the area. Thanks to diligent efforts all night and today by many teams across ITaP, all computational clusters h...

  • Job Scheduling Issue on Clusters

    • widget.news::news.updated:

    As of Monday, April 16th, 2018 at 10:00am EDT, Halstead, HalsteadGPU, and Hammer are not properly scheduling new jobs due to a problem with the Moab scheduler. Existing jobs are unaffected. We are working with the vendor to address this and expect...

  • Unscheduled Conte scratch outage

    • widget.news::news.updated:

    The Conte cluster began experiencing issues with the scratch filesystem around 10:45am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will pr...

  • Unscheduled network outage on Brown, Rice, Snyder and Hammer

    • widget.news::news.updated:

    The Brown, Hammer, Rice, and Snyder clusters began experiencing issues with scratch filesystems and network connectivity around 2:30am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused...

  • Unscheduled Rice outage

    • widget.news::news.updated:

    The Rice cluster is currently experiencing issues with the scratch filesystem. Engineers have identified the problem and are in the process of applying the fix. Job scheduling has been paused while this issue is being addressed. We will provide an up...

  • Scratch unavailable on Snyder cluster

    • widget.news::news.updated:

    As of approximately 2:30pm EDT, the Snyder cluster is currently experiencing issues with its scratch filesystem. Engineers are currently diagnosing the issue and are working to identify a fix. Attempts to access scratch will likely fail. Job scheduli...

  • Unscheduled Workbench outage

    • widget.news::news.updated:

    The Data Workbench and WSC Hadoop cluster began experiencing issues with connectivity around 2:00pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. We will provide an update by 3 pm.

  • Unscheduled Depot Outage

    • widget.news::news.updated:

    The Data Depot central servers lost their connection to Depot and all rebooted around 11:45am EDT. Servers are already returning, but engineers are watching the situation closely. Scheduling on clusters is being paused. Cluster jobs might be able...

  • Unscheduled Rice scratch outage

    • widget.news::news.updated:

    The Rice cluster began experiencing issues with its scratch system around 4:30pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will provide...

  • Unscheduled Fortress Archive Outage

    • widget.news::news.updated:

    The Fortress Archive was undergoing routine monthly maintenance at 8:00am EST when it suffered a failure of the internal database. Engineers are working with the vendor to repair the database and restore service as soon as possible. Until then, any a...

  • Unscheduled Workbench Outage

    • widget.news::news.updated:

    The Workbench cluster began experiencing issues with the server which manages its Thinlinc virtual desktop around 10:30pm EST. Currently, no new Thinlinc sessions will be able to be started. Engineers are currently diagnosing the issue and are worki...

  • Campus power outage

    • widget.news::news.updated:

    A power event around 8:40 am this morning affected all Research Computing resources, along with networking, and other resources around campus. Networking is offline making all resources inaccessible. Networking is slowly coming back online around 9:3...

  • Halstead and Brown unscheduled outage

    • widget.news::news.updated:

    Halstead, HalsteadGPU, Brown, and BrownGPU went offline during a campus power event around 8:40 am this morning. Engineers are working to bring the compute nodes and the scratch system back online. Other systems are back online at this time. Job sche...

  • Unscheduled Brown and BrownGPU outage

    • widget.news::news.updated:

    The Brown and BrownGPU cluster began experiencing issues with the scratch filesystem around 8:00am EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed...

  • Unscheduled Fortress Outage

    • widget.news::news.updated:

    Fortress began experiencing issues around 11:15am EST. Engineers are currently diagnosing the issue and are working to identify a fix. We will provide an update by 4:00pm.

  • Home Filesystem / Slowdown / Login Issue on all Clusters

    • widget.news::news.updated:

    All clusters began experiencing issues with logins and a general slowdown around 3:10pm EST. This has been identified as being due to an issue on the /home/ filesystem. Engineers are continuing to examine this and are working to alleviate the issue r...

  • Unscheduled Fortress outage

    • widget.news::news.updated:

    The Fortress tape archive began experiencing issues with Globus and HSI access around 2:30pm EDT on Sunday, March 10th, 2019. Engineers are currently diagnosing the issue and are working with the vendor to identify a fix. We will provide an update by...

  • Campus power outage affecting multiple clusters

    • widget.news::news.updated:

    At approximately, 8:30am EDT, the Brown, Hammer, Rice, and Snyder clusters became unavailable due to a campus power outage. While power has been restored, engineers are currently working returning the clusters to service. Job scheduling has been paus...

  • Unscheduled Github outage

    • widget.news::news.updated:

    Github is currently offline and is not responding. Engineers are currently working on bringing Github back up. We will provide another update later today or as soon as it is back online.