Skip to main content
Have a request for an upcoming news/science story? Submit a Request

Outages and Maintenance

  • Bell cluster will be unavailable Wednesday

    • widget.news::news.updated:

    The Bell cluster will be unavailable Wednesday, November 25, 2020 at 8:00am EST for scheduled maintenance. During this time, our Engineering team will work on finalizing the cluster's internal configuration. Both cluster front-ends and compute nodes...

  • Bell cluster will be unavailable Wednesday

    • widget.news::news.updated:

    The Bell cluster will be unavailable Wednesday, November 11, 2020 at 8:00am EST for scheduled maintenance. During this time, our Engineering team will be working with vendor representatives to fine-tune performance of the Bell scratch filesystem and...

  • Unscheduled Brown outage

    • widget.news::news.updated:

    The Brown cluster began experiencing issues with its job scheduler around 4:00pm EST. The problem manifests itself as Slurm-related commands (slist, squeue, sinteractive, sbatch, etc) being slow, unresponsive or timing out. Queue selection dialogs in...

  • Home and Applications Filesystem Maintenance - All Clusters

    • widget.news::news.updated:

    Most of the research computing clusters (Brown, Gilbreth, Halstead, Hammer, Rice, Scholar, Snyder, WCERES, Workbench, and WSC Hadoop) as well as some other minor systems will be unavailable beginning at Tuesday, November 3rd, 2020 at 9:00am EST, for...

  • Bell, Halstead, Hammer and CMS Clusters Maintenance

    • widget.news::news.updated:

    The Bell, Halstead, Hammer and CMS clusters will be unavailable Tuesday, November 3, 2020 at 8:00am EST for scheduled maintenance. The clusters will return to full production by %enddatetime%. During this time, the clusters will have their operating...

  • Bell cluster will be unavailable Wednesday

    • widget.news::news.updated:

    The Bell cluster will be unavailable Wednesday, October 28, 2020 at 8:00am EDT for scheduled maintenance. During this time, our Engineering team will be working with vendor representatives to complete benchmarking steps and finalize the cluster's int...

  • Bell will be unavailable Tuesday night through Thursday

    • widget.news::news.updated:

    The Bell Cluster will be unavailable Tuesday, October 20, 2020 at 8:00pm EDT. During this time, our Engineering team will be working with vendor representatives to complete benchmarking steps and finalize the cluster's internal configuration. During...

  • Unscheduled RCAC GitHub outage

    • widget.news::news.updated:

    The github.rcac server will be briefly unavailable Friday, October 16, 2020 from 7:00pm – 11:59pm for an emergency maintenance. During this time, the server will undergo maintenance tasks that can not be completed with the server in production. Opera...

  • Rice Scheduler Issues

    • widget.news::news.updated:

    As of about 8:15 pm yesterday (Monday 12 October) the Slurm batch scheduler on Rice entered a degraded state due to a problem with its internal database. Job Scheduling on Rice has been paused while we work on this issue. We will provide an update b...

  • Halstead Scratch Issues

    • widget.news::news.updated:

    The Halstead cluster began experiencing issues with its scratch filesystem around 1:15 pm, Sunday 11 Oct. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being address...

  • Unscheduled Halstead outage

    • widget.news::news.updated:

    The Halstead cluster began experiencing issues with its scratch filesystem around 9:00pm. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will prov...

  • Fortress Archive Monthly Maintenance

    The Fortress Archive will be unavailable Wednesday, October 7, 2020 from 8:00am - 12:00pm EDT for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal updates and have some work done on its ta...

  • Halstead Scratch Issues

    • widget.news::news.updated:

    Halstead's scratch began experiencing issues this morning (Sunday 27 Sep). Job scheduling has been paused while engineers and the system vendor investigate the issue. We will have an update by tomorrow morning (Monday 28 Sep) at 10:00 am.

  • Halstead Scratch Issues

    • widget.news::news.updated:

    Halstead's scratch began experiencing issues at approximately 2:00 AM. Some users have reported that they are unable to read or index files within their personal scratch directories when logged in from certain front-ends. Job scheduling has been pa...

  • Unscheduled Fortress outage

    • widget.news::news.updated:

    The Fortress tape archive began experiencing issues with its disk cache subsystem being full on Tuesday, September 1st, 2020 around 12:30am EDT. The problems manifest themselves as intermittent Error -1, Error -28, and No space left on device error m...

  • Weber Cluster Maintenance

    • widget.news::news.updated:

    The Weber cluster will be taken down for regular maintenance and upgrades beginning on Monday, August 31st, 2020 at 8:00am EDT. During this time, Weber will have operating system updates applied. Users will be unable to log in or use Weber for inter...

  • Unscheduled Fortress outage

    • widget.news::news.updated:

    The Fortress tape archive began experiencing issues with its disk cache subsystem on Thursday, August 27th, 2020 around 9:00pm EDT. The problems manifest themselves as intermittent Error -1, Error -28, and No space left on device error messages in HS...

  • Gilbreth Cluster Maintenance

    • widget.news::news.updated:

    The Gilbreth cluster will be unavailable Wednesday, August 26, 2020 at 8:00am EDT for scheduled maintenance. The cluster will return to full production by %enddatetime%. During this time, Gilbreth will have the operating system patched and a maintena...

  • Unscheduled Fortress outage

    • widget.news::news.updated:

    The Fortress tape archive began experiencing issues with Error -1, Error -28, and No space left on device error messages in HSI and Globus around 9:00pm EDT on Tuesday, August 25th, 2020. Engineers are currently diagnosing the issue and are working...

  • Central Authentication Service (CAS) Outage

    This morning, BoilerKey authentication for all community clusters and user facing services (such as the RCAC website, Rstudio Server) is unavailable due to a Central Authentication Service (CAS) outage. All the clusters are under normal operations an...