Scholar
-
Unscheduled Data Depot and community clusters outage
At about 9:30am EDT, Data Depot servers started experiencing a ramping high load. Coupled with an ongoing scaling issues with the metadata subsystem, this caused Data Depot to become increasingly unresponsive for both community clusters and network d...
-
Unscheduled Data Depot outage on multiple clusters
The Bell, Brown, Gilbreth, Halstead, Scholar, and Workbench clusters began experiencing issues with mounting old Data Depot filesystem around 12:30am EDT. Multiple nodes are flagged offline by an automatic check, and bioinformatics application suite...
-
RCAC Whole-Floor Downtime and Power Work
The majority of the Research Computing computational resources will be unavailable July 30, 2021 7:00am - August 1, 2021 12:00pm EDT for a whole-floor downtime due to electrical power work in MATH and POD data centers. Along with a required preven...
-
Scheduling Paused on Multiple Clusters
At about 4:00 pm today (Wednesday, 21 July, 2021) System Engineers found an issue with the schedulers on the Bell, Brown, Gilbreth, Halstead, and Scholar clusters. Job scheduling has been paused while this is being investigated. Symptoms of this pro...
-
Intermittent Access Failures on Data Depot
As of Thursday, June 17th, 2021 at 11:00am EDT, users of community clusters may experience intermittent "permission denied" errors while trying to access their files on Data Depot. Errors may come and go, and may appear on both login and c...
-
[Cancelled] Scholar Cluster Maintenance
The Scholar cluster will be unavailable Friday, May 14, 2021 at 8:00am EDT for scheduled maintenance. The cluster will return to full production by %enddatetime%. During this time, Scholar will have the operating system patched and several software u...
-
Whole-Floor Cluster Maintenance
The majority of Research Computing computational resources (Bell, Brown, Gilbreth, Halstead, Hammer, Scholar, WCERES, Workbench, and WSC Hadoop clusters) will be unavailable Tuesday, May 11, 2021 at 5:00pm EDT for Data Depot migration work. The clust...
-
The Data Depot storage server began experiencing issues around 3:00pm EST on Thursday, February 4th, 2021. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused on all clusters while this issue...
-
A large number of Scholar accounts have been accidentally removed during overnight processing. This manifests as "LDAP authorization check failed", or "Incorrect or Invalid username/password" and similar errors when trying to logi...
-
The Bell, Brown, Gilbreth, Halstead, Rice, Scholar, and Snyder clusters began experiencing issues with their Data Depot mounts around 10:00pm EST. Engineers are currently diagnosing the issue and are working to identify a fix. To avoid job losses for...
-
Access to RCAC Resources During ITaP Central Authentication Outage
On Sunday, December 27th, 2020, ITaP staff will perform major upgrades to the central authentication infrastructure. All applications that require logging in with BoilerKey or Career Account credentials will be unavailable Sunday, December 27, 2020 f...
-
Research Computing Holiday Break
Research Computing personnel will observe the university winter break from 5:00pm EST EST on Friday, December 18th, 2020, and will resume normal business hours on Monday, January 4th, 2021. During this time, Research Computing services will continue...
-
The Scholar cluster will be taken down for regular inter-semester maintenance and upgrades starting at Wednesday, December 16th, 2020 at 8:00am EST. All jobs which cannot complete before then will be held queued during this time, and no one will be a...
-
Home and Applications Filesystem Maintenance - All Clusters
Most of the research computing clusters (Brown, Gilbreth, Halstead, Hammer, Rice, Scholar, Snyder, WCERES, Workbench, and WSC Hadoop) as well as some other minor systems will be unavailable beginning at Tuesday, November 3rd, 2020 at 9:00am EST, for...
-
The Scholar cluster will be taken down for a short maintenance on Thursday, August 20th, 2020 at 10:00am EDT. During this maintenance, we will apply patches to the RStudio Server on Scholar. All jobs which cannot complete before the start of the main...
-
Requiring BoilerKey or SSH key authentication on Community Clusters
During Aug 17-20th, 2020, due to immediate security concerns, we will be changing community cluster access to require BoilerKey two-factor authentication (2FA) for all direct SSH or Thinlinc desktop access to each cluster and will no longer support p...
-
As of 12:30pm EDT all the clusters are back in production. If your job crashed during the outage, please resubmit it. We are currently experiencing an outage across the community clusters (Brown, Gilbreth, Halstead, Hammer, Rice, Scholar, Snyder, WC...
-
BoilerKey and SSH Key Login to Clusters FAQ
As explained in the news article on Requiring BoilerKey or SSH key authentication on Community Clusters, all clusters will now be requiring BoilerKey or SSH key authentication in order to log in to them, effective mid-August 2020. Here are some comm...
-
The Scholar cluster will be taken down for regular inter-semester maintenance and upgrades starting at Monday, June 15th, 2020 at 8:00am EDT. All jobs which cannot complete before then will be held queued during this time, and no one will be able to...
-
Unscheduled Home Directory Outage
The Brown, Gilbreth, Halstead, Rice, Scholar, Snyder, and Workbench clusters began experiencing issues with intermittently slow home directories access around 2:30pm EDT. The issue has been traced to a high load on one of the filesystem's back-end se...