Bell

Unscheduled Bell outage
- December 9, 2022 12:30pm - 4:00pm EST
The Bell cluster began experiencing issues with its Lustre scratch filesystem around 12:30pm EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We w...
Scheduling paused on multiple clusters
- November 7, 2022 9:50am - 12:15pm EST
The Bell, Brown, Gilbreth, Halstead, and Scholar clusters began experiencing issues with their Data Depot mounts around 9:50am EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while...
Bell Gateway Outage
- October 7, 2022 8:00am - October 10, 2022 5:10pm EDT
The Bell Gateway began experiencing issues following the Oct 4-6th Bell Maintenance. In particular, gateway applications have been observed to fail upon attempting to connect to the application after launching the job. Our engineers are investigating...
Scheduled Bell Maintenance
- October 4, 2022 8:00am - October 6, 2022 6:30pm EDT
The Bell cluster will be unavailable for use October 4, 2022 8:00am - October 5, 2022 11:59pm EDT while we perform scheduled maintenance to expand Bell's scratch storage capabilities and make upgrades to Bell's AMD GPU drivers and other system soft...
Bell Degraded Capacity
- September 28, 2022 12:00am - September 6, 2023 11:26am EDT
The Bell cluster continues to experience hardware issues. Engineers are currently diagnosing the issues and are working with vendors to schedule and perform repairs as quickly as possible. Job scheduling continues, but you may experience longer...
Unscheduled Bell outage
- September 27, 2022 2:30pm - 3:45pm EDT
A section of the Bell cluster compute nodes began experiencing issues with power feed and cooling around 2:30pm EDT. Engineers have powered down affected nodes and are working to identify a fix. Some jobs may have ended up terminated or requeued. Jo...
MATH data center cooling outage
- August 3, 2022 1:40pm - 9:30pm EDT
The Math building data center began experiencing issues with its cooling system around 1:40pm EDT. To minimize thermal load on the cooling infrastructure, job scheduling has been paused and all idle compute nodes on Anvil, Bell, Geddes, Gilbreth, and...
Bell scratch purging policy change
- July 1 - 23, 2022
As announced during the recent Bell outage, some temporary austerity measures need to be implemented to prevent the scratch file system from filling up and causing more outages. Hardware to expand the scratch file system has been on order for some ti...
Bell Scratch Degraded Storage (Returned to Service)
- June 29, 2022 - July 5, 2022
Bell Scratch is near capacity and performance is degraded. As of this morning, Bell Scratch was 94% full. This afternoon we paused scheduling as scratch was not responding consistently. We have more drives on order, but with global supply chain issue...
Unscheduled campus power outage
- April 30, 2022 7:00pm - May 1, 2022 9:00am EDT
Several Research Computing resources were affected by a campus power outage around 7:00pm EDT. Multiple login and compute nodes may have powered down, causing jobs to fail and/or requeue with a NODE_FAIL or similar status. Engineers are currently d...
Unscheduled Bell outage
- April 9, 2022 9:00pm - April 10, 2022 10:15am EDT
The Bell cluster began experiencing issues with its scratch filesystem around 9:00pm EDT on Saturday, April 9th, 2022. Access to files in scratch may appear severely delayed or frozen. Engineers are currently diagnosing the issue and are working to...
Unscheduled Data Depot Slowdown on Community Clusters
- March 17, 2022 9:00am - 1:30pm EDT
As of 9:00am EDT, users of community clusters may experience slowness while trying to access Data Depot (including loading modules, starting applications, or reading data). The symptoms appear on both login and compute nodes. System engineers are act...
Unscheduled Math Data Center Cooling Outage
- March 16, 2022 11:40am - 2:50pm EDT
The Math building data center began experiencing issues with its cooling system around 11:40am EDT. As one manifestation, users may experience issues while logging in to the Anvil, Bell, Gilbreth, and Halstead clusters. To minimize thermal load on...
Whole-Floor Cluster Maintenance
- March 15, 2022 4:00pm - March 16, 2022 12:00pm EDT
The majority of Research Computing computational resources (Bell, Brown, Geddes, Gilbreth, Halstead, Hammer, Scholar, Weber, and Workbench clusters) will be unavailable March 15, 2022 4:00pm - March 16, 2022 12:00pm EDT during Whole-Floor Data Depo...
Unscheduled Math data center cooling outage
- February 28, 2022 11:40am - 8:00pm EST
The Math building data center began experiencing issues with its cooling system around 11:40am EST. As one manifestation, users may experience issues while logging in to the Anvil, Bell, Gilbreth, Halstead, Workbench, and Data Depot clusters. To m...
Unscheduled Data Depot outage
- February 11, 2022 5:45pm - 8:00pm EST
As of 8:00pm EST on Friday, February 11th, 2022, the Data Depot filesystem outage has been resolved and scheduling has been resumed on all clusters. The Bell, Brown, Gilbreth, Halstead, Scholar, Workbench, and Data Depot clusters began experiencing i...
Unscheduled Bell outage
- January 21, 2022 11:35am - 3:46pm EST
The Bell cluster began experiencing issues with its scheduler database around 11:35am EST. The problem manifests as freezing and/or "socket timed out" and "Unable to contact slurm controller" error messages upon the usual Slurm comman...
Unscheduled Bell outage
- December 25, 2021 6:30pm - 10:00pm EST
The Bell cluster began experiencing issues with its scratch filesystem around 6:30pm EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will prov...
Research Computing Holiday Break
- December 22, 2021 5:00pm - January 3, 2022 8:00am EST
Research Computing personnel will observe the university winter break from 5:00pm EST on Wednesday, December 22nd, 2021, and will resume normal business hours on Monday, January 3rd, 2022. During this time, Research Computing services will conti...
Unscheduled Bell outage
- October 22, 2021 1:20pm - 5:20pm EDT
The Bell cluster began experiencing issues with high load and sluggish performance on the scratch filesystem around 1:20pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this...
