Unscheduled Bell outage
As of 3:43pm EDT, affected nodes have been isolated and rest of the Bell cluster has been returned to normal service. Job queues have been enabled and job scheduling has been resumed. We apologize for the disruption of service. Please report any issues to email@example.com.
A section of the Bell cluster compute nodes began experiencing issues with power feed and cooling around 2:30pm EDT. Engineers have powered down affected nodes and are working to identify a fix. Some jobs may have ended up terminated or requeued.
Job scheduling has been paused while this issue is being addressed. We will provide an update by 6pm tonight.