Unscheduled Bell outage

Search
RSS Feeds
Announcements
Events
Outages and Maintenance
Science Highlights

Unscheduled Bell outage

September 27, 2022 2:30pm - 3:45pm EDT
Outages
Bell

Link to update at September 27, 2022 3:43pm EDT UPDATE: September 27, 2022 3:43pm EDT

As of 3:43pm EDT, affected nodes have been isolated and rest of the Bell cluster has been returned to normal service. Job queues have been enabled and job scheduling has been resumed. We apologize for the disruption of service. Please report any issues to rcac-help@purdue.edu.

Link to original posting ORIGINAL: September 27, 2022 2:30pm EDT

A section of the Bell cluster compute nodes began experiencing issues with power feed and cooling around 2:30pm EDT. Engineers have powered down affected nodes and are working to identify a fix. Some jobs may have ended up terminated or requeued.

Job scheduling has been paused while this issue is being addressed. We will provide an update by 6pm tonight.

Originally posted: September 27, 2022 3:23pm EDT