Skip to main content
Have a request for an upcoming news/science story? Submit a Request

Unscheduled Bell outage

  • Outages
  • Bell

Link to update at January 23, 2021 5:52pm EST UPDATE:

As of 5:45pm, engineers have patched Lustre client software for Bell scratch filesystem and returned the cluster to normal service. Job queues have been enabled and job scheduling has been resumed.

We apologize for the disruption of service. Please report any issues to rcac-help@purdue.edu.

Link to update at January 23, 2021 11:40am EST UPDATE:

Work continues on bringing Bell scratch filesystem to normal operations. Engineers have traced the source of the problem to the faulty Lustre client software and are currently working with the vendor on applying the fix.

We will provide another update by 6pm.

Link to update at January 22, 2021 10:31pm EST UPDATE:

At approximately 10:30pm the problem with Bell scratch has returned. Job scheduling has been paused again while engineers continue to troubleshoot the issue.

We will provide an update by noon tomorrow.

Link to update at January 22, 2021 7:30pm EST UPDATE:

As of 7:30pm EST, the Bell cluster has been returned to normal service. Job queues have been enabled and job scheduling has been resumed. We apologize for the disruption of service. Please report any issues to rcac-help@purdue.edu.

Link to original posting ORIGINAL:

The Bell cluster began experiencing issues with its scratch filesystem around 4:00pm EST.

Engineers are currently diagnosing the issue and have opened a ticket with the vendor to identify a fix. Job scheduling has been paused while this issue is being addressed.

We will provide an update by noon tomorrow.

Originally posted:
Last updated: