Unscheduled outage on Conte
As of 2:35 pm, the Conte cluster has been returned to service. Scheduling has resumed in all queues.
Update
The source of the problem has been identified and a fix is underway. We anticipate returning Conte to service by 3:00 pm today.
Original message
The Conte cluster is currently experiencing issues related to an InfiniBand library. Our system administrators are working to track down and fix the problem.
Job scheduling on Conte has been paused while engineers address the issue.