News: Outages (Ended): Page 4

Outages

All Upcoming Ended

Oldest to Newest Newest to Oldest

Article #815: Unscheduled scratch outage on Hammer
- February 1, 2016 8:00am - February 5, 2016 7:00pm EST
- Hammer
The Hammer scratch filesystem has now returned to normal operations. Original Message: During the maintenance of the Rice and Snyder clusters this wee...
Article #818: Unscheduled outage on Carter
- February 2, 2016 6:00pm - February 3, 2016 10:50pm EST
- Carter
The underlying issues affecting Carter are resolved and job scheduling has been resumed. Many individual nodes remain offline for corrective action,...
Article #819: Unscheduled outage on Carter
- February 4, 2016 8:00am - 10:30am EST
- Carter
The cause of this turned out to be a power loss to Carter's scratch filesystem and portions of the Data Depot, which has been restored now. Carter no...
Article #820: Unscheduled Outage in Math Data Center
- February 4, 2016 8:00am - 10:30am EST
- Carter, Conte, Hansen, Radon, Scholar, Data Depot
Most of the impact of this turned out to be to the Depot storage system, which has now been restored to normal operations. All the other affected sys...
Article #821: Unscheduled scratch outage on Carter
- February 12, 2016 10:20am - 12:00pm EST
- Carter
There was an issue with the cluster's gateway switches, causing infiniband traffic to be incapable of IP over infiniband. This also caused an instabil...
Article #822: Unscheduled outage on Rice and Snyder
- February 17, 2016 7:30pm - 9:15pm EST
- Rice, Snyder
As of 9:15 PM, the Snyder and Rice clusters have been brought back into service after cooling was brought back online. Front-ends are operational and...
Article #825: Unscheduled Outage on Data Depot
- February 23, 2016 11:00am - February 24, 2016 6:00pm EST
- Data Depot
The Depot filesystem checks have all completed cleanly and the Depot has been fully returned to normal operations. All queues on all clusters are sch...
Article #828: ECN services outage - ITaP Research Computing systems impacted
- March 1, 2016 6:30am - 9:00am EST
- Carter, Conte, Hammer, Hansen, Peregrine1, Radon, Rice, Scholar, Snyder
Engineering Computing Network (ECN) will be performing staged patching and reboots of all of ECN's RedHat Linux workstations and servers to protect ag...
Article #831: Unscheduled outage for Peregrine1
- March 7, 2016 12:30pm - 2:30pm EST
- Peregrine1
As of Monday, March 7th, 2016 at 12:30pm EST, the Peregrine1 cluster is unavailable due to a failed network switch in its datacenter. This switch is...
Article #834: Unscheduled outage on Peregrine-1
- March 17, 2016 4:00pm - 6:40pm EDT
- Peregrine1
Outage RESOLVED A misconfiguration that caused an unneeded IB driver to be loaded was fixed. Peregrine-1 is back online. Job scheduling is on. Origi...
Article #835: Standby queues paused on Hansen Cluster
- March 28, 2016 12:15pm - 5:00pm EDT
- Hansen
Update As of 5:20pm the standby and standby-c queues have been started and their jobs are being scheduled for execution. The standby queues on Hansen...
Article #836: Unscheduled Scratch Outage on Carter
- March 30, 2016 4:30pm - March 31, 2016 1:45pm EDT
- Carter, Scholar
The scratch storage on Carter and Scholar has been returned to normal operations. The rebuild process will be continuing in the background, so we wil...
Article #837: Unscheduled Scheduling Outage on Hansen
- April 8, 2016 5:00am - 10:45am EDT
- Hansen
Job scheduling on Hansen has returned to normal. This concludes the outage. Original Message: Hansen is not currently scheduling any new jobs. A file...
Article #844: Unscheduled Scratch Outage on Carter
- April 20, 2016 4:40pm - 8:20pm EDT
- Carter
UPDATE: The issue with Carter's scratch filesystem has been resolved. The filesystem is now available. Job scheduling on the cluster has been resum...
Article #851: Unscheduled Storage Outage
- May 17, 2016 5:30pm - May 18, 2016 12:15am EDT
- Carter, Hammer, Hansen, Hathi, Peregrine1, Radon, Rice, Scholar, Snyder
The Isilon filesystem was restored to normal service and all affected clusters had it remounted as quickly as was sustainable by the filesystem. This...
Article #852: Network issues affecting Snyder cluster
- May 24, 2016 12:00pm - 4:00pm EDT
- Snyder
The problem is now RESOLVED after the reboot of a router. ======= The network serving Snyder is currently experiencing issues. Attempts to log in to t...
Article #853: Campus network outage
- May 26, 2016 7:00am - 1:00pm EDT
Networking to and from campus, and around large parts of campus are down. Many services are unreachable at the moment. We will provide updates as they...
Article #854: Campus networking outage
- May 26, 2016 7:00am - 1:00pm EDT
Networking to and from campus, and around large parts of campus are down. Many services are unreachable at the moment. We will provide updates as they...
Article #855: Unscheduled Storage Outage
- June 7, 2016 4:10pm - 10:00pm EDT
- Conte, Hansen, Hathi, Radon
The underlying storage has been fixed, and all these clusters have been returned to normal operations as of 10:00pm EDT. As of Tuesday, June 7th, 201...
Article #861: ECN Services Outage
- July 16, 2016 8:00am - 12:00pm EDT
- Carter, Conte, Hammer, Hansen, Radon, Rice, Scholar, Snyder
Engineering Computing Network (ECN) will be performing scheduled maintenance this weekend on several ECN server resulting in their unavailability for...

Results 61-80 of 253