Article #562: Scheduling paused on ITaP research clusters
During scheduled maintenance on network equipment connecting storage to ITaP clusters, all scheduling will be paused from 4-6pm. Running jobs...
Update: Noon, 1/8/13 The power issue in MATH has been resolved. Power has been restored to the nodes in the Coates-A subcluster affected by the outage...
Campus chilled water serving the MATH data center is experiencing above-normal temperatures, and as a precaution, scheduling on the Coates, Rossmann,...
Update: As of about 11:00 am, the problem with the chilled water has been corrected, and scheduling has resumed on all RCAC clusters. Thank you for yo...
As of 9:00am, we are seeing a problem with the LustreC scratch filesystem that serves Carter, Hansen, and Peregrine1. To prevent any more jobs from runn...
Update: ITaP engineers have corrected the issue affecting the LustreC filesystem. The system is back in production. Job scheduling on Carter, Hansen a...
Update: 8:12pm Scheduling on Carter has been resumed, and Carter is back in full production. Original Message: Beginning the morning of April 16, a nu...
On April 24, 2013, network engineers will be relocating fiber optics that connect the Peregrine1 cluster to infrastructure in West Lafayette. This out...
Resolved: As of about 4:45pm ET, the connectivity issue affecting the Fortress archive has been resolved. The HPSS archive is back in full production....
Update: May 13, 2013 11:00pm: LustreC has been returned to service. Carter, Hansen, and Peregrine1 are back in production with queues enabled. Update...
Update - 10:15 am Fortress is back in full production. Original Message: As of 8:00am, Thursday, September 19, the Fortress HPSS is temporarily unavai...
On the evening of 10/10/2013, the fileserver providing the "scratch96" filesystem serving some users of the Steele and Radon clusters suffer...
Update: 11:00pm, Nov. 12, 2013 ITaP storage engineers have returned the offline hardware to production and LustreC is back in production. Queues on Ha...
Nearly all major clusters operated by ITaP Research Computing are stopped due to issues with their storage systems relating to the power loss on the W...
The Fortress HPSS Archive is offline due to issues with its storage systems relating to the power loss on the West Lafayette campus in the wake of t...
All ITaP Research Computing systems are currently experiencing an issue with accessing network filesystems. A case has been opened with our vendor as...
Update - 2:25pm, 12/16/2013 The LustreD scratch filesystem has been returned to service and both the filesystem and scheduler appear to be working pro...
The LustreD filesystem, serving the Conte cluster, has become unavailable as of about 8:00 pm Thursday 13 Feb, 2014. System engineers are working to...
The scratch filesystem on Hansen and Carter is currently unavailable due to a hardware issue. Attempts to access scratch will block until the filesyst...
UPDATE: Fortress was successfully returned to service as of 7:35 pm Wednesday, 15 July. As of 8:30am on July 15, 2014, the Fortress HPSS Archive is un...