Article #796: Unscheduled Outage on Conte
Update - 9:20pm Conte has been returned to full production as of 9:15pm. During the failure earlier today, the internal tracking of jobs within the sc...
Service was restored around 7:30pm today. Engineers changed the way Samba authenticates users to avoid this problem going forward. -- Service was rest...
October 30, 2015 11:00am ITaP engineers have made additional timeout changes to the scratch filesystem, which have increased stability. Additional work...
**Update: August 25, 2015 9:00 pm** On Monday, August 24, a disk tray in the Rossmann scratch storage system suffered multiple failures and despite g...
UPDATE: As of 8pm on August 15, 2015, the scratch filesystem serving Rossmann is back in full production. Original message: The scratch filesystem servi...
Due to power work in the MSEE building, most ECN services will be unavailable between 6:30am – 9:00pm EDT on Saturday, August 15, 2015. For Research C...
ITaP engineers have identified issues causing intermittent failures on Carter. Engineers are currently tuning parameters on the Depot system that have bee...
Update: The scheduling server has been rebooted and job submissions appear to be working normally again. Please let us know at rcac-help@purdue.edu if...
Due to power work in the MSEE building, most ECN services will be unavailable between 5:30 pm Thursday, 11 June, 2015 and 8:00 am Friday 12 June 2015....
Fortress Samba service has been restored as of 10:15am on Monday, June 8th. We apologize for any inconvenience this has caused and thanks for your pat...
The Hathi Hadoop cluster will be unavailable Monday, 13 April, 2015 from 9:00 am to 1:00 pm. During that time, the cluster hardware will be upgraded w...
UPDATE 4:00 pm Tuesday 3 February 2015 The problem referenced below has been diagnosed and corrected, and job scheduling should work as expected on th...
Update 2: Storage engineers have brought Conte scratch back to full production. Job scheduling on Conte has been restored as of approximately 2:20 pm o...
Engineering Computing Network (ECN) in coordination with Physical Facilities will be conducting a planned power outage in the MSEE building from 8am u...
Power has been restored, and the Peregrine 1 cluster has been restarted and is in full production mode. The Calumet campus is experiencing a power issue...
Due to a network issue at the Indiana GigaPOP, connectivity to RCAC resources from off campus is intermittent. Access to the research computing web si...
UPDATE: LustreD has been returned to service and scheduling has been resumed as of about 4 pm Saturday, October 4th, 2014. The LustreD filesystem, ser...
UPDATE: Fortress was successfully returned to service as of 7:35 pm Wednesday, 15 July. As of 8:30am on July 15, 2014, the Fortress HPSS Archive is un...
The scratch filesystem on Hansen and Carter is currently unavailable due to a hardware issue. Attempts to access scratch will block until the filesyst...
The Lustre D filesystem, serving the Conte cluster, has become unavailable as of about 8:00 pm Thursday 13 Feb, 2014. System engineers are working to...