Home Filesystem / Slowdown / Login Issue on all Clusters

March 4, 2019  3:10pm – 9:10pm
Brown, Gilbreth, Halstead, Hammer, Rice, Scholar, Snyder, Workbench

UPDATE: March 5, 2019  12:27am

As of 9:10pm, the home filesystem issue was resolved and all clusters were returned to normal service. Please report any issues to rcac-help@purdue.edu.


UPDATE: March 4, 2019  7:57pm

Work continues on bringing the home filesystem performance back to normal operation. We are pursuing multiple approaches on this simultaneously which we believe will help. We will provide another update by 9:00am if operations are not restored overnight.


ORIGINAL: March 4, 2019  4:03pm

All clusters began experiencing issues with logins and a general slowdown around 3:10pm. This has been identified as being due to an issue on the /home/ filesystem. Engineers are continuing to examine this and are working to alleviate the issue right now. Job scheduling has been paused while this issue is being addressed.

No files on the /home/ filesystem are in danger, but performance will be degraded until this is resolved. We will provide an update by 8:00pm.

Originally posted: March 4, 2019  4:03pm