Halstead
-
Halstead Cluster Early Access Policies
Some reminders about the current operational status of Halstead: Halstead is currently in early access testing mode. The system is still being fully qualified by ITaP engineers and during this early access period, the cluster's nodes, network, and m...
-
Transition to hierarchical modules on Halstead
Halstead's module stack is configured in a hierarchical fashion, a change from the "flat" module configuration on previous clusters. This change may require some minimal changes to your job scripts. This change will help prevent errors and...
-
Update: Engineers were able to isolate the problem and restart the necessary systems. The Data Depot should be available again. Halstead users should double check their running work. A portion of the systems serving the Research Data Depot have suffe...
-
The Halstead cluster will be unavailable beginning at Wednesday, December 7th, 2016 at 1:00pm EST, for scheduled early-access maintenance (see Halstead Cluster Early Access Policies). The cluster will return to full production by Wednesday, December...
-
The maintenance work was completed successfully and Halstead has been returned to normal operations as of Wednesday December 14, 2016 at 10:00am. Original message: The Halstead cluster will be unavailable beginning at Wednesday, December 14th, 2016 a...
-
The Halstead cluster is back online as of 4:50 PM after scheduled early-access maintenance. Unfortunately, queued jobs were lost due to complications during maintenance. If you had any jobs queued and waiting before maintenance started, you will need...
-
The Halstead cluster will be unavailable beginning at Wednesday, January 4th, 2017 at 10:00am EST, for scheduled early-access maintenance (see Halstead Cluster Early Access Policies). The cluster will return to full production by Wednesday, January 4...
-
The maintenance work was completed successfully and Halstead has been returned to normal operations as of Wednesday, January 11, 2017 at 12:00pm. Original Message The Halstead cluster will be unavailable beginning at Wednesday, January 11th, 2017 at...
-
Emergency Security Patching of RCAC Clusters
Due to a recent security vulnerability, the Carter, Halstead, Hammer, Radon, Rice, Scholar, and Snyder clusters will have their operating system upgraded to a newer version during February 2, 2017 5:00pm - March 2, 2017 5:00pm EST. Unlike other cl...
-
Halstead MPI problem, scheduling paused
Following the security updates on Halstead, an issue was discovered that prevented multi-node MPI jobs from running properly. Scheduling on Halstead has been stopped, and systems engineers are working on fixing the issue. We will provide further stat...
-
Halstead nodes continue to come back online. While the cluster is operating normally, the total amount of available nodes is not yet at full capacity. We will update on the situation by 6:00pm. Update: Scheduling has been restarted and jobs are cur...
-
The Halstead cluster will be unavailable beginning at Thursday, April 6th, 2017 at 8:00am EDT, for scheduled maintenance. The cluster will return to full production by Thursday, April 6th, 2017 at 11:59pm EDT. During this time, Halstead will have the...
-
Clusters to complete transition to hierarchy modules
The transition to hierarchy modules has completed today, May 9th. If any of your job scripts still have old module names the module load will no longer work so be sure to double check your PBS job scripts and output for any old module names. We have...
-
Engineers have restored failed core servers back to a functional state. Data Depot is up and running as normal and job scheduling resumed. Should you encounter any lingering issues please let us know at rcac-help@purdue.edu Original Message Some core...
-
Nodes have continued to gradually reboot into the new image as jobs complete. At this point, more than 80% of Halstead has completed this process, and we have not seen any issues in them doing so. This outage is closed. Update: May 25, 2017 5:00pm...
-
Removal of netcdf and hdf5 system-level library installations
Research Computing has begun removing several libraries installed at the system-level that should be provided by the module command. These libraries include netcdf, hdf5, and several related packages. This change should have limited impact as module...
-
Unscheduled outages on portions of clusters
Conte, Halstead, HalsteadGPU, and Hammer are back in full production. Job scheduling has been resumed on all clusters. Please let us know if you see any lingering issues at rcac-help@purdue.edu. UPDATE July 20, 2017 2:54pm Power has been restored to...
-
After a long review, Research Computing has determined it is necessary to alter the scratch storage purge policy on all systems. Effective August 28, 2017, all scratch storage systems will begin purging files which have not been accessed (for either...
-
A failure has occurred in the systems which serve Data Depot to the various research clusters. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused on all systems while this issue is being add...
-
Access to Data Depot from the Halstead, HalsteadGPU, Hathi, Rice, Scholar, and Snyder clusters has hung starting around Thursday, September 7th, 2017 at 1:30pm EDT. Engineers are currently working to restore service to these systems. Job scheduling h...