
Overview of Data Workbench

The Data Workbench is an interactive compute environment for non-batch big data analysis and simulation, and is part of Purdue's Community Cluster Program. The Data Workbench consists of HP compute nodes, each with two 8-core Intel Xeon processors (16 cores per node) and 256 GB of memory. All nodes are interconnected with 10 Gigabit Ethernet. The Data Workbench entered production on October 1, 2017.

Data Workbench Detailed Hardware Specification

The Data Workbench consists of HP servers, each with two 8-core Intel E5-2660 CPUs, 256 GB of memory, and a 10 Gigabit Ethernet network connection.
Number of Nodes   Processors per Node        Cores per Node   Memory per Node   Interconnect
8                 Two 8-core Intel E5-2670   16               256 GB            10 GigE
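
To compare a node against the specification above, the following is a minimal sketch using standard Linux tools. It assumes you are logged in to a Data Workbench node. The counts reported may differ from the table if hyper-threading is enabled or if per-user resource limits apply.

```shell
# Logical CPU count visible to this session (the specification lists 16 cores per node).
if command -v nproc >/dev/null 2>&1; then
    cores=$(nproc)
else
    cores=$(getconf _NPROCESSORS_ONLN)
fi
echo "logical CPUs: $cores"

# CPU model string as reported by the kernel (Linux only).
if [ -r /proc/cpuinfo ]; then
    grep -m1 'model name' /proc/cpuinfo || true
fi

# Total and available memory in GB (free is part of procps on RHEL).
if command -v free >/dev/null 2>&1; then
    free -g || true
fi
```

The CPU model line is a direct way to see which processor a given node reports.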

Data Workbench nodes run Red Hat Enterprise Linux 7 (RHEL7) and are intended for interactive work via the ThinLinc remote desktop software, JupyterHub, or RStudio Server. Data Workbench provides no batch system.

Operating system patches are applied as security needs dictate. All nodes allow unlimited stack usage and unlimited core dump size (though disk space and server quotas may still be limiting factors). All nodes guarantee even access to CPU and memory resources via Linux cgroups.
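
To see these limits from your own session, the following is a minimal sketch using shell built-ins and standard Linux tools. Checking free space in your home directory is an illustrative choice; the exact quota tooling on Data Workbench is not described here. The cgroup names printed will depend on how the nodes are configured.

```shell
# Stack size and core dump size limits for the current shell.
# Both should report "unlimited" according to the policy described above.
stack_limit=$(ulimit -s)
core_limit=$(ulimit -c)
echo "stack size limit: $stack_limit"
echo "core dump size limit: $core_limit"

# Free disk space where core dumps written from your home directory would land.
df -h "${HOME:-.}" || true

# cgroups that the current shell belongs to (Linux only).
if [ -r /proc/self/cgroup ]; then
    cat /proc/self/cgroup
fi
```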

On Data Workbench, ITaP recommends the following set of compiler and math libraries:

  • Intel 16.0.1.150
  • MKL

This compiler and these libraries are loaded by default. To load the recommended set again:

$ module load rcac

To verify what you loaded:

$ module list
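
For a slightly fuller check, the sketch below reloads the recommended set and then looks for the compiler and MKL. It assumes the Intel module places the `icc` compiler driver on your PATH and that the MKL module exports the `MKLROOT` environment variable. Both are common conventions but are not confirmed by this page.

```shell
# Reload and list modules only if the module command is available in this shell.
if command -v module >/dev/null 2>&1; then
    module load rcac
    module list
else
    echo "module command not found; are you on a Data Workbench node?"
fi

# Assumption: the Intel compiler module puts icc on PATH.
if command -v icc >/dev/null 2>&1; then
    icc --version
else
    echo "icc not found on PATH"
fi

# Assumption: the MKL module exports MKLROOT.
mklroot_report="MKLROOT=${MKLROOT:-<not set>}"
echo "$mklroot_report"
```

If `icc` is found, its version line should correspond to the recommended Intel 16.0.1.150 release.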


© 2017 Purdue University | An equal access/equal opportunity university | Copyright Complaints | Maintained by ITaP Research Computing
