Big data research focus of free Purdue, National Science Foundation workshop Feb. 19-20

February 19, 2020  11:00am – February 20, 2020  12:00pm
Envision Center (STEW B001)

Purdue will host a free two-day big data workshop focusing on topics such as Hadoop and Spark on Wednesday, Feb. 19, and Thursday, Feb. 20, for faculty, staff and students who want to learn more about tools for working with large sets of data.

The workshop will take place from 11 a.m. to 5 p.m. both days in the Envision Center, Stewart Center, Room B001-D, which is located off the tunnel between the Stewart Center and the Purdue Memorial Union. The second day’s subject matter will build on the first day, so participants should attend both sessions.

Space is limited to 25 participants. Those wishing to attend should register soon. Participants should bring their own laptop with a terminal client such as PuTTY or MobaXterm installed.

There is no cost to register. The event is sponsored by ITaP and the National Science Foundation Extreme Science and Engineering Discovery Environment (XSEDE), as well as the Pittsburgh Supercomputing Center.

Participants register with XSEDE, in which Purdue is a partner. A free XSEDE account can be created on the XSEDE user portal at Once you have an account, you can register through the XSEDE portal. More information is available here.

The mini course covers the basics of using Hadoop and Spark in a Linux environment, as well as machine learning with Spark and deep learning with TensorFlow. There are no prerequisites, but familiarity with Python will be helpful.

The supercomputers that ITaP Research Computing makes available to Purdue researchers through the Community Cluster Program support software that allows a user to deploy high-performance Hadoop clusters on demand.

This big data workshop is part of a series of high-performance computing training sessions ITaP sponsors at Purdue through XSEDE, says Eric Adams, who coordinates training for ITaP Research Computing.

The workshop is delivered nationwide using high-definition video conferencing to allow students to interact in real time with course instructors from the Pittsburgh Supercomputing Center and to work in person with local colleagues and experts. At Purdue, staff from ITaP Research Computing will be on hand.

For more information, email

Originally posted: February 3, 2020  1:46pm