Big data research with Hadoop and Spark focus of Purdue, National Science Foundation workshop November 1

November 1, 2016  11:00am – 5:00pm
Envision Center (STEW B001)

Purdue will host a big data workshop focusing on Hadoop and Spark on Tuesday, November 1, for faculty, staff and students looking to learn more about tools for working with large sets of data.

The workshop will take place from 11 a.m. to 5 p.m. in the Envision Center, Stewart Center, Room B001-D, which is located off the tunnel between the Stewart Center and the Purdue Memorial Union. Space is limited to 25 participants, so those wishing to attend should register soon. There is no cost to register. The event is sponsored by ITaP and the National Science Foundation's Extreme Science and Engineering Discovery Environment (XSEDE), as well as the Pittsburgh Supercomputing Center and the National Center for Supercomputing Applications.

Participants must register through XSEDE, in which Purdue is a partner. A free XSEDE account can be created on the XSEDE user portal at portal.xsede.org. Once you have an account, you can register for the workshop through the portal. Slides and more information are available here.

The mini course will include an introduction to big data and hands-on exercises with Hadoop and Spark, as well as an overview of related XSEDE computing resources. Participants should bring a laptop and be able to write C or Fortran in a Linux environment and to use a UNIX text editor such as vi or Emacs.

In addition to the general-purpose research supercomputers ITaP Research Computing makes available to Purdue researchers through the Community Cluster Program, ITaP also offers a Hadoop cluster for use by faculty, their students and research staff.

The big data workshop is part of a series of high-performance computing training sessions ITaP sponsors at Purdue through XSEDE, says Randy Herban, who coordinates training for ITaP Research Computing.

The big data workshop is delivered nationwide using high-definition video conferencing. This allows students to interact in real time with course instructors from the Pittsburgh Supercomputing Center and the National Center for Supercomputing Applications, and to work in person with local colleagues and experts. At Purdue, staff from ITaP Research Computing will be on hand.

For more information, email rcac-help@purdue.edu.

Originally posted: October 13, 2016  10:00am