Skip to main content
Have a request for an upcoming news/science story? Submit a Request

Big data research with Hadoop and Spark focus of Purdue, National Science Foundation workshop Aug. 4

  • Envision Center (STEW B001)
  • Events

Purdue will host a big data workshop focusing on Hadoop and Spark on Tuesday, Aug. 4, for faculty, staff and students looking to learn more about tools for working with large sets of data.

The workshop will take place from 11 a.m. to 5 p.m. in the Envision Center, Stewart Center, Room B001, which is located off the tunnel between the Stewart Center and the Purdue Memorial Union. Space is limited so participants should register soon. There is no cost to register. The National Science Foundation and ITaP are sponsoring the event.

Participants register with the National Science Foundation Extreme Science and Engineering Discovery Environment (XSEDE), in which Purdue is a partner. A free XSEDE account can be created on the XSEDE user portal at portal.xsede.org.

The mini course will include an introduction to big data, cover applications and platforms for data-intensive research, and offer hands-on exercises with Hadoop and Spark. Purdue is the only site in Indiana hosting the workshop. Participants should be able to write C or Fortran in a Linux environment and to use a Unix text editor such as vi or Emacs. They also should bring a laptop.

In addition to the eight research supercomputing systems it has built for Purdue researchers since 2008 through the Community Cluster Program, ITaP Research Computing also offers a Hadoop cluster for use by faculty, their students and research staff.

The big data workshop is part of a series of high-performance computing training sessions ITaP sponsors at Purdue through XSEDE, says Stephen Harrell, a senior high-performance computing system administrator who coordinates training for ITaP Research Computing.

The big data workshop is delivered nationwide using high-definition video conferencing to allow students to interact in real time with course instructors from the Pittsburgh Supercomputing Center and the National Center for Supercomputing Applications and to work in person with local colleagues and experts. At Purdue, staff from ITaP Research Computing will be on hand.

For more information, email rcac-help@purdue.edu.

Originally posted: