Feb. 4 high-performance computing mini course will focus on big data, Hadoop and SPARQL

January 22, 2014 12:00am - December 31, 2014 11:59pm EST
DLR 221
Events

Purdue will host a workshop in February for graduate students, post-doctoral researchers, faculty and staff looking to gain skills in working with big data using applications such as Hadoop and SPARQL and leveraging the power of Purdue’s community clusters.

The one-day big data session will take place from 11 a.m. to 5 p.m. on Tuesday, Feb. 4, in the Discovery Learning Research Center, Room 221, at Purdue’s Discovery Park. Space is limited so participants should register soon. There is no cost to register. The National Science Foundation and ITaP are sponsoring the event.

Participants register with the National Science Foundation Extreme Science and Engineering Discovery Environment (XSEDE), in which Purdue is a partner. A free XSEDE account can be created on the XSEDE user portal at portal.xsede.org.

The mini course will include hands-on practice and cover such topics as HDFS (Hadoop Distributed File System), Hadoop programming and streaming, and the SPARQL query language, as well as broader subjects like the graph analytic approach to data analysis, including real-world examples.

The workshop is part of a series of high-performance computing training sessions being held by XSEDE. ITaP plans to host others at Purdue in 2014, says Verónica Vergara, a scientific applications analyst who coordinates training for ITaP Research Computing (RCAC).

The workshop is delivered nationwide using high-definition video conferencing to allow participants to interact in real time with course instructors from the Pittsburgh Supercomputing Center and to work in person with local colleagues and experts. At Purdue, staff from ITaP Research Computing (RCAC) will be on hand.

For more information, email rcac-help@purdue.edu.

Originally posted: January 22, 2014

Feb. 4 high-performance computing mini course will focus on big data, Hadoop and SPARQL

Follow Us