Events

To register for this course, please go to http://www.odum.unc.edu/datamatters

Data sets continue to grow, seemingly without bound. Hadoop is a framework for dealing with these growing “monsters,” which may include a mixture of complex and structured data. Created at Yahoo from work originally done by Google, Hadoop combines a fast filesystem with a surprisingly simple way to write massive parallel programs that run quickly. It is used in situations where researchers and information specialists would like to run analytics that are computationally extensive. Built on top of its core capabilities are the Pig and Hive database packages, tools which make it feasible to work with trillions of rows. This course will cover installation and use of Hadoop's filesystem, writing parallel programs (using the Map/Reduce paradigm), and the relational algebra and database capabilities of Pig and Hive. The session will include both lecture and in-class exercises.

**Cancellation Policy The following refund policy will apply in the event of cancellations:

• Full refunds, less a 33% cancellation fee, will be issued for any course enrollments canceled by you before May 14, 2014.

• Full refunds, less a 50% cancellation fee, will be issued for any course enrollments canceled by you between May 14, 2014 and June 15, 2014.

• Full refunds, less a 75% cancellation fee, will be issued for any course enrollments canceled by you after June 15, 2014.

• Prior to the event start date, if we cannot accommodate you in a course you registered for, we can issue a credit toward any UNC Odum Institute course (except for ICPSR courses) or issue you a full refund for the course. Credits must be used by September 30, 2014.

Event Title	Hadoop for Huge Data Sets
Location	Friday Center
Sponsor	H.W. Odum Institute
Date/Time	06/25/2014 9:30 AM - 4:00 PM
Event Price