Pig for Wrangling Big Data

Engineers who want to parse and extract useful information from large datasets

Taught by a team that includes two Stanford-educated ex-Googlers and two ex-Flipkart Lead Analysts. This team has decades of practical experience working with large-scale data processing jobs.

Pig is aptly named: it is omnivorous, will consume any data that you throw at it, and brings home the bacon!

Let's parse that:

omnivorous: Pig works with unstructured data. Many of its operations are very SQL-like, but Pig can perform them on data sets that have no fixed schema. Pig is great at wrestling data into a clean form that can be stored in a data warehouse for reporting and analysis.

bring home the bacon: Pig allows you to transform data in a way that makes it structured, predictable and useful, ready for consumption.

Target Audience:

  • Analysts who want to wrangle large, unstructured data into shape
  • Engineers who want to parse and extract useful information from large datasets

Course Objective:

Pig Basics: Scalar and Complex data types (Bags, Maps, Tuples), basic transformations such as Filter, Foreach, Load, Dump, Store, Distinct, Limit, Order by and other built-in functions.
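
As a rough illustration of the data model and basic transformations listed above, the sketch below uses Python as an analogy. It is not Pig Latin and not course material. A tuple is mimicked by a Python tuple, a bag by a list of tuples, and a map by a dict. The sample records and field positions are hypothetical.

```python
# Analogy only: Python stand-ins for Pig's complex types.
# tuple -> Python tuple, bag -> list of tuples, map -> dict.
# The sample records below are hypothetical.

records = [                          # a "bag" of "tuples"
    ("alice", 34, {"city": "Pune"}),
    ("bob", 17, {"city": "Delhi"}),
    ("carol", 41, {}),               # map without a "city" key
    ("alice", 34, {"city": "Pune"}),
]

# FILTER: keep tuples whose second field is at least 18.
adults = [t for t in records if t[1] >= 18]

# FOREACH ... GENERATE: project the name and a map lookup.
# A missing key yields None here, standing in for a null.
projected = [(t[0], t[2].get("city")) for t in adults]

# DISTINCT: drop duplicate tuples, keeping first-seen order.
distinct = list(dict.fromkeys(projected))

# ORDER BY name, then LIMIT 2.
top_two = sorted(distinct, key=lambda t: t[0])[:2]

print(top_two)
```

Each step takes a whole collection and returns a new one, which is the same dataflow style in which Pig scripts chain relations.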

Advanced Data Transformations and Optimizations: The mind-bending Nested Foreach, Joins and their optimizations using "parallel", "merge", "replicated" and other keywords, Co-groups and Semi-joins, and debugging using the Explain and Illustrate commands.
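
To give a feel for what the "replicated" keyword asks for, the sketch below shows the general idea in Python: the smaller relation is held in memory as a lookup table while the larger relation is streamed past it. This is an analogy under stated assumptions, not Pig's implementation. The function name `replicated_join` and the sample data are hypothetical.

```python
# Analogy only: the idea behind a "replicated" join, sketched in Python.
# The small relation is held in memory as a lookup table and the large
# relation is streamed past it. All names and data are hypothetical.
from collections import defaultdict


def replicated_join(large, small, large_key, small_key):
    """Inner-join two lists of tuples on the given field positions."""
    # Build the in-memory lookup from the small relation.
    lookup = defaultdict(list)
    for row in small:
        lookup[row[small_key]].append(row)

    # Stream the large relation and emit one output tuple per match.
    for row in large:
        for match in lookup.get(row[large_key], []):
            yield row + match


page_views = [("u1", "/home"), ("u2", "/cart"), ("u3", "/home")]
users = [("u1", "alice"), ("u2", "bob")]

joined = list(replicated_join(page_views, users, 0, 0))
print(joined)
```

The trade-off is that the small relation must fit in memory, which is why this kind of join is treated as an optimization for specific cases rather than a default.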

Real-world example: Clean up server logs using Pig


Working with Pig requires some basic knowledge of the SQL query language and a basic understanding of the Hadoop ecosystem and MapReduce.

More Information
Lab Access: No
Learning Style: Self-Paced Learning
Course Duration: 6 Hours