Pig for wrangling Bigdata
Self-Paced LearningLearning Style
6 HoursCourse Duration
Select A Class Schedule
| Learn Subscription |
7-Day FREE Trial
Now Only $14.99 Regular Price $24.99/ Month
|Individual Course |
Now Only $10.00 Regular Price $49.00/ Each
Taught by a team which includes 2 Stanford-educated, ex-Googlers and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data processing jobs.
Pig is aptly named, it is omnivorous, will consume any data that you throw at it and bring home the bacon!
Let's parse that
omnivorous: Pig works with unstructured data. It has many operations which are very SQL-like but Pig can perform these operations on data sets which have no fixed schema. Pig is great at wrestling data into a form which is clean and can be stored in a data warehouse for reporting and analysis.
bring home the bacon: Pig allows you to transform data in a way that makes is structured, predictable and useful, ready for consumption.
- Analysts who want to wrangle large, unstructured data into shape
- Engineers who want to parse and extract useful information from large datasets
Pig Basics: Scalar and Complex data types (Bags, Maps, Tuples), basic transformations such as Filter, Foreach, Load, Dump, Store, Distinct, Limit, Order by and other built-in functions.
Advanced Data Transformations and Optimizations: The mind-bending Nested Foreach, Joins and their optimizations using "parallel", "merge", "replicated" and other keywords, Co-groups and Semi-joins, debugging using Explain and Illustrate commands
Real-world example: Clean up server logs using Pig
Working with Pig requires some basic knowledge of the SQL query language, a brief understanding of the Hadoop eco-system and MapReduce
Self-Paced Learning Outline
You, This Course and Us
Where does Pig fit in?
Pig Operations And Data Transformations
Advanced Data Transformations
Optimizing Data Transformations
A real-world example
Installing Hadoop in a Local Environment
|Learning Style||Self-Paced Learning|
|Course Duration||6 Hours|
Turn Training Into A Personalized Learning Experience
- Project Problem Solving through ExpertConnect & Peer-To-Peer Learning
- Career Paths - Find The Quickest Path To What You Need To Learn
- One Subscription, All Access - Take Multiple Courses & Become An Expert
- Manage Your Organization Through Learner & Manager Analytics
- Virtual Instructor-Led Training Supplemented With Self-Paced Learning
- Integrate With Your LMS Through API's
Sales (866) 991-3924
Mon-Fri. 8am-6pm CST