Pig for wrangling Bigdata

Engineers who want to parse and extract useful information from large datasets
  • Self-Paced Learning

    Learning Style
  • Intermediate

    Difficulty
  • 6 Hours

    Course Duration
Pricing
About Individual Course:
  • Individual course plan gives you access to this course
On Sale!
Now Only $10.00 Regular Price $49.00
Now Only $10.00 Regular Price $49.00
/ Each
When you subscribe, you get:
Learn Subscription Plan gives you access to this course, PLUS:

  • 620+ high impact technical, end user and leadership courses
  • Peer to peer learning and access to expert mentors
  • Learner and Manager Analytics
  • Access to Cognitive Learning Research platform to troubleshoot project issues
7-Day FREE Trial
On Sale!
Now Only $14.99 Regular Price $24.99
Now Only $14.99 Regular Price $24.99
/ Month
Team
Pricing
  • Buy 5-9 Enrollments And Save 15% ($12.74 monthly.)
  • Buy 10-19 Enrollments And Save 25% ($11.24 monthly.)
  • Buy 20-above Enrollments And Save 40% ($8.99 monthly.)
Engineers who want to parse and extract useful information from large datasets

Taught by a team which includes 2 Stanford-educated, ex-Googlers  and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data processing jobs. 

Pig is aptly named, it is omnivorous, will consume any data that you throw at it and bring home the bacon!

Let's parse that 

omnivorous: Pig works with unstructured data. It has many operations which are very SQL-like but Pig can perform these operations on data sets which have no fixed schema. Pig is great at wrestling data into a form which is clean and can be stored in a data warehouse for reporting and analysis.

bring home the bacon: Pig allows you to transform data in a way that makes is structured, predictable and useful, ready for consumption.

Audience:
  • Analysts who want to wrangle large, unstructured data into shape
  • Engineers who want to parse and extract useful information from large datasets

Course Objective:

Pig Basics: Scalar and Complex data types (Bags, Maps, Tuples), basic transformations such as Filter, Foreach, Load, Dump, Store, Distinct, Limit, Order by and other built-in functions.

Advanced Data Transformations and Optimizations: The mind-bending Nested Foreach, Joins and their optimizations using "parallel", "merge", "replicated" and other keywords, Co-groups and Semi-joins, debugging using Explain and Illustrate commands

Real-world example: Clean up server logs using Pig

Prerequisites:

Working with Pig requires some basic knowledge of the SQL query language, a brief understanding of the Hadoop eco-system and MapReduce 

More Information
Lab Access No
Learning Style Self-Paced Learning
Difficulty Intermediate
Course Duration 6 Hours
Language English
Write Your Own Review
Only registered users can write reviews. Please Sign in or create an account
Sales Support

Sales (866) 991-3924

Mon-Fri. 8am-6pm CST

Have Questions? Ask Us.

Why QuickStart

Turn Training Into A Personalized Learning Experience


  • Problem Solving through ExpertConnect & Peer-To-Peer Learning
  • Find The Quickest Path To Learn With Career Paths
  • Access All Courses With Master Subscription
  • Manage Your Team With Learning Analytics
  • Virtual Classroom Training & Self-Paced Learning
  • Integrate With Your LMS Through API's