NewSale

Pig for wrangling Bigdata

Engineers who want to parse and extract useful information from large datasets
    • Learning Style
      Self-Paced Learning
    • Difficulty
      Intermediate
    • Course Duration
      6 Hours
Engineers who want to parse and extract useful information from large datasets
Start FREE Subscription Trial
Get started with our Learn Subscription Plan that includes this course, PLUS:

  • 328 high impact technical, end user and learning & business management courses
  • 100% online self-paced courses
  • Course completion certificates
  • Live tech support and you will be assigned your personal Learning Concierge
  • 7-Day FREE Trial
    Then Billed
    $24.99
    Every Month Until Canceled
  • Start FREE Trial
Purchase As Individual Course
  • Self-Paced Online Content
  • Attend Course Any Day or Any Time
  • Reports & Statistics
  • Certificate Upon Completion
  • Now Only $50.00 Regular Price $70.00
    Self-Paced Learning
  • Enroll Now
Purchase For Teams
Team Pricing Available - Request A Quote Today!

  • Group Discounts & Private Training Available
  • Free Learning Management Center
  • Group Reporting & Tracking
  • Author / Publish Your Own Courses
  • Request Team Enrollment

Taught by a team which includes 2 Stanford-educated, ex-Googlers  and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in working with large-scale data processing jobs. 

Pig is aptly named, it is omnivorous, will consume any data that you throw at it and bring home the bacon!

Let's parse that 

omnivorous: Pig works with unstructured data. It has many operations which are very SQL-like but Pig can perform these operations on data sets which have no fixed schema. Pig is great at wrestling data into a form which is clean and can be stored in a data warehouse for reporting and analysis.

bring home the bacon: Pig allows you to transform data in a way that makes is structured, predictable and useful, ready for consumption.

Audience:
  • Analysts who want to wrangle large, unstructured data into shape
  • Engineers who want to parse and extract useful information from large datasets

Course Objective:

Pig Basics: Scalar and Complex data types (Bags, Maps, Tuples), basic transformations such as Filter, Foreach, Load, Dump, Store, Distinct, Limit, Order by and other built-in functions.

Advanced Data Transformations and Optimizations: The mind-bending Nested Foreach, Joins and their optimizations using "parallel", "merge", "replicated" and other keywords, Co-groups and Semi-joins, debugging using Explain and Illustrate commands

Real-world example: Clean up server logs using Pig

Prerequisites:

Working with Pig requires some basic knowledge of the SQL query language, a brief understanding of the Hadoop eco-system and MapReduce 

More Information
Lab Access No
Learning Style Self-Paced Learning
Difficulty Intermediate
Course Duration 6 Hours
Language English
Write Your Own Review
You're reviewing:Pig for wrangling Bigdata
Your Rating