HDP Developer Apache Pig and Hive (HW HDP PH)

The course is designed for developers who need to create applications to analyze Big Data stored in Apache Hadoop.
  • Learning Style: Virtual Classroom
  • Difficulty: Intermediate
  • Course Duration: 4 Days
Pricing
About Individual Course:
  • Individual course plan gives you access to this course
$2,800.00 / Seat

About this course:

This course is designed for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Pig and Hive. Topics include Hadoop, YARN, HDFS, MapReduce, data ingestion, workflow definition, using Pig and Hive to perform data analytics on Big Data, and an introduction to Spark Core and Spark SQL. This course also prepares students for the HDP Certified Developer (HDPCD) exam, a Big Data Hadoop certification.

The average salary for a Hadoop Developer is $110,000 per year.

Course Objectives:

After completing this course, students will be able to:

  • Describe Hadoop, YARN and use cases for Hadoop
  • Describe Hadoop ecosystem tools and frameworks
  • Describe the HDFS architecture
  • Use the Hadoop client to input data into HDFS
  • Transfer data between Hadoop and a relational database
  • Explain YARN and MapReduce architectures
  • Run a MapReduce job on YARN
  • Use Pig to explore and transform data in HDFS
  • Understand how Hive tables are defined and implemented
  • Use Hive to explore and analyze data sets
  • Use the new Hive windowing functions
  • Explain and use the various Hive file formats
  • Create and populate a Hive table that uses ORC file formats
  • Use Hive to run SQL-like queries to perform data analysis
  • Use Hive to join datasets using a variety of techniques
  • Write efficient Hive queries
  • Create ngrams and context ngrams using Hive
  • Perform data analytics using the DataFu Pig library
  • Explain the uses and purpose of HCatalog
  • Use HCatalog with Pig and Hive
  • Define and schedule an Oozie workflow
  • Present the Spark ecosystem and high-level architecture
  • Perform data analysis with Spark's Resilient Distributed Dataset API
  • Explore Spark SQL and the DataFrame API

Audience:

Software developers who need to understand and develop applications for Hadoop.

Prerequisites:

Students should be familiar with programming principles and have experience in software development. SQL knowledge is also helpful. No prior Hadoop knowledge is required.


More Information:

  • Lab Access: Yes
  • Technology: Hadoop
  • Learning Style: Virtual Classroom
  • Difficulty: Intermediate
  • Course Duration: 4 Days
  • Language: English
  • VPA Eligible: Yes