Hortonworks HDP Developer Enterprise Apache Spark I (HW HDP Spark)

This course is designed as an entry point for developers.
  • Virtual Classroom

    Learning Style
  • Intermediate

    Difficulty
  • 4 Days

    Course Duration
Pricing
About Individual Course:
  • Individual course plan gives you access to this course
$2,800.00
$2,800.00
/ Seat
This course is designed as an entry point for developers.

About this course:

This course is designed as an entry point for developers who need to create applications to analyze Big Data stored in Apache Hadoop using Spark. This course also prepares the students for the HDP Apache Spark Developer Certification exam.

Topics include:

  • An overview of the Hortonworks Data Platform (HDP), including HDFS and YARN
  • Using Spark Core APIs for interactive data exploration
  • Spark SQL and DataFrame operations
  • Spark Streaming and DStream operations
  • Data visualization, reporting, and collaboration
  • Performance monitoring and tuning
  • Building and deploying Spark applications
  • Introduction to the Spark Machine Learning Library

 The average salary for a Hadoop Developer is $110,000 per year.

Course Objectives:

After completing this course, students will be able to:

  • Describe Hadoop, HDFS, YARN, and the HDP ecosystem
  • Describe Spark use cases
  • Explore and manipulate data using Zeppelin
  • Explore and manipulate data using a Spark REPL
  • Explain the purpose and function of RDDs
  • Employ functional programming practices
  • Perform Spark transformations and actions
  • Work with Pair RDDs
  • Perform Spark queries using Spark SQL and DataFrames
  • Use Spark Streaming stateless and window transformations
  • Visualize data, generate reports, and collaborate using Zeppelin
  • Monitor Spark applications using Spark History Server
  • Learn general application optimization guidelines/tips
  • Use data caching to increase performance of applications
  • Build and package Spark applications
  • Deploy applications to the cluster using YARN
  • Understand the purpose of Spark MLlib

Audience:

Software engineers that are looking to develop in-memory applications for time sensitive and highly iterative applications in an Enterprise HDP environment.

Prerequisites:

Students should be familiar with programming principles and have previous experience in software development using either Python or Scala. Previous experience with data streaming, SQL, and HDP is also helpful, but not required.

Suggested prerequisites courses:

More Information
Lab Access Yes
Technology Hadoop
Learning Style Virtual Classroom
Difficulty Intermediate
Course Duration 4 Days
Language English
VPA Eligible VPA Eligible
Write Your Own Review
Only registered users can write reviews. Please Sign in or create an account
Sales Support

Sales (866) 991-3924

Mon-Fri. 8am-6pm CST

Have Questions? Ask Us.

Why QuickStart

Turn Training Into A Personalized Learning Experience


  • Problem Solving through ExpertConnect & Peer-To-Peer Learning
  • Find The Quickest Path To Learn With Career Paths
  • Access All Courses With Master Subscription
  • Manage Your Team With Learning Analytics
  • Virtual Classroom Training & Self-Paced Learning
  • Integrate With Your LMS Through API's