Reinforcement Learning Explained

Learn how to frame reinforcement learning problems, tackle classic examples, and explore basic algorithms from dynamic programming and temporal difference learning, then progress to larger state spaces using function approximation and DQN (Deep Q-Network).

About Individual Course:
  • Individual course plan gives you access to this course
$99.00 / Each
When you subscribe, you get:
  • Learn Subscription plan gives you access to this course and over 820 other popular courses
On Sale! Now Only $39.99 / Month (Regular Price $44.99)
Team Pricing
  • Buy 5-9 Enrollments And Save 68% ($12.74 monthly)
  • Buy 10-19 Enrollments And Save 72% ($11.24 monthly)
  • Buy 20 or more Enrollments And Save 78% ($8.99 monthly)
Course Information

About this course:

Reinforcement Learning (RL) is an area of machine learning in which an agent learns to achieve a goal by interacting with its environment.

In this course, you will be introduced to the world of reinforcement learning. You will learn how to frame reinforcement learning problems and start tackling classic examples such as news recommendation, learning to navigate in a grid-world, and balancing a cart-pole.

You will explore the basic algorithms, from multi-armed bandits through dynamic programming and TD (temporal difference) learning, and progress towards larger state spaces using function approximation, in particular deep learning. You will also learn about algorithms that search for the best policy directly, using policy gradient and actor-critic methods. Along the way, you will be introduced to Project Malmo, a platform for Artificial Intelligence experimentation and research built on top of the Minecraft game.

Course Objective:

  • Reinforcement Learning Problem
  • Markov Decision Process
  • Bandits
  • Dynamic Programming
  • Temporal Difference Learning
  • Approximate Solution Methods
  • Policy Gradient and Actor Critic
  • RL that Works

Audience:

  • Data Analysts
  • Programmers

Prerequisite:

  • There are no prerequisites for this course

More Information

  • Brand: Microsoft
  • Subjects: App Development
  • Lab Access: No
  • Technology: Microsoft
  • Learning Style: Self-Paced Learning
  • Difficulty: Advanced
  • Course Duration: 42 Hours
  • Language: English
