In this course, you will learn how to solve problems with large, high-dimensional, and potentially infinite state spaces. You will see that estimating value functions can be cast as a supervised learning problem---function approximation---allowing you to build agents that carefully balance generalization and discrimination in order to maximize reward. We will begin this journey by investigating how our policy evaluation or prediction methods like Monte Carlo and TD can be extended to the function approximation setting. You will learn about feature construction techniques for RL, and representation learning via neural networks and backprop. We conclude this course with a deep-dive into policy gradient methods; a way to learn policies directly without learning a value function. In this course you will solve two continuous-state control tasks and investigate the benefits of policy gradient methods in a continuous-action environment.
提供方
课程信息
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode.
您将获得的技能
- Artificial Intelligence (AI)
- Machine Learning
- Reinforcement Learning
- Function Approximation
- Intelligent Systems
Probabilities & Expectations, basic linear algebra, basic calculus, Python 3.0 (at least 1 year), implementing algorithms from pseudocode.
授课大纲 - 您将从这门课程中学到什么
Welcome to the Course!
On-policy Prediction with Approximation
Constructing Features for Prediction
Control with Approximation
Policy Gradient
审阅
- 5 stars84.32%
- 4 stars12.88%
- 3 stars1.99%
- 2 stars0.53%
- 1 star0.26%
来自PREDICTION AND CONTROL WITH FUNCTION APPROXIMATION的热门评论
This specialization is a gift to humanity. It should have been inscribed into the golden disc of the Voyager and shared with the aliens.
A great and interactive course to learn about using function approximation for control. Great way to learn DRL and its alternatives.
The course was really good one with quizzes to make us remember the important lesson items and well polished Assignments are given which i haven't seen before in coursera
more detailed explanation of some of the assignments and how state values are got with tile coding but overall a great experience!
关于 强化学习 专项课程

常见问题
我什么时候能够访问课程视频和作业?
我订阅此专项课程后会得到什么?
有助学金吗?
还有其他问题吗?请访问 学生帮助中心。