How can robots perceive the world and their own movements so that they accomplish navigation and manipulation tasks? In this module, we will study how images and videos acquired by cameras mounted on robots are transformed into representations like features and optical flow. Such 2D representations allow us then to extract 3D information about where the camera is and in which direction the robot moves. You will come to understand how grasping objects is facilitated by the computation of 3D posing of objects and navigation can be accomplished by visual odometry and landmark-based localization.
- 5 stars59.84%
- 4 stars25.03%
- 3 stars7.55%
- 2 stars4.09%
- 1 star3.46%
Interesting material, the last programming assignment was very challenging. Lots of topics covered, a good introduction.
This is quite challenging course. So far, this is the course with the largest amount of material, I wish the class will be split into two courses.
The content is quite useful but the teaching can be improved upon through shorter videos and more animations instead of hand gestures (or static images) to explain mathematical derivations.
For Computer Vision enthusiast who wants to learn about Multiple View Geometry, this is the best beginner course