Computer-Vision-CMU-16720A

Instructor: Deva Ramanan
Semester: Fall 2022

This course introduces the fundamental techniques used in computer vision, that is, the analysis of patterns in visual images to reconstruct and understand the objects and scenes that generated them. Topics covered include image formation and representation, camera geometry, and calibration, computational imaging, multi-view geometry, stereo, 3D reconstruction from images, motion analysis, physics-based vision, image segmentation and object recognition. The material is based on graduate-level texts augmented with research papers, as appropriate.

Homework 1: Spatial Pyramid Matching for Scene Classification

Procedure:

Topics Covered:

Feature Extraction based on Filter Banks
K Means Clustering
Visual Word Dictionary
Scene Classification
Hyperparameters Tuning

Results:

Description.
Visual words for three sample images from the SUN database.

Homework 2: Lucas-Kanade Object Tracking

Procedure:

Topics Covered:

Simple Lucas & Kanade Tracker with Naive Template Update
Lucas & Kanade Tracker with Template Correction
Two-dimensional Tracking with a Pure Translation Warp Function
Two-dimensional Tracking with a Plane Affine Warp Function
Lucas & Kanade Forward Additive Approach
Lucas & Kanade Inverse Compositional Approach

Results:

Description:
Lucas-Kanade tracking using Naive Template Update (purple) versus Template Correction (Red).

Homework 3: Augmented Reality with Planar Homographies

Procedure:

Topics Covered:

Direct Linear Transform
Matrix Decomposition to calculate Homography
Limitations of Planar Homography
FAST Detector and BRIEF Descriptors
Feature Matching
Compute Homography via RANSAC
Automated Homography Estimation and Warping
Augmented Reality Application using Homography
Real-Time Augmented Reality with High FPS
Panorama Generation based on Homography

Results:

Description:
Augmented reality clip, superimposing a video sequence onto a book cover - using Planar Homographies.

Homework 4: 3D Reconstruction

Procedure:

Topics Covered:

Fundamental Matrix Estimation using Point Correspondence
Metric Reconstruction
Retrieval of Camera Matrices up to a Scale and Four-Fold Rotation Ambiguity
Triangulation using the Homogeneous Least Squares Solution
3D Visualization from a Stereo-Pair by Triangulation and 3D Locations Rendering
Bundle Adjustment
Estimated fundamental matrix through RANSAC for noisy correspondences
Jointly optmized reprojection error w.r.t 3D estimated points and camera matrices
Non-linear optimization using SciPy least square optimizer

Results:

Description:
Temple (top) reconstructed in 3D (bottom).

Homework 5: Neural Networks for Recognition

Procedure:

Topics Covered:

Manual Implementation of a Fully Connected Network
Text Extraction from Images of Handwritten Characters
PyTorch Implementation of a Convolutional Neural Network
Fine Tuning of SqueezeNet in PyTorch
Comparison between Fine Tuning and Training from Scratch

Results:

Description:
Neural network text recognition results, based on raw images (example on top).

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
HW1		HW1
HW2		HW2
HW3		HW3
HW4		HW4
HW5		HW5
Results		Results
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Computer-Vision-CMU-16720A

Homework 1: Spatial Pyramid Matching for Scene Classification

Procedure:

Results:

Homework 2: Lucas-Kanade Object Tracking

Procedure:

Results:

Homework 3: Augmented Reality with Planar Homographies

Procedure:

Results:

Homework 4: 3D Reconstruction

Procedure:

Results:

Homework 5: Neural Networks for Recognition

Procedure:

Results:

About

Uh oh!

Releases

Packages

Languages

artrela/Computer-Vision-CMU-16720A

Folders and files

Latest commit

History

Repository files navigation

Computer-Vision-CMU-16720A

Homework 1: Spatial Pyramid Matching for Scene Classification

Procedure:

Results:

Homework 2: Lucas-Kanade Object Tracking

Procedure:

Results:

Homework 3: Augmented Reality with Planar Homographies

Procedure:

Results:

Homework 4: 3D Reconstruction

Procedure:

Results:

Homework 5: Neural Networks for Recognition

Procedure:

Results:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages