Skip to content

artrela/Computer-Vision-CMU-16720A

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Computer-Vision-CMU-16720A

Instructor: Deva Ramanan
Semester: Fall 2022

This course introduces the fundamental techniques used in computer vision, that is, the analysis of patterns in visual images to reconstruct and understand the objects and scenes that generated them. Topics covered include image formation and representation, camera geometry, and calibration, computational imaging, multi-view geometry, stereo, 3D reconstruction from images, motion analysis, physics-based vision, image segmentation and object recognition. The material is based on graduate-level texts augmented with research papers, as appropriate.


Homework 1: Spatial Pyramid Matching for Scene Classification

Procedure:

bow_diagram

Topics Covered:

  1. Feature Extraction based on Filter Banks
  2. K Means Clustering
  3. Visual Word Dictionary
  4. Scene Classification
  5. Hyperparameters Tuning

Results:

visual-words

Description.
Visual words for three sample images from the SUN database.


Homework 2: Lucas-Kanade Object Tracking

Procedure:

lk-tracking-graphic

Topics Covered:

  1. Simple Lucas & Kanade Tracker with Naive Template Update
  2. Lucas & Kanade Tracker with Template Correction
  3. Two-dimensional Tracking with a Pure Translation Warp Function
  4. Two-dimensional Tracking with a Plane Affine Warp Function
  5. Lucas & Kanade Forward Additive Approach
  6. Lucas & Kanade Inverse Compositional Approach

Results:

lk-results

Description:
Lucas-Kanade tracking using Naive Template Update (purple) versus Template Correction (Red).


Homework 3: Augmented Reality with Planar Homographies

Procedure:

homographies-graphic

Topics Covered:

  1. Direct Linear Transform
  2. Matrix Decomposition to calculate Homography
  3. Limitations of Planar Homography
  4. FAST Detector and BRIEF Descriptors
  5. Feature Matching
  6. Compute Homography via RANSAC
  7. Automated Homography Estimation and Warping
  8. Augmented Reality Application using Homography
  9. Real-Time Augmented Reality with High FPS
  10. Panorama Generation based on Homography

Results:

kf-panda-graphic

Description:
Augmented reality clip, superimposing a video sequence onto a book cover - using Planar Homographies.


Homework 4: 3D Reconstruction

Procedure:

3d-graphic

Topics Covered:

  1. Fundamental Matrix Estimation using Point Correspondence
  2. Metric Reconstruction
  3. Retrieval of Camera Matrices up to a Scale and Four-Fold Rotation Ambiguity
  4. Triangulation using the Homogeneous Least Squares Solution
  5. 3D Visualization from a Stereo-Pair by Triangulation and 3D Locations Rendering
  6. Bundle Adjustment
  7. Estimated fundamental matrix through RANSAC for noisy correspondences
  8. Jointly optmized reprojection error w.r.t 3D estimated points and camera matrices
  9. Non-linear optimization using SciPy least square optimizer

Results:

temple temple-reconstr

Description:
Temple (top) reconstructed in 3D (bottom).


Homework 5: Neural Networks for Recognition

Procedure:

neural-net

Topics Covered:

  1. Manual Implementation of a Fully Connected Network
  2. Text Extraction from Images of Handwritten Characters
  3. PyTorch Implementation of a Convolutional Neural Network
  4. Fine Tuning of SqueezeNet in PyTorch
  5. Comparison between Fine Tuning and Training from Scratch

Results:

raw-note nn-reading

Description:
Neural network text recognition results, based on raw images (example on top).

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages