CS 280A: Computer Vision Portfolio

A comprehensive collection of computer vision projects exploring fundamental and advanced techniques in image processing, 3D reconstruction, and generative AI.

🎯 Projects

Project 1: Prokudin-Gorskii Colorizing

Automatic color image recovery from digitized glass plate photographs using a multi-scale pyramid search.

Key Techniques: Image Alignment • Normalized Cross-Correlation (NCC) • Pyramid Search • Image Enhancement

Multi-scale alignment with coarse-to-fine refinement
Automatic contrast, white balance, and color correction
Canny edge-based feature matching for robust alignment
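
A minimal sketch of the coarse-to-fine alignment idea, assuming NumPy, integer shifts, and circular shifting via np.roll. The function names (ncc, best_shift, pyramid_align) and the default search radii are hypothetical and are not taken from the project code. It scores raw intensities rather than Canny edges, and it downsamples by plain subsampling where a real pipeline would blur first.

```python
import numpy as np

def ncc(a, b):
    """Normalized cross-correlation between two equally sized images."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float((a * b).sum() / denom) if denom > 0 else 0.0

def best_shift(ref, img, center=(0, 0), radius=4):
    """Exhaustively search integer (dy, dx) shifts in a window around `center`."""
    best, best_score = center, -np.inf
    for dy in range(center[0] - radius, center[0] + radius + 1):
        for dx in range(center[1] - radius, center[1] + radius + 1):
            score = ncc(ref, np.roll(img, (dy, dx), axis=(0, 1)))
            if score > best_score:
                best, best_score = (dy, dx), score
    return best

def pyramid_align(ref, img, levels=3, radius=4):
    """Estimate the shift at low resolution, then refine it at each finer level."""
    if levels == 0 or min(ref.shape) < 32:
        return best_shift(ref, img, radius=radius)
    # Halve the resolution (assumption: plain subsampling, no anti-alias blur).
    coarse = pyramid_align(ref[::2, ::2], img[::2, ::2], levels - 1, radius)
    # A shift of s pixels at the coarse level is 2s pixels at this level.
    center = (2 * coarse[0], 2 * coarse[1])
    return best_shift(ref, img, center=center, radius=1)
```

The returned (dy, dx) is the shift to apply to img with np.roll so that it lines up with ref. Only the coarsest level pays for a wide search window; every finer level checks a 3×3 neighborhood around the upscaled estimate.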

Project 2: Fun with Filters & Frequencies

Explore edge detection, image sharpening, hybrid images, and multiresolution blending using Laplacian stacks.

Key Techniques: Convolution • Gaussian Pyramids/Laplacian Stacks • Frequency Filtering • Hybrid Images

2D convolution from scratch with zero-padding
Derivative of Gaussian (DoG) for edge detection
Unsharp mask for image sharpening
Multiresolution blending for seamless image composition
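
The convolution and sharpening steps listed above can be sketched as follows. This is an illustrative version that assumes NumPy, grayscale float images, and odd-sized kernels. The names conv2d, gaussian_kernel, and unsharp_mask, along with the default size, sigma, and alpha values, are hypothetical and not the project's own.

```python
import numpy as np

def conv2d(image, kernel):
    """2D convolution with zero-padding; output has the same shape as `image`.

    Assumes an odd-sized kernel so that it has a well-defined center.
    """
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    padded = np.pad(image, ((ph, ph), (pw, pw)), mode="constant")
    flipped = kernel[::-1, ::-1]  # convolution (unlike correlation) flips the kernel
    out = np.zeros(image.shape, dtype=float)
    for i in range(image.shape[0]):
        for j in range(image.shape[1]):
            out[i, j] = np.sum(padded[i:i + kh, j:j + kw] * flipped)
    return out

def gaussian_kernel(size=5, sigma=1.0):
    """Normalized 2D Gaussian built as the outer product of two 1D Gaussians."""
    ax = np.arange(size) - (size - 1) / 2
    g = np.exp(-ax**2 / (2 * sigma**2))
    k = np.outer(g, g)
    return k / k.sum()

def unsharp_mask(image, alpha=1.0, size=5, sigma=1.0):
    """Sharpen by adding back the high frequencies: image + alpha * (image - blurred)."""
    blurred = conv2d(image, gaussian_kernel(size, sigma))
    return image + alpha * (image - blurred)
```

Because the border is padded with zeros, pixels near the edge are blurred toward black, so the unsharp mask brightens them slightly; flat interior regions are left unchanged.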

Project 3: Image Mosaicing

Manual and automatic image stitching using homography estimation, feature detection, and RANSAC for robust alignment.

Key Techniques: Homography • RANSAC • Harris Corner Detection • Feature Descriptors

Single-scale and multi-scale homography computation
Harris corner detection with Adaptive Non-Maximal Suppression (ANMS)
8×8 feature descriptor extraction and matching
Cylindrical warping for wide field-of-view stitching
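
As an illustration of homography estimation with RANSAC, here is a minimal sketch assuming NumPy and point correspondences given as (N, 2) arrays. The function names, iteration count, and pixel threshold are hypothetical choices, not values from the project. It uses the standard direct linear transform solved by SVD and omits the coordinate normalization a more careful version would include.

```python
import numpy as np

def compute_homography(src, dst):
    """Least-squares homography H mapping src -> dst; both are (N, 2) with N >= 4."""
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        rows.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    # The solution is the right singular vector with the smallest singular value.
    _, _, vt = np.linalg.svd(np.asarray(rows, dtype=float))
    H = vt[-1].reshape(3, 3)
    return H / H[2, 2]

def apply_homography(H, pts):
    """Apply H to (N, 2) points using homogeneous coordinates."""
    pts_h = np.hstack([pts, np.ones((len(pts), 1))])
    out = pts_h @ H.T
    return out[:, :2] / out[:, 2:3]

def ransac_homography(src, dst, iters=500, thresh=2.0, seed=0):
    """Fit H on random 4-point samples, keep the largest inlier set, then refit on it."""
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(src), dtype=bool)
    for _ in range(iters):
        idx = rng.choice(len(src), size=4, replace=False)
        H = compute_homography(src[idx], dst[idx])
        err = np.linalg.norm(apply_homography(H, src) - dst, axis=1)
        inliers = err < thresh
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    if best_inliers.sum() < 4:
        raise ValueError("RANSAC found fewer than 4 inliers")
    return compute_homography(src[best_inliers], dst[best_inliers]), best_inliers
```

Refitting on all inliers at the end uses every consistent match rather than only the four sampled points, which generally gives a more stable estimate when the matches are noisy.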

Project 4: Neural Radiance Fields (NeRF)

3D scene reconstruction from multi-view images using neural fields, sinusoidal positional encoding, and volumetric rendering.

Key Techniques: NeRF • PyTorch • Machine Learning • Volumetric Rendering

Multi-view camera calibration with ArUco markers
2D neural field fitting with sinusoidal positional encoding
Full 3D NeRF training on multi-view datasets
Novel view synthesis and depth map rendering
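
Two of the building blocks named above, sinusoidal positional encoding and volumetric rendering, can be sketched in a few lines. The project itself uses PyTorch; this sketch uses NumPy to stay dependency-light. The function names, the frequency scaling by powers of two times pi, and the array shapes are assumptions made for illustration.

```python
import numpy as np

def positional_encoding(x, num_freqs=10):
    """Map x of shape (..., D) to features of shape (..., D * (2 * num_freqs + 1)).

    Output is [x, sin(2^0 pi x), cos(2^0 pi x), ..., sin(2^(L-1) pi x), cos(2^(L-1) pi x)].
    """
    x = np.asarray(x, dtype=float)
    feats = [x]
    for k in range(num_freqs):
        feats.append(np.sin(2.0**k * np.pi * x))
        feats.append(np.cos(2.0**k * np.pi * x))
    return np.concatenate(feats, axis=-1)

def volume_render(sigmas, rgbs, deltas):
    """Composite samples along each ray.

    sigmas: (R, S) densities, rgbs: (R, S, 3) colors, deltas: (R, S) step sizes.
    Returns per-ray colors (R, 3) and per-sample weights (R, S).
    """
    alphas = 1.0 - np.exp(-sigmas * deltas)
    # Transmittance T_i = prod_{j < i} (1 - alpha_j), i.e. an exclusive cumulative product.
    trans = np.cumprod(1.0 - alphas, axis=-1)
    trans = np.concatenate([np.ones_like(trans[..., :1]), trans[..., :-1]], axis=-1)
    weights = trans * alphas
    return (weights[..., None] * rgbs).sum(axis=-2), weights
```

The same weights can be reused for a depth map by replacing rgbs with the sample distances along each ray.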

Project 5A: Diffusion Models & Generative AI

Explore diffusion models for image generation, editing, and synthesis including text-to-image and visual anagrams.

Key Techniques: Diffusion Models • Classifier-Free Guidance • SDEdit • Image Inpainting

Forward and reverse diffusion processes
Iterative denoising with and without text conditioning
Image-to-image translation and style transfer
Visual anagrams and hybrid image generation
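
The forward process and classifier-free guidance can be illustrated with a short sketch. It assumes NumPy and a linear beta schedule with commonly used default values; the schedule, the function names, and the guidance scale are assumptions, not details from the project. A trained denoiser would supply the noise estimates; here they are plain arrays.

```python
import numpy as np

def make_alphas_cumprod(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_t) for a linear beta schedule (assumed values)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def forward_diffuse(x0, t, alphas_cumprod, rng):
    """Sample x_t = sqrt(abar_t) * x0 + sqrt(1 - abar_t) * eps, with eps ~ N(0, I)."""
    eps = rng.standard_normal(x0.shape)
    abar = alphas_cumprod[t]
    return np.sqrt(abar) * x0 + np.sqrt(1.0 - abar) * eps, eps

def estimate_x0(xt, eps_hat, t, alphas_cumprod):
    """Invert the forward equation to get a clean-image estimate from a noise estimate."""
    abar = alphas_cumprod[t]
    return (xt - np.sqrt(1.0 - abar) * eps_hat) / np.sqrt(abar)

def cfg_noise(eps_uncond, eps_cond, scale=7.0):
    """Classifier-free guidance: extrapolate from the unconditional toward the conditional estimate."""
    return eps_uncond + scale * (eps_cond - eps_uncond)
```

With scale set to 1 the guided estimate equals the conditional one, and with scale 0 it equals the unconditional one; values above 1 push further in the direction the condition suggests.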

Project 5B: Flow Matching from Scratch

Train flow matching models on MNIST with time and class conditioning for iterative image generation.

Key Techniques: Flow Matching • Time Conditioning • Class Conditioning • Classifier-Free Guidance

Single-step vs iterative denoising comparison
Time-conditioned UNet architecture
Class-guided generation with one-hot encoding
Training with a fixed, hand-tuned learning rate in place of a learning rate scheduler
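
A minimal sketch of the flow matching setup, assuming NumPy, a straight-line path between a noise sample x0 and a data sample x1, and Euler integration for sampling. The function names and step count are hypothetical. In the project a time-conditioned UNet would play the role of velocity_fn; here it is any callable taking (x, t).

```python
import numpy as np

def flow_matching_pair(x0, x1, t):
    """Build one training example on the straight path from noise x0 to data x1.

    t lies in [0, 1] and must broadcast against x0. The regression target for
    the model evaluated at (xt, t) is the constant velocity x1 - x0.
    """
    xt = (1.0 - t) * x0 + t * x1
    target_velocity = x1 - x0
    return xt, target_velocity

def euler_sample(velocity_fn, x0, num_steps=50):
    """Integrate dx/dt = velocity_fn(x, t) from t = 0 to t = 1 with Euler steps."""
    x = x0.copy()
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = i * dt
        x = x + dt * velocity_fn(x, t)
    return x
```

Training would minimize the squared error between the model's output at (xt, t) and target_velocity. The single-step versus iterative comparison corresponds to calling euler_sample with num_steps set to 1 versus a larger value.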

📋 Technical Highlights

Python & PyTorch — Deep learning implementations
NumPy — Numerical computing and signal processing
OpenCV — Computer vision algorithms
Image Processing — Filtering, registration, reconstruction
Deep Learning — Neural networks for vision tasks
Optimization — RANSAC, pyramid search, gradient descent

🚀 Getting Started

Each project folder contains:

index.html — Interactive project report with visualizations
Results and comparisons
Bells & whistles

🎓 Course Information

CS 280A: Computer Vision — UC Berkeley

This course covers fundamental and advanced topics in computer vision including:

Image formation and geometry
Feature detection and description
Image alignment and registration
3D reconstruction
Neural approaches to vision tasks

📝 License

This portfolio contains course projects from UC Berkeley's CS 280A. Please refer to course policies regarding academic integrity and code sharing.

Author: Joshua Sun
Institution: UC Berkeley
Year: 2025
