Home - iffatAGheyas/learn-computer-vision GitHub Wiki
🧠 Computer Vision: Image & Video Processing — Syllabus
Welcome to your journey into Computer Vision! This syllabus is divided into modules, each linking to its own detailed wiki page.
📅 Duration: 8–10 Weeks (Flexible Pace)
Module 1: Introduction to Computer Vision
📘- What is computer vision?
- Applications in industry and research
- Image formation basics
- Cameras, sensors, and pixels
Module 2: Digital Image Fundamentals
🖼️- Image representation: Grayscale, RGB, etc.
- Color spaces (RGB, HSV, YCrCb)
- Sampling and quantization
- Image histograms and enhancement
Module 3: Image Filtering and Convolution
🎛️- Convolution and correlation
- Smoothing filters (Gaussian, box)
- Edge detection (Sobel, Canny)
- Sharpening and gradient operators
Module 4: Image Segmentation
✂️- Thresholding (global, adaptive, Otsu)
- Region growing, watershed
- Contours and connected components
- Superpixels
Module 5: Feature Detection and Matching
📌- Corner detectors (Harris, Shi-Tomasi)
- Keypoints and descriptors (SIFT, SURF, ORB)
- Feature matching (Brute-force, FLANN)
- Homographies and transformations
Module 6: Motion and Video Analysis
🎞️- Optical flow (Lucas-Kanade, Farneback)
- Background subtraction
- Object tracking (Kalman filters, Mean-shift, CamShift)
- Video stabilization
Module 7: Object Detection & Recognition
🎯- HOG + SVM
- Viola-Jones face detector
- Deep learning-based methods (YOLO, SSD, Faster R-CNN)
- Pretrained models and transfer learning
Module 8: Deep Learning in Computer Vision
🤖- CNN architecture basics
- Image classification with CNNs
- Semantic and instance segmentation (U-Net, Mask R-CNN)
- Applications: OCR, face recognition, medical imaging
Module 9: Project and Applications
🛠️- Choose a mini-project (e.g., real-time object tracker)
- Dataset selection and preparation
- Model training, testing, evaluation
- Deployment tips
✅ Prerequisites
- Basic Python
- Numpy, Matplotlib
- OpenCV
- (Optional) PyTorch or TensorFlow
Stay curious and keep building! 🧑💻✨