Computer Vision
Image processing to deep learning. OpenCV, PIL, CNNs, transfer learning, object detection with YOLO, segmentation, OCR, and deploying vision models to production.
FundamentalsTopics 1–10
- ·What is Computer Vision?
- ·Images as Numpy Arrays
- ·OpenCV Basics
- ·Reading & Displaying
- ·Color Spaces (BGR/RGB/HSV)
- ·Image Transformations
- ·Convolution & Kernels
- ·Edge Detection
- ·Thresholding & Drawing
- ·PIL/Pillow & Image Pyramids
Start Fundamentals →
IntermediateTopics 11–20
- ·CNN Fundamentals
- ·Feature Maps & Pooling
- ·Backbones (ResNet, EfficientNet)
- ·Transfer Learning
- ·Data Augmentation
- ·Training a Classifier
- ·Batch Norm & Regularisation
- ·Object Detection (YOLO)
- ·Model Evaluation
- ·Datasets & Labels
Start Intermediate →
AdvancedTopics 21–30
- ·Modern Detection (YOLOv8+)
- ·Semantic Segmentation
- ·Instance & Panoptic Segmentation
- ·Pose Estimation & Keypoints
- ·Vision Transformers (ViT)
- ·Depth Estimation & 3D
- ·OCR & Document Understanding
- ·Multimodal Vision (CLIP & beyond)
- ·Tracking & Optical Flow
- ·Generative Vision (Diffusion & GANs)
Start Advanced →
ProductionTopics 31–40
- ·Model Optimisation & Quantisation
- ·ONNX & Runtime Engines
- ·Serving (FastAPI · Triton)
- ·Batch & Dynamic Batching
- ·GPU Inference at Scale
- ·Edge & Mobile Deployment
- ·Video & Streaming Pipelines
- ·Monitoring & Drift
- ·A/B Testing & Rollout
- ·Cost, Throughput & SLOs
Start Production →