π’ 1. Semantic Segmentation
- Semantic Segmentation aims to predict a category for each pixel in an image.
- Example: An image with 2 dogs, 1 cat, and a wolf β output a pixel-wise mask with class labels.
- Common in autonomous driving, medical imaging, and scene understanding.
πͺ 2. Sliding Window in Segmentation
- Idea: Slide a window across the image β classify the central pixel.
- β
Works in theory, but:
- β Super computationally expensive.
- Many redundant computations.
- Doesnβt scale well to high-res images.
π 3. Fully Convolutional Networks (FCN)
- FCNs solve this by removing fully connected layers.
- They apply convolution and upsampling across the entire image.
- β
Output: Dense pixel-wise predictions.
- Faster and more efficient than sliding windows.
β¬οΈ 4. Upsampling & Unpooling
β
Upsampling
- Increases the spatial resolution of feature maps.
- Examples: Displaying low-res images on high-res screens.