Meeting Recap

Date & Time: 3 April 2025 2PM

Attendees:

Yoo, Y
Noah GOSCINIAK

1. Executive Summary

Yoo, Young Joon shared expert insights on modern computer vision segmentation techniques. Key recommendations included U-Net variants and Fully Convolutional Network (FCN) deep learning models to address our image segmentation needs.

2. Discussion Points

Computer Vision Segmentation Overview

Importance of precise pixel-level classification for our use cases (e.g., medical imagery, scene understanding).

U-Net Family of Models

Architecture: Encoder–decoder with skip connections for fine-grained feature recovery.
Variants:
- U-Net++ (nested U-Net) for richer multi-scale feature fusion.
- Attention U-Net to focus on relevant spatial regions.
Pros & Cons:
- - Excellent performance with limited data.
- – Higher memory footprint; may require patch-based training.

Fully Convolutional Networks (FCNs)

Core Idea: Replace fully connected layers with convolutional layers for end-to-end segmentation.
Notable Models:
- FCN-8s for coarse-to-fine upsampling.
- DeepLab (atrous convolutions + CRF) for sharper boundaries.
Pros & Cons:
- - Simpler architecture & faster inference.
- – Can struggle with small object segmentation without additional post-processing.

Model Selection Criteria

Data Volume & Annotation Quality
Compute Resources & Inference Latency
Target Application Requirements (e.g., boundary precision vs. speed)

3. Decision

Evaluate and prototype both a U-Net variant and an FCN-based model on our dataset to compare accuracy, speed, and resource utilization.

4. Next Steps & Action Items

Set up data pipeline and preprocess images for segmentation tasks Implement baseline U-Net and run initial training/validation Implement baseline FCN-8s (and DeepLab if resources allow) Compare metrics (IoU, pixel accuracy, inference time) Review results with Yoo, Young Joon and refine model selection

Last updated 22 hours ago