Meeting Recap

Date & Time: 3 April 2025 2PM

Attendees:

  • Yoo, Y

  • Noah GOSCINIAK


1. Executive Summary

Yoo, Young Joon shared expert insights on modern computer vision segmentation techniques. Key recommendations included U-Net variants and Fully Convolutional Network (FCN) deep learning models to address our image segmentation needs.


2. Discussion Points

Computer Vision Segmentation Overview

  • Importance of precise pixel-level classification for our use cases (e.g., medical imagery, scene understanding).

U-Net Family of Models

  • Architecture: Encoder–decoder with skip connections for fine-grained feature recovery.

  • Variants:

    • U-Net++ (nested U-Net) for richer multi-scale feature fusion.

    • Attention U-Net to focus on relevant spatial regions.

  • Pros & Cons:

      • Excellent performance with limited data.

    • – Higher memory footprint; may require patch-based training.

Fully Convolutional Networks (FCNs)

  • Core Idea: Replace fully connected layers with convolutional layers for end-to-end segmentation.

  • Notable Models:

    • FCN-8s for coarse-to-fine upsampling.

    • DeepLab (atrous convolutions + CRF) for sharper boundaries.

  • Pros & Cons:

      • Simpler architecture & faster inference.

    • – Can struggle with small object segmentation without additional post-processing.

Model Selection Criteria

  • Data Volume & Annotation Quality

  • Compute Resources & Inference Latency

  • Target Application Requirements (e.g., boundary precision vs. speed)


3. Decision

Evaluate and prototype both a U-Net variant and an FCN-based model on our dataset to compare accuracy, speed, and resource utilization.


4. Next Steps & Action Items

Set up data pipeline and preprocess images for segmentation tasks Implement baseline U-Net and run initial training/validation Implement baseline FCN-8s (and DeepLab if resources allow) Compare metrics (IoU, pixel accuracy, inference time) Review results with Yoo, Young Joon and refine model selection

Last updated