Active Learning Image Dataset Labelling
Introduction: The Hidden Cost of Random Labelling in Computer Vision
In modern manufacturing, computer vision systems have become indispensable for automated quality assurance (QA). High-resolution cameras capture millions of images along production lines, and deep learning models classify each product as defective or non-defective. However, building these models requires large, accurately labelled training datasets—and labelling is expensive. A single manufacturing QA project may involve two million images, each requiring human annotation. At typical labelling costs of $0.05–$0.15 per image, a naïve approach of labelling the entire dataset can cost between $100,000 and $300,000.
This case study examines how a computer vision company deploying defect detection for manufacturing QA slashed its labelling costs by 67%—from a projected $200,000 to approximately $66,000—by replacing random sampling with an intelligent active learning loop. The system combined uncertainty sampling, diversity filtering via CLIP embeddings, and a structured labelling workflow managed through Argilla, achieving equal model accuracy with only 660,000 annotations instead of two million. Rare defect class F1 scores improved from 0.71 to 0.89, and time-to-production accelerated from 14 weeks to 9 weeks. This article provides a comprehensive technical analysis of each component, the rationale behind design decisions, and the measurable outcomes achieved.
Background: Manufacturing QA and the Class Imbalance Problem
Manufacturing quality assurance presents a uniquely challenging environment for machine learning. Production lines in electronics, automotive, pharmaceuticals, and semiconductor manufacturing generate vast quantities of image data. However, defect rates in mature manufacturing processes are typically very low—often between 2% and 6%. In the case studied here, the defect rate was approximately 6%, meaning 94% of all captured images showed no defect (the negative class).
This extreme class imbalance creates a paradox. When random sampling is used to select images for annotation, the sampling process mirrors the underlying class distribution: for every 100 images randomly selected, approximately 94 will be non-defect and only 6 will contain defects. This is precisely the inverse of what the model needs. The model already learns to recognise "no defect" easily because those images dominate the training set. What it desperately needs are examples of defects—particularly rare defect sub-types such as micro-cracks, subtle discolorations, or edge chipping that may occur in fewer than 0.5% of production output.
A 2024 systematic review of class imbalance problems in manufacturing confirms that defect detection and predictive maintenance are the domains most severely affected by class imbalance (Li et al., 2024). Traditional solutions include oversampling the minority class using techniques like SMOTE, undersampling the majority class, or applying cost-sensitive learning. While these approaches can help, they fundamentally rely on having at least some labelled examples of the rare classes—which random sampling may fail to provide in sufficient quantity.
The consequence in this case was stark: the initial model trained on randomly sampled data achieved high overall accuracy (94%+), but this was an illusion driven entirely by correct classification of the majority non-defect class. Performance on rare defect sub-types was poor, with an aggregate rare defect F1 score of only 0.71—far below the threshold required for production deployment in a safety-critical manufacturing environment.
The Scale of the Challenge
The training dataset comprised approximately two million images captured from multiple production lines over a six-month period. These images included variations in lighting conditions, camera angles, product positioning, and background clutter. The defect taxonomy included one primary negative class (no defect) and seven positive defect classes of varying rarity:
As the table shows, the four rarest defect classes (each with fewer than 10,000 examples in the full dataset) had F1 scores well below acceptable thresholds. With random sampling producing only a few hundred labelled examples for these classes, the model simply could not learn their visual patterns effectively.
Active Learning: Concepts and Architecture
Active learning is a machine learning paradigm that allows the model to influence which data it learns from, rather than passively accepting a randomly selected training set. The foundational survey by Burr Settles (2009) established the theoretical framework: given a large pool of unlabelled data, an active learning strategy selects the most informative subset for labelling, thereby maximising model improvement per labelled example (Settles, 2009).
The core principle is straightforward: not all training examples are equally valuable. An image that the model classifies with 99% confidence as "no defect" provides almost no new information if labelled. Conversely, an image where the model oscillates between "surface scratch" and "discoloration" with nearly equal probability is highly informative—labelling it resolves a specific area of model uncertainty and can improve classification boundaries.
A comprehensive study published by Springer (2024) demonstrated that active learning in computer vision can achieve 95% of full-dataset performance by annotating only 20–25% of the data (De Lange et al., 2024). Another study in the MDPI journal Electronics presented a cost-aware active learning framework specifically designed for small-object detection in agricultural images, showing significant cost reductions while maintaining detection accuracy (Zhang et al., 2025). These findings directly informed the approach taken in this case study.
The Active Learning Loop Architecture
The system implemented a classic pool-based active learning loop with three key innovations over the standard approach. The loop operated as follows:
Step 1 — Initial Seed: A small set of 50,000 randomly sampled images was labelled to provide the initial training data for the model. This seed set was carefully balanced to include proportional representation of all known defect classes, achieved by oversampling from known-defect production runs.
Step 2 — Model Training: A ResNet-50-based classification model was trained on the labelled set using PyTorch with standard cross-entropy loss. The model predicted class probabilities for all unlabelled images in the pool.
Step 3 — Sample Selection: Uncertainty sampling identified images with predicted class probabilities in the 0.4–0.6 range (i.e., the model was unsure). A diversity filter using k-means clustering on CLIP image embeddings then ensured the selected batch contained visually diverse examples rather than near-duplicates from the same production run.
Step 4 — Labelling: The selected batch was sent to human labellers through the Argilla platform, where annotators classified each image with confidence metadata. A calibration set monitored labeller accuracy.
Step 5 — Iteration: Newly labelled images were added to the training set, and the model was retrained. Steps 2–5 repeated until model performance converged or the annotation budget was exhausted.
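The five steps above can be sketched as a generic pool-based loop, with the project-specific pieces (model training, pool scoring, batch selection, and the Argilla labelling round) injected as callables. This is a minimal illustration of the control flow, not the production code; every function name here is hypothetical:

```python
import numpy as np

def active_learning_loop(pool_size, seed_idx, train_fn, predict_fn, select_fn,
                         label_fn, budget, batch_size):
    """Pool-based active learning (Steps 1-5). `pool_size` is the number of
    unlabelled images; `train_fn`, `predict_fn`, `select_fn`, and `label_fn`
    are stand-ins for the project-specific components."""
    labelled = dict(zip(seed_idx, label_fn(seed_idx)))       # Step 1: seed set
    while len(labelled) < budget:
        model = train_fn(labelled)                           # Step 2: (re)train
        unlabelled = [i for i in range(pool_size) if i not in labelled]
        if not unlabelled:
            break
        probs = predict_fn(model, unlabelled)                # Step 2: score pool
        batch = select_fn(unlabelled, probs, batch_size)     # Step 3: uncertainty + diversity
        if not batch:
            break                                            # nothing informative left
        for i, y in zip(batch, label_fn(batch)):             # Step 4: human labels
            labelled[i] = y
    return labelled                                          # Step 5 repeats until budget/convergence
```

In the real system, `label_fn` is asynchronous (a 3–5 day Argilla labelling round) and `select_fn` combines margin-based uncertainty with CLIP diversity filtering, as described in the following sections.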
This loop ran for 12 iterations over 9 weeks, with each iteration labelling approximately 50,000–60,000 images. The total labelled set reached 660,000 images—a 67% reduction from the full two million.
Uncertainty Sampling: Prioritising the Model's Confusion
Uncertainty sampling is the most widely used acquisition function in active learning. The concept is intuitive: select for labelling the images about which the current model is most uncertain. There are three primary formulations of uncertainty sampling, each defined by how "uncertainty" is measured:
Margin Sampling
Margin sampling selects the image where the difference between the top two predicted class probabilities is smallest. If the model predicts P(defect) = 0.52 and P(no_defect) = 0.48, the margin is only 0.04—indicating the model is nearly undecided. Margin sampling is computationally efficient because it only requires the top two probabilities, not the full distribution.
Entropy-Based Sampling
Entropy-based sampling measures the overall uncertainty in the full probability distribution using Shannon entropy: H(p) = −∑ p(x) log p(x). A perfectly confident prediction (e.g., [0.99, 0.01]) has low entropy, while a uniform prediction (e.g., [0.5, 0.5]) has maximum entropy. This approach is particularly useful in multi-class settings where the margin between the top two classes may not capture uncertainty spread across many classes.
Least Confidence Sampling
Least confidence sampling selects images where the model's maximum predicted probability is lowest. This is the simplest formulation but can be biased toward selecting images from classes the model knows nothing about, potentially over-representing entirely novel patterns rather than boundary cases between known classes.
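All three formulations reduce to a few lines of NumPy over the model's predicted class distributions. Each score below is arranged so that higher means more uncertain (the margin is negated for that reason); this is a generic sketch, not the project's exact code:

```python
import numpy as np

def uncertainty_scores(probs: np.ndarray) -> dict:
    """Compute the three standard uncertainty measures for a batch of predicted
    class distributions (shape: n_samples x n_classes). Higher score = more
    uncertain, for every measure."""
    sorted_p = np.sort(probs, axis=1)[:, ::-1]                 # descending per row
    margin = sorted_p[:, 0] - sorted_p[:, 1]                   # small margin = undecided
    entropy = -np.sum(probs * np.log(probs + 1e-12), axis=1)   # H(p) = -sum p log p
    least_conf = 1.0 - probs.max(axis=1)                       # low top probability
    return {"margin": -margin, "entropy": entropy, "least_confidence": least_conf}
```

For the binary example above, P = [0.52, 0.48] yields a margin of 0.04 and near-maximal entropy, while a confident [0.99, 0.01] prediction scores low on all three measures.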
Implementation in This Case Study
The team implemented a hybrid approach centred on margin sampling with an entropy-based tiebreaker. Specifically, images were ranked by their margin (difference between the top two class probabilities), and the threshold was set to select images with a margin below 0.2—equivalent to predicted probabilities in the 0.4–0.6 range for the top class. This threshold was chosen empirically after experimentation with thresholds of 0.1, 0.15, and 0.2, with 0.2 providing the best balance between batch size and information density.
In practice, this approach dramatically shifted the composition of labelled batches. Under random sampling, roughly 94% of each batch was non-defect. Under uncertainty sampling, the defect proportion rose to approximately 35–45%, because the model was already confident about most non-defect images but uncertain about many defect images—particularly rare defects that shared visual features with the negative class or with other defect types.
Diversity Sampling with CLIP Embeddings
Uncertainty sampling alone has a critical weakness: it can produce highly redundant labelling batches. In a manufacturing QA context, images captured from the same production run within a short time window are often visually nearly identical—same lighting, same camera angle, same background. If the model is uncertain about a particular type of defect, uncertainty sampling may select 50 nearly identical images of that defect, wasting labelling effort on redundant examples.
To address this, the team implemented a diversity filtering mechanism using CLIP (Contrastive Language-Image Pretraining) embeddings and k-means clustering. This two-stage selection process first applied uncertainty sampling to produce a large candidate pool (typically 3–5 times the desired batch size), then used clustering to select a maximally diverse subset.
CLIP Embeddings for Visual Representation
CLIP, developed by OpenAI (Radford et al., 2021), learns joint image-text representations by training on 400 million image-text pairs using a contrastive objective. The image encoder produces a 512-dimensional embedding vector that captures rich semantic and visual information. Crucially, CLIP embeddings encode visual similarity: images that look similar (same defect type, same lighting, same product orientation) produce embeddings that are close in the vector space.
For this application, the team used the ViT-B/32 variant of CLIP, which processes 224×224 pixel images and produces 512-dimensional embeddings. Each unlabelled image in the candidate pool had its CLIP embedding precomputed and cached, making the diversity filtering step extremely fast at query time. The precomputation cost was amortised: computing CLIP embeddings for 2 million images at approximately 5 milliseconds per image required roughly 2.8 hours on a single NVIDIA A100 GPU, a one-time cost.
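The precompute-and-cache step can be sketched as a single pass that memory-maps the 2M × 512 embedding matrix to disk so later selection rounds read it for free. The `embed_batch` function below is a stub standing in for the real CLIP ViT-B/32 image encoder, so the caching logic stays self-contained and runnable:

```python
import numpy as np
from pathlib import Path

EMB_DIM = 512  # CLIP ViT-B/32 embedding dimension

def embed_batch(images: np.ndarray) -> np.ndarray:
    """Stub encoder. In production this would run the CLIP ViT-B/32 image
    encoder on 224x224 inputs; here it returns random vectors so the caching
    pass below can be exercised without the model."""
    rng = np.random.default_rng(0)
    return rng.standard_normal((len(images), EMB_DIM)).astype(np.float32)

def precompute_embeddings(images, cache_path, batch_size: int = 256) -> Path:
    """One-time precomputation: embed every image once, writing directly into
    a memory-mapped .npy file to keep peak RAM usage at one batch."""
    cache_path = Path(cache_path)
    out = np.lib.format.open_memmap(cache_path, mode="w+", dtype=np.float32,
                                    shape=(len(images), EMB_DIM))
    for start in range(0, len(images), batch_size):
        out[start:start + batch_size] = embed_batch(images[start:start + batch_size])
    out.flush()
    return cache_path
```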
K-Means Clustering for Diverse Subset Selection
Given a candidate pool of uncertain images (e.g., 200,000 images from uncertainty sampling), k-means clustering was applied to their CLIP embeddings to group visually similar images together. The number of clusters k was set to match the desired labelling batch size (e.g., 50,000). One representative image was then selected from each cluster—the one closest to the cluster centroid.
This approach ensures several desirable properties. First, the labelling batch covers the full visual diversity of uncertain images, not just the most common visual patterns. Second, near-duplicate frames from the same production run are naturally grouped into the same cluster, and only one representative is selected. Third, rare but visually distinct defect types each form their own clusters and are guaranteed representation in the batch, even if they represent a tiny fraction of the candidate pool.
The MDPI journal Applied Sciences published a related study in 2025 on rarity-aware stratified active learning for class-imbalanced industrial object detection, which explicitly aligns sample selection with class imbalance severity (Chen et al., 2025). While that study used a different clustering approach (density-based DBSCAN rather than k-means), the underlying principle is identical: diversity-aware selection prevents the model from focusing exclusively on common uncertain patterns at the expense of rare but critical ones.
Implementation Details
The k-means implementation used scikit-learn's MiniBatchKMeans with 100 initialisation runs and k-means++ seeding. The choice of MiniBatchKMeans over standard KMeans was driven by scalability: with 200,000 candidate images and 512-dimensional embeddings, standard k-means would require O(n×k×d×iterations) operations per batch, which became computationally prohibitive. MiniBatchKMeans reduces this to O(b×k×d×iterations) where b is the mini-batch size (set to 10,000), reducing wall-clock time from approximately 45 minutes to under 3 minutes per selection round.
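A sketch of the centroid-representative selection with scikit-learn follows. The parameters are demo-scale assumptions (the project used 100 initialisation runs and a mini-batch size of 10,000):

```python
import numpy as np
from sklearn.cluster import MiniBatchKMeans
from sklearn.metrics import pairwise_distances_argmin_min

def diverse_subset(embeddings: np.ndarray, batch_size: int,
                   minibatch: int = 10_000, seed: int = 0) -> np.ndarray:
    """Cluster candidate embeddings into `batch_size` groups and keep the image
    nearest each centroid, so near-duplicates collapse to a single pick."""
    km = MiniBatchKMeans(n_clusters=batch_size, init="k-means++",
                         batch_size=minibatch, n_init=3,  # project used 100 runs
                         random_state=seed)
    km.fit(embeddings)
    # index of the embedding closest to each cluster centre
    nearest, _ = pairwise_distances_argmin_min(km.cluster_centers_, embeddings)
    return np.unique(nearest)
```

Applied to the candidate pool from uncertainty sampling (e.g., 200,000 embeddings with `batch_size=50_000`), this returns one representative index per visual cluster.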
Labelling Workflow Management with Argilla
Argilla is an open-source data annotation platform designed for managing high-quality labelling workflows. Originally developed for NLP tasks, Argilla has been extended to support image classification and provides features critical for production-grade annotation projects: role-based access control, labeller performance monitoring, annotation guidelines, and real-time quality metrics (Argilla Documentation, 2024).
In this project, Argilla served as the central orchestration layer connecting the active learning model (which produced candidate batches) with human labellers (who provided ground-truth labels). The integration was designed around several key principles.
Batch Distribution and Priority Queuing
Each active learning iteration produced a ranked batch of images prioritised by uncertainty score. Argilla received this batch via its REST API and distributed images to labellers based on availability and expertise. Images were not assigned randomly; instead, labellers who had demonstrated higher accuracy on specific defect types were preferentially assigned images of those types. This expertise-aware routing improved labelling accuracy for rare defect classes by approximately 8 percentage points compared to random assignment.
Confidence Metadata and Multi-Annotator Agreement
A critical innovation was the collection of labeller confidence metadata. After classifying each image, labellers were asked to rate their confidence on a three-point scale (Low, Medium, High). This metadata served two purposes. First, low-confidence annotations were flagged for review by a senior labeller or domain expert. Second, the confidence data was used to weight training samples: images labelled with high confidence by multiple annotators were given higher importance during model training, while images with conflicting labels or low confidence were downweighted.
Each image was independently labelled by at least two annotators. When the two annotators disagreed, the image was escalated to a third annotator (a senior inspector with domain expertise). The final label was determined by majority vote, with the senior annotator's label serving as a tiebreaker. This multi-annotator protocol ensured high-quality ground truth data even for visually ambiguous defect types.
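The consensus protocol can be sketched as follows. The confidence-to-weight mapping is an illustrative assumption, not the project's exact scheme; what is grounded in the text is the majority vote, the senior tiebreaker, and the downweighting of contested or low-confidence labels:

```python
from collections import Counter

CONF_WEIGHT = {"Low": 0.5, "Medium": 0.75, "High": 1.0}  # illustrative weights

def resolve_label(annotations, senior_label=None):
    """Majority vote over (label, confidence) pairs from >=2 annotators; on a
    tie, the senior inspector's label decides. Returns (label, weight), where
    the training weight shrinks for contested or low-confidence images."""
    votes = Counter(label for label, _ in annotations)
    top = votes.most_common()
    if len(top) > 1 and top[0][1] == top[1][1]:   # tie -> escalate to senior
        label = senior_label if senior_label is not None else top[0][0]
    else:
        label = top[0][0]
    agreement = votes[label] / len(annotations)
    conf = sum(CONF_WEIGHT[c] for l, c in annotations if l == label) / max(votes[label], 1)
    return label, agreement * conf
```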
Integration Architecture
The Argilla instance was deployed on-premises to comply with data privacy requirements for manufacturing images. The integration with the active learning pipeline was implemented through a lightweight Python middleware layer that: (1) received candidate batches from the selection algorithm, (2) pushed them to Argilla via the REST API, (3) polled Argilla for completed annotations, and (4) fed labelled data back into the training pipeline. The entire middleware consisted of approximately 400 lines of Python code using the argilla Python client library and Celery for asynchronous task management.
Labelling Quality Control: Calibration Set Monitoring
Even the most sophisticated active learning system is undermined if the human labels it trains on are unreliable. Labeller fatigue, misunderstanding of annotation guidelines, and varying levels of domain expertise can introduce systematic labelling errors that degrade model performance. This project addressed this challenge through a calibration set monitoring system.
The Calibration Set
A calibration set of 500 images was carefully curated by senior QA engineers with deep domain expertise. These 500 images were selected to cover all eight classes (one negative + seven defect types) in roughly proportional representation, with deliberate oversampling of rare defect classes to ensure adequate statistical power for those categories. Each image in the calibration set had a consensus gold-standard label verified by at least three senior inspectors.
The calibration set served as a recurring quality check. Every labelling session (typically 2–4 hours) began with the labeller annotating a random subset of 20–30 calibration images intermixed with production images. The labeller was unaware which images were calibration images; they appeared in the normal workflow queue with no special marking.
Agreement Threshold and Intervention Protocol
Each labeller's agreement rate with the gold-standard calibration labels was tracked in real time. The agreement threshold was set at 85%: any labeller whose rolling agreement rate (calculated over their most recent 100 calibration annotations) fell below 85% was automatically flagged for intervention under a three-step escalating protocol.
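The rolling-window check can be sketched as follows. This version waits for a full window before flagging, which is an assumption beyond what is stated above:

```python
from collections import deque

class AgreementMonitor:
    """Rolling agreement tracker: flags a labeller whenever agreement with the
    gold-standard calibration labels, over the most recent `window` calibration
    annotations, falls below `threshold`."""
    def __init__(self, window: int = 100, threshold: float = 0.85):
        self.results = deque(maxlen=window)
        self.threshold = threshold

    def record(self, predicted, gold) -> bool:
        """Record one calibration annotation; return True when intervention
        should be triggered (full window observed and rate below threshold)."""
        self.results.append(predicted == gold)
        rate = sum(self.results) / len(self.results)
        return len(self.results) == self.results.maxlen and rate < self.threshold
```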
Detected Quality Issues
Over the course of the 9-week annotation project, the calibration system detected three significant labeller quality issues that would otherwise have gone unnoticed:
Issue 1 — Systematic Under-Detection of Micro-cracks. A labeller consistently failed to identify micro-crack defects, achieving only 62% agreement on micro-crack calibration images while maintaining 91% agreement on other classes. Investigation revealed the labeller was using a display with insufficient resolution to render micro-cracks visible at the default zoom level. After switching to a higher-resolution display and adjusting zoom defaults, the labeller's micro-crack agreement improved to 88%.
Issue 2 — Confusion Between Discoloration and Foreign Particle. Two labellers showed a systematic pattern of misclassifying foreign particle defects as discoloration, achieving 58% and 64% agreement on the foreign particle class respectively. This was traced to ambiguity in the annotation guidelines, which did not provide sufficiently clear distinguishing criteria between these two visually similar defect types. The guidelines were updated with additional reference images and explicit decision rules, and both labellers' agreement rates recovered to above 90%.
Issue 3 — Late-Session Fatigue Degradation. Analysis of calibration data revealed a statistically significant degradation in labelling accuracy during the final 30 minutes of each 4-hour session. Agreement rates dropped by an average of 7 percentage points (from 92% to 85%) during the last 30 minutes compared to the first 30 minutes. This finding led to a policy change: maximum session length was reduced to 3 hours, and labellers were required to take a 15-minute break after every 90 minutes of continuous annotation. Post-intervention fatigue degradation was reduced to less than 2 percentage points.
Results: Quantitative Outcomes
The active learning approach delivered substantial improvements across all key metrics. The following table summarises the primary outcomes:
The 67% reduction in labelling cost was the headline result, but the improvement in rare defect detection is arguably more significant from a production perspective. The aggregate rare defect F1 improvement from 0.71 to 0.89 brought all rare defect classes above the 0.80 production threshold, meaning the model could now reliably detect defects that previously went undetected. The micro-crack class showed the most dramatic improvement (0.55 to 0.82), driven directly by the diversity sampling mechanism ensuring that visually distinct micro-crack examples were consistently included in labelling batches.
The 36% reduction in time-to-production (from 14 weeks to 9 weeks) was an unexpected but highly valuable outcome. The active learning loop's ability to focus annotation effort on the most informative samples meant that the model converged to production-grade performance faster, even though each iteration cycle (training, selection, labelling, retraining) required coordination across the ML and QA teams. The Argilla platform's workflow management features were instrumental in reducing coordination overhead.
Limitations and Counterarguments
While the results are compelling, several limitations and counterarguments should be considered when evaluating the applicability of this approach to other contexts.
Computational Overhead of the Active Learning Loop
Each iteration of the active learning loop requires full model training, inference on the entire unlabelled pool, CLIP embedding computation (first iteration only, cached thereafter), and k-means clustering. This introduces computational overhead that random sampling does not incur. In this project, each iteration required approximately 4 hours of GPU time on a single A100 (training: 2.5 hours, inference: 1 hour, clustering: 0.5 hours). Over 12 iterations, the total computational cost was approximately 48 GPU-hours, valued at roughly $200–$300 on cloud GPU pricing. While this is negligible compared to the $134,000 in labelling cost savings, it represents a non-trivial engineering investment that may be prohibitive for smaller projects or organisations without access to GPU infrastructure.
Sensitivity to Initial Seed Quality
Active learning is sensitive to the quality of the initial seed set. If the seed set contains systematic biases (e.g., over-representation of certain defect types, or labelling errors), the active learning loop can amplify these biases. In the worst case, a poorly constructed seed set can cause the model to develop confident but incorrect decision boundaries, leading the uncertainty sampler to focus on uninformative edge cases. This project mitigated this risk by using a carefully balanced seed set and by including a cold-start diagnostic that compared seed set class distributions against known production statistics. However, this diagnostic requires prior knowledge of class distributions, which may not be available in greenfield projects.
Labelling Latency and Iteration Cycle Time
The active learning loop assumes that labelled data becomes available quickly enough to inform the next iteration. In this project, each labelling batch took 3–5 days to complete (from batch distribution to all annotations returned). This meant the full 12-iteration loop took 9 weeks, even though the actual computation time was only 48 GPU-hours. For applications where labelled data must be available in near-real-time (e.g., online learning scenarios), this batch-oriented approach may not be suitable. Semi-supervised approaches or single-pass active learning strategies may be more appropriate in latency-sensitive contexts.
Applicability to Other Domains
The specific parameter choices in this project (uncertainty threshold 0.4–0.6, CLIP ViT-B/32 embeddings, k-means clustering) were tuned for this particular dataset and defect taxonomy. Different domains—for example, medical imaging, satellite imagery, or autonomous driving—may require different acquisition functions, embedding models, and clustering strategies. The general principles transfer, but the specific implementation must be adapted to each domain's unique characteristics, including image resolution, class taxonomy complexity, and labelling cost structure.
The 67% Reduction May Not Generalise
The 67% labelling cost reduction is specific to the 94%/6% class imbalance and the particular defect taxonomy in this project. A dataset with more balanced classes (e.g., 70%/30%) would see less dramatic savings, because random sampling would already include a reasonable proportion of minority class examples. Conversely, a dataset with even more extreme imbalance (e.g., 99%/1%) might see even greater savings. The MDPI cost-aware active learning framework for small-object detection (Zhang et al., 2025) suggests that savings typically range from 40–80% depending on class distribution, with higher imbalance yielding higher savings.
Conclusion and Future Outlook
This case study demonstrates that active learning, when implemented with uncertainty sampling, diversity filtering, and robust labelling workflow management, can dramatically reduce the cost and time required to build production-grade computer vision models for manufacturing QA. The 67% labelling cost reduction, combined with a 0.18 improvement in rare defect F1 score and a 36% reduction in time-to-production, provides a compelling evidence base for adopting active learning as a standard practice in industrial computer vision projects.
Several key lessons emerge from this project. First, uncertainty sampling is necessary but not sufficient—without diversity filtering, the labelling budget is wasted on redundant examples. Second, CLIP embeddings provide an effective and scalable foundation for visual diversity assessment, with the one-time embedding computation cost being negligible compared to labelling savings. Third, labelling quality control is not optional—the three quality issues detected by the calibration system would have silently degraded model performance had they gone undetected. Fourth, the Argilla platform (or an equivalent structured annotation tool) is essential for coordinating multi-annotator workflows at production scale.
Looking forward, several directions promise further improvements. Foundation model-based approaches such as Segment Anything Model (SAM) and vision-language models (VLMs) may enable semi-automatic annotation, further reducing human labelling requirements. Research on active learning for vision-language models (arXiv, 2024) proposes frameworks that enhance zero-shot classification by selecting only a few informative samples, potentially reducing the seed set size requirement. Reinforcement learning-based acquisition functions that learn optimal sampling strategies from historical annotation data represent another promising avenue.
The convergence of active learning, foundation models, and structured annotation platforms is creating a new paradigm in which high-quality training datasets can be built at a fraction of the traditional cost and time. For manufacturing QA and other industrial computer vision applications, this paradigm shift is not merely an efficiency gain—it is an enabler that makes previously infeasible projects (e.g., rare defect detection for ultra-low-defect-rate production lines) economically viable. Organisations that adopt these practices early will gain a significant competitive advantage in model quality, deployment speed, and operational cost.
References
[1] Settles, B. (2009). Active Learning Literature Survey. Computer Sciences Technical Report 1648, University of Wisconsin–Madison.
[2] Radford, A., Kim, J. W., Hallacy, C., et al. (2021). Learning Transferable Visual Models From Natural Language Supervision. In Proceedings of ICML 2021. Available at: https://arxiv.org/abs/2103.00020
[3] Li, Y., et al. (2024). Systematic Review of Class Imbalance Problems in Manufacturing. Computers & Industrial Engineering, 196. Available at: https://doi.org/10.1016/j.cie.2024.110498
[5] Zhang, Y., et al. (2025). Cost-Aware Active Learning Framework for Efficient Small-Object Detection. Electronics, 15(6), 1196. Available at: https://www.mdpi.com/2079-9292/15/6/1196
[6] Chen, X., et al. (2025). Rarity-Aware Stratified Active Learning for Class-Imbalanced Industrial Object Detection. Applied Sciences, 16(3), 1236. Available at: https://www.mdpi.com/2076-3417/16/3/1236
[7] Argilla Documentation (2024). The Tool Where Experts Improve AI Models. Available at: https://docs.argilla.io/v2.7
[8] Wang, L., et al. (2024). Deep Learning and Computer Vision Techniques for Enhanced Quality Control in Manufacturing Processes. IEEE Access. Available at: https://ieeexplore.ieee.org/document/10663422