Self-Supervised Representation Learning: Introduction, advances, and challenges

May 2022

SPM Articles

By:

Linus Ericsson; Henry Gouk; Chen Change Loy; Timothy M. Hospedales

Self-supervised representation learning (SSRL) methods aim to provide powerful, deep feature learning without the requirement of large annotated data sets, thus alleviating the annotation bottleneck, one of the main barriers to the practical deployment of deep learning today. These techniques have advanced rapidly in recent years, with their efficacy approaching and sometimes surpassing fully supervised pretraining alternatives across a variety of data modalities, including image, video, sound, text, and graphs. This article introduces this vibrant area, including key concepts, the four main families of approaches and associated state-of-the-art techniques, and how self-supervised methods are applied to diverse modalities of data. We further discuss practical considerations, including workflows, representation transferability, and computational cost. Finally, we survey major open challenges in the field that provide fertile ground for future work.

Deep neural networks (DNNs) now underpin state-of-the-art artificial intelligence (AI) systems for analysis of diverse data types. However, the conventional paradigm has been to train these systems using supervised learning, where performance has grown roughly logarithmically with annotated data set sizes. The cost of such annotation has proven to be a scalability bottleneck for the continued advancement of state-of-the-art performance, and a more fundamental barrier for the deployment of DNNs in application areas where data and annotations are intrinsically rare, costly, dangerous, or time consuming to collect.

This situation has motivated a wave of research in SSRL, where freely available labels from carefully designed pretext tasks are used as supervision to discriminatively train deep representations. The resulting representations can then be reused for training a DNN to solve a downstream task of interest using far less task-specific annotated data than conventional supervised learning requires.

Self-supervision refers to learning tasks that ask a DNN to predict one part of the input data (or a label programmatically derivable from it) given another part of the input. This is in contrast to supervised learning, which asks the DNN to predict a manually provided target output, and generative modeling, which asks a DNN to estimate the density of the input data or learn a generator for input data. Self-supervised algorithms differ primarily in their strategy for defining the derived labels to predict. This choice of pretext task determines the (in)variances of the resulting learned representation and thus how effective it is for different downstream tasks.
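To make the idea of a programmatically derived label concrete, the following is a minimal sketch of two common pretext tasks: predicting how far an image was rotated, and predicting a hidden token from its context. It is an illustration under stated assumptions rather than a method taken from the article. The function names (`rotate90`, `rotation_pretext`, `masking_pretext`), the toy list-of-lists "image", and the `[MASK]` placeholder are all hypothetical, and no network is trained here; the sketch only shows how (input, target) pairs can be built without any manual annotation.

```python
import random


def rotate90(grid, k):
    """Rotate a 2D list clockwise by k quarter turns and return a new grid."""
    for _ in range(k % 4):
        # Reversing the rows and then transposing is a 90-degree clockwise turn.
        grid = [list(row) for row in zip(*grid[::-1])]
    return [list(row) for row in grid]


def rotation_pretext(grid, rng):
    """Build one rotation-prediction example.

    The label is derived from the transformation itself (the number of
    quarter turns), so no human annotation is needed.
    """
    k = rng.randrange(4)
    return rotate90(grid, k), k


def masking_pretext(tokens, rng, mask="[MASK]"):
    """Build one masked-prediction example.

    One part of the input is hidden, and the target is that hidden part.
    Returns the visible sequence and a (position, original token) target.
    """
    i = rng.randrange(len(tokens))
    visible = tokens[:i] + [mask] + tokens[i + 1:]
    return visible, (i, tokens[i])


# Toy usage on hypothetical data.
rng = random.Random(0)
image = [[1, 2], [3, 4]]
rotated, turns = rotation_pretext(image, rng)
sentence = ["labels", "come", "from", "the", "data"]
visible, target = masking_pretext(sentence, rng)
```

The two tasks encourage different properties: a model trained on rotation prediction must become sensitive to orientation, whereas a model trained with masking must capture context around the hidden element. This is one simple way to see why the choice of pretext task shapes the (in)variances of the learned representation.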

Read on IEEE Xplore

Tags:

SPM Article

IEEE Signal Processing Magazine

SPM May 2022


