Skip to main content

Data Challenges

(with available data challenges)
closed
Machine Learning for Signal Processing

Digitization and Classification of ECG Images: The George B. Moody PhysioNet Challenge 2024

Supported by the SPS Challenge Program

The George B. Moody PhysioNet Challenges are annual competitions that invite participants to develop automated approaches for addressing important physiological and clinical problems. The 2024 Challenge invites teams to develop algorithms for digitizing and classifying electrocardiograms (ECGs) captured from images or paper printouts. 

L3DAS22 Machine Learning for 3D Audio Signal Processing: ICASSP 2022

Associated SPS Event: IEEE ICASSP 2022 Grand Challenge

The L3DAS22 Challenge aims at encouraging and fostering research on machine learning for 3D audio signal processing. 3D audio is gaining increasing interest in the machine learning community in recent years. The range of applications is incredibly wide, extending from virtual and real conferencing to autonomous driving, surveillance and many more.

ICAS 2021 Challenge: Self-awareness in Heterogeneous Multi-Robot Systems

A new Challenge on self-awareness in heterogeneous multi-robot systems has been organized within the first International Conference on Autonomous Systems (ICAS 2021). The research field of this competition is the unsupervised anomaly detection through self-aware autonomous systems, which is an active topic involving IEEE Signal Processing Society, also through the Autonomous Systems Initiative, and Intelligent Transportation Systems Society.

AI for Prosthetics Challenge - Reinforcement learning with musculoskeletal models

In this competition, you are tasked with developing a controller to enable a physiologically-based human model with a prosthetic leg to walk and run. You are provided with a human musculoskeletal model, a physics-based simulation environment OpenSim where you can synthesize physically and physiologically accurate motion, and datasets of normal gait kinematics. You are scored based on how well your agent adapts to the requested velocity vector changing in real time.

AutoML for Lifelong Machine Learning

In many real-world machine learning applications, AutoML is strongly needed due to the limited machine learning expertise of developers. Moreover, batches of data in many real-world applications may be arriving daily, weekly, monthly, or yearly, for instance, and the data distributions are changing relatively slowly over time. This presents a continuous learning or Lifelong Machine Learning challenge for an AutoML system.

Signal Processing for Communications and Networking

The First Pathloss Radio Map Prediction Challenge: ICASSP 2023

In wireless communications, the pathloss (or large scale fading coefficient) quantifies the loss of signal strength between a transmitter (Tx) and a receiver (Rx) due to large scale effects, such as free-space propagation loss, and interactions of the radio waves with the obstacles (which block line-of sight, like buildings, vehicles, pedestrians), e.g. penetrations, reflections and diffractions.

ZYELL - NCTUNetwork Anomaly Detection Challenge (ICASSP 2021)

Associated SPS Event: IEEE ICASSP 2021 Grand Challenge

In today’s digital age, network security is critical as billions of computers around the world are connected with each other over networks. Symantec’s Internet Security Threat Report indicates a 56% increase in the number of network attacks in 2019. Network anomaly detection (NAD) is an attempt to detect anomalous network traffic by observing traffic data over time to define what is “normal” traffic and pick out potentially anomalous behavior that differs in some way.

Speech and Language Processing

ICASSP SP Clarity Challenge: Speech Enhancement for Hearing Aids: IEEE ICASSP 2023

This signal processing challenge is designed to get the latest advancements in speech enhancement applied to hearing aids. 430 million people worldwide require rehabilitation to address hearing loss. Yet even in developed countries, only 40% of people who could benefit from hearing aids have them and use them often enough, because they believe that hearing aids perform poorly.

ICASSP2023 General Meeting Understanding and Generation Challenge (MUG): ICASSP 2023

The advent of spoken language processing (SLP) technologies on meeting transcripts is crucial for distilling, organizing, and prioritizing information. Meeting transcripts impose two key challenges to SLP tasks.  First, meeting transcripts exhibit a wide variety of spoken language phenomena, leading to dramatic performance degradation.  Second, meeting transcripts are usually long-form documents with several thousand words or more, posing a great challenge to mainstay Transformer-based models with high computational complexity.

ICASSP 2023 Speech Signal Improvement Challenge: ICASSP 2023

The ICASSP 2023 Speech Signal Improvement Challenge is intended to stimulate research in the area of improving the speech signal quality in communication systems. The speech signal quality can be measured with SIG in ITU-T P.835 and is still a top issue in audio communication and conferencing systems. For example, in the ICASSP 2022 Deep Noise Suppression challenge, the improvement in the background (BAK) and overall (OVRL) quality is impressive, but the improvement in the speech signal (SIG) is statistically zero.

5th Deep Noise Suppression Challenge: IEEE ICASSP 2023

The 5th DNS Challenge aims to motivate development of DNS models with great speech quality in presence of reverberation, noise and interfering (neighboring) talkers. DNS has gained momentum given new trends of hybrid and remote work in a variety of daily-life scenarios. Improving speech quality reduce meeting fatigue and improve clarity of communication. 

Spoken Language Understanding Challenge: ICASSP 2023

Spoken Language Understanding (SLU) is a critical component of conversational voice assistants, requiring converting user utterances into a structured format for task executions. SLU systems typically consist of an ASR component to convert audio to text and an NLU component to convert text to a tree like structure, however recently, E2E SLU systems have also become of increasing interest in order to increase quality, model efficiency, and data efficiency.

LIMMITS’23 - Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-Speech: ICASSP 2023

The LIMMITS’23 challenge on LIghtweight, Multi-speaker, Multi-lingual Indic Text-to-Speech Synthesis is being organized as part of the Signal Processing Grand Challenge track at ICASSP 2023. As a part of this challenge, TTS corpora in Marathi, Hindi, and Telugu datasets will be released. These TTS corpora are being built in the SYSPIN project at SPIRE lab, Indian Institute of Science (IISc) Bangalore, India.

Multimodal Information Based Speech Processing (MISP) 2022 Challenge: IEEE ICASSP 2023

The Multimodal Information Based Speech Processing (MISP) 2022 Challenge aims to extend the application of signal processing technology in specific scenarios, using audio and video data. We target the home TV scenario, where 2-6 people communicate with each other with TV noise in the background. Our new tracks focus on audio-visual speaker diarization (AVSD), and audio-visual diarization and recognition (AVDR).

Deep Noise Suppression Challenge: ICASSP 2022

Associated SPS Event: IEEE ICASSP 2022 Grand Challenge

Noise suppression has become more important than ever before due to the increasing use of voice interfaces for various applications. Given the millions of internet-connected devices being employed for audio/video calls, noise suppression is expected to be effective for all noise types chosen from daily-life scenarios.

SPeech Enhancement for Augmented Reality (SPEAR) challenge 2022

Listening in noisy reverberant environments can be challenging. The recent emergence of hearable devices, such as smart headphones, smart glasses and virtual/augmented reality headsets, presents an opportunity for a new class of speech and acoustic signal processing algorithms which use multimodal sensor data to compensate for, or even exploit, changes in head orientation. 

Multi-Speaker Multi-Style Voice Cloning Challenge (M2VoC) (ICASSP 2021)

Associated SPS Event: IEEE ICASSP 2021 Grand Challenge

Text-to-speech (TTS) or speech synthesis has witnessed significant performance improvement with the help of deep learning. The latest advances in end-to-end text-to-speech paradigm and neural vocoder have enabled us to produce very realistic and natural-sounding synthetic speech reaching almost human-parity performance. But this amazing ability is still limited to the ideal scenarios with a large single-speaker less-expressive training set.

(CHiME 2016) 4th CHiME Speech Separation and Recognition Challenge

This 4th CHiME Speech Separation and Recognition Challenge (CHiME-4) challenge revisits the datasets originally recorded for CHiME-3, i.e., Wall Street Journal corpus sentences spoken by talkers situated in challenging noisy environments recorded using a 6-channel tablet based microphone array. CHiME-4 increases the level of difficulty by constraining the number of microphones available for testing.

(MGB 2015) The Multi-Genre Broadcast (MGB) Challenge

The Multi-Genre Broadcast (MGB) Challenge is an evaluation of speech recognition, speaker diarization, dialect detection and lightly supervised alignment using TV recordings in English and Arabic. The speech data is broad and multi-genre, spanning the whole range of TV output, and represents a challenging task for speech technology. In 2015, the challenge used data from the British Broadcasting Corporation (BBC). It was an official challenge of the 2015 IEEE Automatic Speech Recognition and Understanding Workshop. 

(CHiME 2012) 2nd CHiME Speech Separation and Recognition Challenge

One year ago the 2011 PASCAL CHiME Speech Separation and Recognition Challenge considered the problem of recognising speech mixed in two-channel nonstationary noise typical of everyday listening conditions. Following the success of this challenge we are now organising a new challenge that, while keeping the same setting, extends the difficulty along two independent tracks: a larger vocabulary size and a more realistic mixing process that accounts for small head movements made while speaking.

(CHiME 2011) 1st CHiME Speech Separation and Recognition Challenge

In 2006 the PASCAL network funded the 1st Speech Separation challengewhich addressed the problem of separating and recognising speech mixed with speech. We are now launching a successor to this challenge that aims to tackle speech separation and recognition in more typical everyday listening conditions. The challenge employs noise background that has been collected from a real family living room using binaural microphones. 

Autonomous Systems

ICAS 2021 Challenge: Self-awareness in Heterogeneous Multi-Robot Systems

A new Challenge on self-awareness in heterogeneous multi-robot systems has been organized within the first International Conference on Autonomous Systems (ICAS 2021). The research field of this competition is the unsupervised anomaly detection through self-aware autonomous systems, which is an active topic involving IEEE Signal Processing Society, also through the Autonomous Systems Initiative, and Intelligent Transportation Systems Society.

Data Science

WOSDETC Drone-vs-Bird Detection Challenge: IEEE ICASSP 2023

The challenge aims at identifying novel signal processing solutions for discriminating between birds and drones appearing in video sequences. The specific goal is to detect a drone appearing at some time in a scene where birds may also be present, under different conditions. The algorithm should raise an alarm and provide a position estimate only when a drone is present, while not issuing alarms on birds. A dataset for training is made available upon request.

The First TReNDS Neuroimaging Competition: Multiscanner normative age and assessments prediction with brain function, structure, and connectivity

Translational utility is the ability of certain biomedical imaging features to capture useful subject-level characteristics in clinical settings, yielding sensible descriptions and/or predictions for individualized treatment trajectory. An important step in achieving translational utility is to demonstrate the association between imaging features and individual characteristics, such as sex, age, and other relevant assessments, on a large out-of-sample unaffected population (no diagnosed illnesses). This initial step then provides a strong normative basis for comparison with patient populations in clinical settings. Detailed information. Website.