Blog

You are here

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.

IEEE Signal Processing Society Blog


The SPS blog aims to raise awareness about signal processing and Society-related topics to a general interest audience in an engaging, informal, and non-technical way. If you're interested in contributing to the SPS blog, please contact the SPS Blog Team at sps-blog@ieee.org for more information.

Revitalizing Underwater Image Enhancement in the Deep Learning Era

By: 
Dr. Chongyi Li

Underwater image enhancement has drawn considerable attention in both image processing and underwater vision. Due to the complicated underwater environment and lighting conditions, enhancing underwater image is a challenging problem. 

Full Story
10 Mar.

How can we make cameras smarter to better analyze humans?

By: 
Dr. Joe (Zhou) Ren

This blog describes 4 computer vision algorithms for better human analysis, that understand human hand, gesture, pose, and action from various input modalities.

Full Story

Deep-learning-based audio-visual speech enhancement

By: 
Dr. Daniel Michelsanti

We all experienced the discomfort of communicating with our friends at a cocktail party or in a pub with loud background music. When difficult acoustic scenarios like these occur, we tend to rely on several visual cues, such as lips and mouth movement of the speaker, in order to understand the speech of interest.

Full Story

PANNs: Large-scale Pretrained Audio Neural Networks for Audio Pattern Recognition

By: 
Dr. Qiuqiang Kong

Audio pattern recognition is an important research topic in the machine learning area, and includes several tasks such as audio tagging, acoustic scene classification, music classification, speech emotion classification and sound event detection. In this blog, we introduce pretrained audio neural networks (PANNs) trained on the large-scale AudioSet dataset. These PANNs are transferred to other audio related tasks. We investigate the performance and computational complexity of PANNs modeled by a variety of convolutional neural networks. We propose an architecture called Wavegram-Logmel-CNN using both log-mel spectrogram and waveform as input feature.

Full Story

Frontal-Centers Guided Face: Boosting Face Recognition by Learning Pose-Invariant Features

By: 
Yingfan Tao

Recent years, face recognition has made a remarkable breakthrough due to the emergence of deep learning. However, compared with frontal face recognition, many deep face recognition models still suffer serious performance degradation when handling profile faces. To address this issue, we propose a novel Frontal-Centers Guided Loss (FCGFace) to obtain highly discriminative features for face recognition. Most existing discriminative feature learning approaches project features from the same class into a separated latent subspace.

Full Story

Recent Advances of Deep Learning within X-ray Security Imaging

By: 
Dr. Samet Akcay

This blog explores modern deep learning applications as well as traditional machine learning techniques for automated X-ray security imaging.

Full Story

Reconfigurable Intelligent Surfaces Aided Robust Systems

By: 
Dr. Gui Zhou and Dr. Cunhua Pan

A framework of robust transmission design for reconfigurable intelligent surfaces (RIS) aided systems has been proposed to address the imperfect cascaded channel state information issue.

Full Story

Advancing Technological Equity in Speech and Language Processing: Aspects, Challenges, Successes, and Future Actions

By: 
Dr. Helen Meng

Recent years have seen great strides being made in R&D of speech and language technologies. As these technologies continue to permeate our daily lives, they need to support diverse users and usage contexts, especially those with inputs that deviate from the mainstream.

Full Story

Model-Driven Deep Learning for MIMO Detection

By: 
Dr. Hengtao He

In this blog, we investigate the model-driven deep learning for multiple input-multiple output (MIMO) detection. In particular, the MIMO detector is specially designed by unfolding an iterative algorithm and adding some trainable parameters.

Full Story

Estimation in Multi-Object State Space Model

By: 
Dr. Ba-Ngu Vo

A brief introduction to state estimation in multi-object system that arises from applications where the number of objects and their states are unknown and vary randomly with time. State space model (SSM) is a fundamental concept in system theory that permeated through many fields of study.

Full Story

Pages

SPS Social Media

IEEE SPS Educational Resources

IEEE SPS Resource Center

IEEE SPS YouTube Channel