SPS Feed

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits

The Latest News, Articles, and Events in Signal Processing

01 Jan 2023

Practical Public Template Attack Attacks on CRYSTALS-Dilithium With Randomness Leakages

IEEE Transactions on Information Forensics and Security

Side-channel security has become a significant concern in the NIST post-quantum cryptography standardization process. The lattice-based CRYSTALS-Dilithium (abbr. Dilithium) became the primary signature standard algorithm recommended by NIST for most use cases in July 2022 due to its excellent performance in security and efficiency. Compared to Dilithium’s rich theoretical security analysis results, the side-channel security of its physical implementations needs to be further explored.

01 Jan 2023

Spatial-Angular Versatile Convolution for Light Field Reconstruction

IEEE Transactions on Computational Imaging

Spatial-angular separable convolution (SAS-conv) has been widely used for efficient and effective 4D light field (LF) feature embedding in different tasks; it mimics a 4D convolution by alternately operating on 2D spatial slices and 2D angular slices. In this paper, we argue that, despite its global intensity modeling capabilities, SAS-conv can only embed local geometry information into the features, resulting in inferior performance in regions with textures and occlusions. Because the epipolar lines are highly related to the scene depth, we introduce the concept of spatial-angular correlated convolution (SAC-conv).
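The alternating spatial and angular filtering described above can be illustrated with a minimal NumPy sketch. It is not the paper's implementation: it assumes a single-channel light field stored as an array of shape (U, V, X, Y), one fixed odd-sized kernel per stage, zero padding, and no nonlinearity. The names `conv2d_same` and `sas_conv` are hypothetical.

```python
import numpy as np

def conv2d_same(x, kernel):
    """Zero-padded 'same' 2D cross-correlation over the last two axes of x.

    Assumes an odd-sized kernel; all leading axes are treated as batch axes.
    """
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    pad = [(0, 0)] * (x.ndim - 2) + [(ph, ph), (pw, pw)]
    xp = np.pad(x, pad)  # constant (zero) padding by default
    h, w = x.shape[-2:]
    out = np.zeros(x.shape, dtype=float)
    for i in range(kh):
        for j in range(kw):
            out += kernel[i, j] * xp[..., i:i + h, j:j + w]
    return out

def sas_conv(lf, k_spatial, k_angular):
    """One spatial-angular separable step on a light field of shape (U, V, X, Y)."""
    # Spatial stage: filter each 2D spatial slice (fixed u, v) over (X, Y).
    spatial = conv2d_same(lf, k_spatial)
    # Angular stage: move (U, V) to the last two axes, then filter each
    # 2D angular slice (fixed x, y) over (U, V).
    angular_in = np.transpose(spatial, (2, 3, 0, 1))   # (X, Y, U, V)
    angular = conv2d_same(angular_in, k_angular)
    return np.transpose(angular, (2, 3, 0, 1))         # back to (U, V, X, Y)
```

Each stage only ever sees a 2D slice, which is why the pair is much cheaper than a full 4D kernel; the sketch leaves out the channels and learned weights a real network layer would have.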

01 Jan 2023

Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features

IEEE Transactions on Audio, Speech and Language Processing

This study proposes a cross-domain multi-objective speech assessment model, called MOSA-Net, which can simultaneously estimate the speech quality, intelligibility, and distortion assessment scores of an input speech signal. MOSA-Net comprises a convolutional neural network and bidirectional long short-term memory architecture for representation extraction, and a multiplicative attention layer and a fully connected layer for each assessment metric prediction. Additionally, cross-domain features (spectral and time-domain features) and latent representations from self-supervised learned (SSL) models are used as inputs to combine rich acoustic information to obtain more accurate assessments.

01 Jan 2023

A Diffeomorphic Flow-Based Variational Framework for Multi-Speaker Emotion Conversion

IEEE Transactions on Audio, Speech and Language Processing

This paper introduces a new framework for non-parallel emotion conversion in speech. Our framework is based on two key contributions. First, we propose a stochastic version of the popular Cycle-GAN model. Our modified loss function introduces a Kullback–Leibler (KL) divergence term that aligns the source and target data distributions learned by the generators, thus overcoming the limitations of sample-wise generation. By using a variational approximation to this stochastic loss function, we show that our KL divergence term can be implemented via a paired density discriminator.

01 Jan 2023

Integrating Lattice-Free MMI Into End-to-End Speech Recognition

IEEE Transactions on Audio, Speech and Language Processing

In automatic speech recognition (ASR) research, discriminative criteria have achieved superior performance in DNN-HMM systems. Given this success, adopting discriminative criteria is a promising way to boost the performance of end-to-end (E2E) ASR systems. With this motivation, previous works have introduced the minimum Bayesian risk (MBR, one of the discriminative criteria) into E2E ASR systems. However, the effectiveness and efficiency of the MBR-based methods are compromised: the MBR criterion is only used in system training, which creates a mismatch between training and decoding.

01 Jan 2023

Decoupling Speaker-Independent Emotions for Voice Conversion via Source-Filter Networks

IEEE Transactions on Audio, Speech and Language Processing

Emotional voice conversion (VC) aims to convert a neutral voice to an emotional one while retaining the linguistic information and speaker identity. We note that the decoupling of emotional features from other speech information (such as content and speaker identity) is the key to achieving promising performance. Some recent attempts at speech representation decoupling on neutral speech do not work well on emotional speech, due to the more complex entanglement of acoustic properties in the latter.

01 Jan 2023

Clean vs. Overlapped Speech-Music Detection Using Harmonic-Percussive Features and Multi-Task Learning

IEEE Transactions on Audio, Speech and Language Processing

Detection of speech and music signals in isolated and overlapped conditions is an essential preprocessing step for many audio applications. Speech signals have wavy and continuous harmonics, while music signals exhibit horizontally linear and discontinuous harmonic patterns. Music signals also contain more percussive components than speech signals, manifested as vertical striations in the spectrograms.

01 Jan 2023

ET: Edge-Enhanced Transformer for Image Splicing Detection

IEEE Signal Processing Letters

A key challenge of image splicing detection is how to localize integral tampered regions without false alarms. Although current forgery detection approaches have achieved promising performance, integrality and false alarms are overlooked. In this paper, we argue that the insufficient use of the splicing boundary is a main reason for poor accuracy. To tackle this problem, we propose an Edge-enhanced Transformer (ET) for tampered region localization. Specifically, to capture rich tampering traces, a two-branch edge-aware transformer is built to integrate the splicing edge clues into the forgery localization network, generating forgery features and edge features.

01 Jan 2023

Learn to Zoom in Single Image Super-Resolution

IEEE Signal Processing Letters

In this letter, we propose a novel solution to the problem of single image super-resolution at multiple scaling factors, with a single network architecture. In applications where only a detail needs to be super-resolved, traditional solutions must choose to use as input either the low-resolution detail, thus losing the information about the context, or the whole low-resolution image and then crop the desired output detail, which is quite wasteful in terms of computations and storage.

01 Jan 2023

Spatial Diversity in Radar Detection via Active Reconfigurable Intelligent Surfaces

IEEE Signal Processing Letters

Active reconfigurable intelligent surfaces (RISs) are a novel and promising technology that allows controlling the radio propagation environment while compensating for the product path loss along the RIS-assisted path. In this letter, we consider the classical radar detection problem and propose to use an active RIS to get a second independent look at a prospective target illuminated by the radar transmitter.

01 Jan 2023

False Discovery Rate (FDR) and Familywise Error Rate (FER) Rules for Model Selection in Signal Processing Applications

IEEE Open Journal of Signal Processing

Model selection is an omnipresent problem in signal processing applications. The Akaike information criterion (AIC) and the Bayesian information criterion (BIC) are the most commonly used solutions to this problem. These criteria have been found to perform satisfactorily in many cases and have played a dominant role in the model selection literature since their introduction several decades ago, despite numerous attempts to dethrone them. Model selection can be viewed as a multiple hypothesis testing problem.
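As background for the AIC and BIC mentioned above, the following is a minimal sketch of how the two criteria are commonly computed for polynomial order selection. It is not the FDR/FER method of the article. It assumes a linear model with Gaussian noise of unknown variance, so that the maximized log-likelihood enters as n ln(RSS/n) up to a constant shared by all candidates, and it counts only the polynomial coefficients as parameters. The function names are hypothetical.

```python
import numpy as np

def aic_bic(rss, n, k):
    """AIC and BIC for a Gaussian linear model, up to a common additive constant.

    rss: residual sum of squares, n: number of samples, k: number of parameters.
    """
    neg2loglik = n * np.log(rss / n)
    aic = neg2loglik + 2 * k            # AIC = 2k - 2 ln L
    bic = neg2loglik + k * np.log(n)    # BIC = k ln n - 2 ln L
    return aic, bic

def select_order(x, y, max_order):
    """Return the polynomial orders minimizing AIC and BIC, plus all scores."""
    n = len(y)
    scores = {}
    for order in range(max_order + 1):
        coeffs = np.polyfit(x, y, order)
        rss = float(np.sum((np.polyval(coeffs, x) - y) ** 2))
        scores[order] = aic_bic(rss, n, order + 1)
    best_aic = min(scores, key=lambda o: scores[o][0])
    best_bic = min(scores, key=lambda o: scores[o][1])
    return best_aic, best_bic, scores
```

Because ln n exceeds 2 once n is at least 8, BIC penalizes each extra parameter more heavily than AIC on all but very small data sets and tends to pick the smaller model when the two disagree.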

01 Jan 2023

Natural Thresholding Algorithms for Signal Recovery With Sparsity

IEEE Open Journal of Signal Processing

The algorithms based on the technique of optimal k-thresholding (OT) were recently proposed for signal recovery, and they are very different from the traditional family of hard thresholding methods. However, the computational cost for OT-based algorithms remains high at the current stage of their development. This stimulates the development of the so-called natural thresholding (NT) algorithm and its variants in this paper. The family of NT algorithms is developed through the first-order approximation of the so-called regularized optimal k-thresholding model, and thus the computational cost for this family of algorithms is significantly lower than that of the OT-based algorithms.
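For context, the traditional hard thresholding family that the abstract contrasts with can be sketched as follows. This is the classical hard thresholding operator and a plain iterative hard thresholding (IHT) loop, not the OT or NT algorithms of the article. The step size, iteration count, and function names are assumptions chosen for illustration, and convergence is not guaranteed for an arbitrary measurement matrix.

```python
import numpy as np

def hard_threshold(x, k):
    """Keep the k largest-magnitude entries of x and set the rest to zero."""
    out = np.zeros_like(x)
    if k > 0:
        idx = np.argsort(np.abs(x))[-k:]
        out[idx] = x[idx]
    return out

def iht(A, y, k, step=1.0, iters=100):
    """Iterative hard thresholding for y ≈ A x with x assumed k-sparse."""
    x = np.zeros(A.shape[1])
    for _ in range(iters):
        # Gradient step on 0.5 * ||y - A x||^2, then projection onto k-sparse vectors.
        x = hard_threshold(x + step * A.T @ (y - A @ x), k)
    return x
```

The operator selects entries purely by magnitude, without checking whether the selection reduces the residual, which is the kind of limitation that motivates the thresholding variants discussed in the abstract.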

01 Jan 2023

Coded Illumination for 3D Lensless Imaging

IEEE Open Journal of Signal Processing

Mask-based lensless cameras offer a novel design for imaging systems by replacing the lens in a conventional camera with a layer of coded mask. Each pixel of the lensless camera encodes the information of the entire 3D scene. Existing methods for 3D reconstruction from lensless measurements suffer from poor spatial and depth resolution.

01 Jan 2023

Towards Better Domain Adaptation for Self-Supervised Models: A Case Study of Child ASR

IEEE Journal of Selected Topics in Signal Processing

Recently, self-supervised learning (SSL) from unlabelled speech data has gained increased attention in the automatic speech recognition (ASR) community. Typical SSL methods include autoregressive predictive coding (APC), Wav2vec2.0, and hidden unit BERT (HuBERT). However, SSL models are biased to the pretraining data. When SSL models are finetuned with data from another domain, domain shifting occurs and might cause limited knowledge transfer for downstream tasks.

01 Jan 2023

Improving Automatic Speech Recognition Performance for Low-Resource Languages With Self-Supervised Models

IEEE Journal of Selected Topics in Signal Processing

Speech self-supervised learning has attracted much attention due to its promising performance in multiple downstream tasks, and has become a new growth engine for speech recognition in low-resource languages. In this paper, we exploit and analyze a series of wav2vec pre-trained models for speech recognition in 15 low-resource languages in the OpenASR21 Challenge.

01 Jan 2023

Self-Supervised Language Learning From Raw Audio: Lessons From the Zero Resource Speech Challenge

IEEE Journal of Selected Topics in Signal Processing

Although supervised deep learning has revolutionized speech and audio processing, it has necessitated the building of specialist models for individual tasks and application scenarios. It is likewise difficult to apply this to dialects and languages for which only limited labeled data is available. Self-supervised representation learning methods promise a single universal model that would benefit a wide variety of tasks and domains.

01 Jan 2023

Self-Supervised Speech Representation Learning: A Review

IEEE Journal of Selected Topics in Signal Processing

01 Jan 2023

Editorial Editorial of Special Issue on Self-Supervised Learning for Speech and Audio Processing

IEEE Journal of Selected Topics in Signal Processing

The papers in this special section focus on self-supervised learning for speech and audio processing. A current trend in the machine learning community is the adoption of self-supervised approaches to pretrain deep networks. Self-supervised learning utilizes proxy-supervised learning tasks (or pretext tasks), for example distinguishing parts of the input signal from distractors or reconstructing masked input segments conditioned on unmasked segments, to obtain training data from unlabeled corpora.

01 Jan 2023

2022 IEEE Signal Processing Society Awards to be Presented in Greece

The IEEE SPS congratulates the following SPS members who will receive the Society’s prestigious awards during ICASSP 2023 in Greece.

01 Jan 2023

SPM Special Issue on Hypercomplex Signal and Image Processing

Novel computational signal and image analysis approaches based on feature-rich mathematical/computational frameworks continue to push the limits of the technological envelope, thus providing optimized and efficient solutions.

SPS Social Media

IEEE SPS Facebook Page https://www.facebook.com/ieeeSPS
IEEE SPS X Page https://x.com/IEEEsps
IEEE SPS Instagram Page https://www.instagram.com/ieeesps/?hl=en
IEEE SPS LinkedIn Page https://www.linkedin.com/company/ieeesps/
IEEE SPS YouTube Channel https://www.youtube.com/ieeeSPS

IEEE SPS Educational Resources

IEEE SPS Resource Center

IEEE SPS YouTube Channel

© Copyright 2025 IEEE - All rights reserved. Use of this website signifies your agreement to the IEEE Terms and Conditions.
A public charity, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.

SPS JSTSP Webinar: Distributed Signal Processing for Extremely Large-Scale Antenna Array Systems

Call for Nominations for Chair, Women in Signal Processing Committee (WISP)

Call for Nominations for Chair, Scholarship Committee
