| IEEE Signal Processing Society

Simultaneous Multifrequency Demodulation for Single-Shot Multiple-Path ToF Imaging

Tags

IEEE TCI Article

Indirect Time-of-Flight (iToF) sensors measure the received signal's phase shift or time delay to calculate depth. In realistic conditions, however, recovering depth is challenging as reflections from secondary scattering areas or translucent objects may interfere with the direct reflection, resulting in inaccurate 3D estimates.

Differentiable Uncalibrated Imaging

Tags

IEEE TCI Article

We propose a differentiable imaging framework to address uncertainty in measurement coordinates such as sensor locations and projection angles. We formulate the problem as measurement interpolation at unknown nodes supervised through the forward operator. To solve it we apply implicit neural networks, also known as neural fields, which are naturally differentiable with respect to the input coordinates. We also develop differentiable spline interpolators which perform as well as neural networks, require less time to optimize and have well-understood properties.

Disentangling Prosody Representations With Unsupervised Speech Reconstruction

Tags

IEEE TASLP Article

Human speech can be characterized by different components, including semantic content, speaker identity and prosodic information. Significant progress has been made in disentangling representations for semantic content and speaker identity in speech recognition and speaker verification tasks respectively. However, it is still an open challenging question to extract prosodic information because of the intrinsic association of different attributes, such as timbre and rhythm, and because of the need for supervised training schemes to achieve robust speech recognition.

Speech Dereverberation With Frequency Domain Autoregressive Modeling

Tags

IEEE TASLP Article

Speech applications in far-field real world settings often deal with signals that are corrupted by reverberation. The task of dereverberation constitutes an important step to improve the audible quality and to reduce the error rates in applications like automatic speech recognition (ASR). We propose a unified framework of speech dereverberation for improving the speech quality and the ASR performance using the approach of envelope-carrier decomposition provided by an autoregressive (AR) model.

Operation-Augmented Numerical Reasoning for Question Answering

Tags

IEEE TASLP Article

Question answering requiring numerical reasoning, which generally involves symbolic operations such as sorting, counting, and addition, is a challenging task. To address such a problem, existing mixture-of-experts (MoE)-based methods design several specific answer predictors to handle different types of questions and achieve promising performance. However, they ignore the modeling and exploitation of fine-grained reasoning-related operations to support numerical reasoning, encountering the inadequacy in reasoning capability and interpretability.

Statistical Analysis for Speaker Recognition Evaluation With Data Dependence and Three Score Distributions

Tags

IEEE TASLP Article

The speaker recognition evaluation is conducted in a framework in which three score distributions and two decision thresholds are employed, and the statistic of interest is an average of the two weighted sums of the probabilities of type I and type II errors at the two thresholds correspondingly. And data dependence caused by multiple use of the same subjects exists ubiquitously in order to generate more samples because of limited resources.

Population Monte Carlo With Normalizing Flow

Tags

IEEE SPL Article

Adaptive importance sampling (AIS) methods provide a useful alternative to Markov Chain Monte Carlo (MCMC) algorithms for performing inference of intractable distributions. Population Monte Carlo (PMC) algorithms constitute a family of AIS approaches which adapt the proposal distributions iteratively to improve the approximation of the target distribution.

ALAN: Self-Attention Is Not All You Need for Image Super-Resolution

Tags

IEEE SPL Article

Vision Transformer (ViT)-based image super-resolution (SR) methods have achieved impressive performance and surpassed CNN-based SR methods by utilizing Multi-Head Self-Attention (MHSA) to model long-range dependencies. However, the quadratic complexity of MHSA and the inefficiency of non-parallelized window partition seriously affect the inference speed, hindering these SR methods from being applied to application scenarios requiring speed and quality.

Learning Deep Scene Curve for Fast and Robust Underwater Image Enhancement

Tags

IEEE SPL Article

Learning-based approaches inspired by the scattering model for enhancing underwater imagery have gained prominence. Nevertheless, these methods often suffer from time-consuming attributable to their sizable model dimensions. Moreover, they face challenges in adapting unknown scenes, primarily because the scattering model's original design was intended for atmospheric rather than marine condition.

Decoding Silent Reading EEG Signals Using Adaptive Feature Graph Convolutional Network

Tags

IEEE SPL Article

Decoding silent reading Electroencephalography (EEG) signals is challenging because of its low signal-to-noise ratio. In addition, EEG signals are typically non-Euclidean structured, therefore merely using a two-dimensional matrix to represent the variation of sampling points of each channel in time cannot richly represent the spatial connection between channels.

Subscribe to

Publications & Resources

Conferences & Events

Professional Development

Community & Involvement

About IEEE SPS

For Volunteers

Simultaneous Multifrequency Demodulation for Single-Shot Multiple-Path ToF Imaging

Tags

Differentiable Uncalibrated Imaging

Tags

Disentangling Prosody Representations With Unsupervised Speech Reconstruction

Tags

Speech Dereverberation With Frequency Domain Autoregressive Modeling

Tags

Operation-Augmented Numerical Reasoning for Question Answering

Tags

Statistical Analysis for Speaker Recognition Evaluation With Data Dependence and Three Score Distributions

Tags

Population Monte Carlo With Normalizing Flow

Tags

ALAN: Self-Attention Is Not All You Need for Image Super-Resolution

Tags

Learning Deep Scene Curve for Fast and Robust Underwater Image Enhancement

Tags

Decoding Silent Reading EEG Signals Using Adaptive Feature Graph Convolutional Network

Tags