Skip to main content

Late Reverberation Cancellation Using Bayesian Estimation of Multi-Channel Linear Predictors and Student's t-Source Prior

Multi-channel linear prediction (MCLP) can model the late reverberation in the short-time Fourier transform domain using a delayed linear predictor and the prediction residual is taken as the desired early reflection component. Traditionally, a Gaussian source model with time-dependent precision (inverse of variance) is considered for the desired signal. In this paper, we propose a Student's t-distribution model for the desired signal, which is realized as a Gaussian source with a Gamma distributed precision.

Sound Event Detection in the DCASE 2017 Challenge

Each edition of the challenge on Detection and Classification of Acoustic Scenes and Events (DCASE) contained several tasks involving sound event detection in different setups. DCASE 2017 presented participants with three such tasks, each having specific datasets and detection requirements: Task 2, in which target sound events were very rare in both training and testing data, Task 3 having overlapping events annotated in real-life audio, and Task 4, in which only weakly labeled data were available for training.

About TASLPRO

Scope

The IEEE Transactions on Audio, Speech and Language Processing (TASLPRO) is dedicated to innovative theory and methods for processing signals representing audio, speech and language, and their applications. This includes analysis, synthesis, enhancement, transformation, classification and interpretation of such signals as well as the design, development, and evaluation of associated signal processing systems.

Machine learning and pattern analysis applied to any of the above areas is also welcome.

Tracking Multiple Audio Sources With the von Mises Distribution and Variational EM

In this letter, we address the problem of simultaneously tracking several moving audio sources, namely the problem of estimating source trajectories from a sequence of observed features. We propose to use the von Mises distribution to model audio-source directions of arrival with circular random variables. This leads to a Bayesian filtering formulation, which is intractable because of the combinatorial explosion of associating observed variables with latent variables, over time. We propose a variational approximation of the filtering distribution.

Decision Tree Based Sea-Surface Weak Target Detection With False Alarm Rate Controllable

Aiming at accurate weak sea-surface target detection, this letter devotes to designing a learning-based detector that can work well even in varying detection environments. We first exploit the concept of the fractal theory to extract three representative features in the time and frequency domains and construct a three-dimensional feature space. We then combine the constructed feature space with the decision tree approach to design an environment-adaptive detector.

A Moment-Based Estimation Strategy for Underdetermined Single-Sensor Blind Source Separation

We investigate the blind identification and separation of underdetermined linear instantaneous mixtures with a single sensor and an arbitrary known number of sources with finite known support and uniform distribution. We propose channel estimators based on the high-order statistics of the received signal and on the rotational symmetries of the source constellations. Explicit expressions for distinct and equal rotation orders are derived. The proposed estimators are used as initializers for the iterative least squares with enumeration algorithm to enhance its convergence properties.

Bilevel Optimization Using Stationary Point of Lower-Level Objective Function for Discriminative Basis Learning in Nonnegative Matrix Factorization

In this letter, we address an audio signal separation problem and propose a new effective algorithm for solving a bilevel optimization in discriminative nonnegative matrix factorization (NMF). Recently, discriminative training of NMF bases has been developed for better signal separation in supervised NMF (SNMF), which exploits a priori training of given sample signals.

About SP Letters

Scope

The IEEE Signal Processing Letters is an archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing.