TASLPRO Featured Articles

One of the challenges in computational acoustics is the identification of models that can simulate and predict the physical behavior of a system generating an acoustic signal. Whenever such models are used for commercial applications, an additional constraint is the time to market, making automation of the sound design process desirable.

Constrained Learned Feature Extraction for Acoustic Scene Classification

TASLP Volume 27 Issue 8

Deep neural networks (DNNs) have been proven to be powerful models for acoustic scene classification tasks. State-of-the-art DNNs have millions of connections and are computationally intensive, making them difficult to deploy on systems with limited resources.

Tailoring an Interpretable Neural Language Model

TASLPRO Featured Articles

TASLP Volume 27 Issue 7

Neural networks have shown great potential in language modeling. Currently, the dominant approach to language modeling is based on recurrent neural networks (RNNs) and convolutional neural networks (CNNs). Nonetheless, it is not clear why RNNs and CNNs are suitable for the language modeling task since these neural models are lack of interpretability.

Robust Joint Estimation of Multimicrophone Signal Model Parameters

TASLPRO Featured Articles

TASLP Volume 27 Issue 7

One of the biggest challenges in multimicrophone applications is the estimation of the parameters of the signal model, such as the power spectral densities (PSDs) of the sources, the early (relative) acoustic transfer functions of the sources with respect to the microphones, the PSD of late reverberation, and the PSDs of microphone-self noise.

Low Resource Keyword Search With Synthesized Crosslingual Exemplars

TASLPRO Featured Articles

TASLP Volume 27 Issue 7

The transfer of acoustic data across languages has been shown to improve keyword search (KWS) performance in data-scarce settings. In this paper, we propose a way of performing this transfer that reduces the impact of the prevalence of out-of-vocabulary (OOV) terms on KWS in such a setting.

Subjective and Objective Assessment of Monaural and Binaural Aspects of Audio Quality

TASLPRO Featured Articles

TASLP Volume 27 Issue 7

Recently, the binaural auditory-model-based quality prediction (BAM-Q) was successfully applied to predict binaural audio quality degradations, while the generalized power-spectrum model for quality (GPSM q ) has been demonstrated to account for a large variety of monaural signal distortions.

A Geometric Model for Prediction of Spatial Aliasing in 2.5D Sound Field Synthesis

TASLPRO Featured Articles

TASLP Volume 27 Issue 6

The avoidance of spatial aliasing is a major challenge in the practical implementation of sound field synthesis. Such methods aim at a physically…

GlotNet—A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis

TASLPRO Featured Articles

TASLP Volume 27 Issue 6

Recently, generative neural network models which operate directly on raw audio, such as WaveNet, have improved the state of the art in text-to-speech…