Improving and Unfolding Statistical Models of Nonstationary Signals

You are here

Inside Signal Processing Newsletter Home Page

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.

News and Resources for Members of the IEEE Signal Processing Society

Improving and Unfolding Statistical Models of Nonstationary Signals

Wisdom, Scott Thomas

Advisor: Atlas Les, Pitton, James

Improving the modeling and processing of nonstationary signals remains an important yet challenging problem. In the past, the most effective approach for processing these signals has been statistical modeling. Statistical models can effectively encode domain knowledge and lead to principled algorithms for the fundamental tasks of enhancement, detection, and classification. However, the performance of statistical models can be limited because they inherently make assumptions about the distribution of the data. Deep neural networks, in contrast, have recently outperformed state-of-the-art statistical models of nonstationary signals. Deep neural networks are completely data-driven, and learn to set their parameters by training on large datasets that are assumed to match the distribution of the data.

This dissertation follows two approaches for improving modeling and processing of nonstationary signals. The first approach examines conventional model assumptions and suggests improvements that lead to improved performance for processing nonstationary signals. Specifically, noncircular distributions of the complex-valued short-time Fourier transform are shown to improve detection of realistic nonstationary signals. Then the parameterization of a recently-proposed recurrent neural network for processing nonstationary signals is reexamined. By using an optimization method that preserves the capacity of the recurrence matrix, superior performance is achieved on a battery of benchmarks that test the ability of recurrent neural networks to process nonstationary signals.

The second approach uses the recently-proposed framework of deep unfolding, which provides a principled means of transforming statistical model inference algorithms into deep networks. This dissertation expands the deep unfolding framework specifically for nonstationary signals. Using this framework, a model-based explanation is provided for state-of-the-art recurrent neural architectures, including gated recurrent unit and unitary recurrent neural networks. Additionally, deep unfolding results in deep network architectures that arise in principled ways from statistical model assumptions. This statistical model foundation provides initializations for the unfolded networks, which lead to better generalization, faster training, and competitive or superior performance on a variety of tasks, including single- and multichannel acoustic source separation and classification of acoustic signals.


IEEE SPS Educational Resources

IEEE SPS Resource Center

IEEE SPS YouTube Channel