Skip to main content

Simon Schwär

Multi-Scale Spectral Loss Revisited

SHARE:
Category
Proficiency
Language
Media Type
Pricing

SPS Members $0.00
IEEE Members $11.00
Non-members $15.00

Authors
Date
The Multi-Scale Spectral (MSS) loss is widely used for comparing audio signals, offering a good balance between temporal and spectral resolution, while allowing for phase differences between waveforms that are perceptually irrelevant. However, the configuration of this loss function, including parameters such as window type and size, hop size, and magnitude compression, is often chosen empirically and without explicitly considering the impact on loss behavior. This is particularly relevant in the context of differentiable digital signal processing (DDSP), where loss gradients are back-propagated through fixed DSP building blocks before they are used as learning signals. In this webinar, the presenter gives an overview of various MSS loss configurations and analyzes the effects of individual loss parameters in detail. Using common DDSP components such as oscillators and filters, they illustrate cases where trade-offs between configuration choices become important. Furthermore, they present examples where the MSS loss fails to provide meaningful gradients entirely and discuss potential workarounds proposed in literature.
Duration
0:54:33
Subtitles

IEEE SPS Education Center FAQs

The IEEE SPS Education Center is your hub for educational resources in signal processing. It offers a variety of materials tailored for students and professionals alike. You can explore content based on your specific interests and skill levels.

Select the program and click on the external link to the IEEE SPS Resource Center.

Educational credits in the form of professional development hours (PDHs) or continuing education units (CEUs) are available on select educational programs.