The Multi-Scale Spectral (MSS) loss is widely used for comparing audio signals: it offers a good balance between temporal and spectral resolution while tolerating perceptually irrelevant phase differences between waveforms. However, its configuration, including the window type and size, the hop size, and the magnitude compression, is often chosen empirically, without explicitly considering how these choices affect the behavior of the loss.
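To make the configuration parameters concrete, the following is a minimal NumPy sketch of one common MSS formulation: an L1 distance between STFT magnitudes plus a weighted L1 distance between log-magnitudes, averaged over several window sizes. It is an illustration under stated assumptions, not the configuration studied here. The function names (`stft_magnitude`, `mss_loss`), the default window sizes, the 75% overlap, the `log_weight` term, and the `eps` offset are all hypothetical choices made for the example.

```python
import numpy as np


def stft_magnitude(x, win_size, hop_size, window="hann"):
    """Magnitude STFT of a 1-D signal; returns an array of shape (frames, bins)."""
    x = np.asarray(x, dtype=float)
    if len(x) < win_size:
        raise ValueError("signal is shorter than the analysis window")
    if window == "hann":
        w = np.hanning(win_size)  # symmetric Hann window
    elif window == "rect":
        w = np.ones(win_size)
    else:
        raise ValueError(f"unsupported window type: {window}")
    # Keep only frames that fit entirely inside the signal (no padding).
    n_frames = 1 + (len(x) - win_size) // hop_size
    starts = hop_size * np.arange(n_frames)[:, None]
    idx = starts + np.arange(win_size)[None, :]
    frames = x[idx] * w
    # Taking the magnitude discards the phase of each bin.
    return np.abs(np.fft.rfft(frames, axis=-1))


def mss_loss(x, y, win_sizes=(512, 256, 128), overlap=0.75,
             window="hann", log_weight=1.0, eps=1e-5):
    """Average over scales of linear and log-magnitude L1 spectral distances.

    All defaults are illustrative assumptions, not recommended settings.
    """
    total = 0.0
    for n in win_sizes:
        hop = max(1, int(round(n * (1.0 - overlap))))
        mag_x = stft_magnitude(x, n, hop, window)
        mag_y = stft_magnitude(y, n, hop, window)
        linear_term = np.mean(np.abs(mag_x - mag_y))
        # Log compression; eps avoids log(0) and sets a floor on quiet bins.
        log_term = np.mean(np.abs(np.log(mag_x + eps) - np.log(mag_y + eps)))
        total += linear_term + log_weight * log_term
    return total / len(win_sizes)


if __name__ == "__main__":
    sr = 16000  # assumed sample rate for the demo
    t = np.arange(4096) / sr
    reference = np.sin(2 * np.pi * 440 * t)
    shifted = np.sin(2 * np.pi * 440 * t + 1.0)  # same tone, different phase
    octave = np.sin(2 * np.pi * 880 * t)         # different pitch
    print("identical:    ", mss_loss(reference, reference))
    print("phase-shifted:", mss_loss(reference, shifted, log_weight=0.0))
    print("octave up:    ", mss_loss(reference, octave, log_weight=0.0))
```

With the log term disabled (`log_weight=0.0`), the phase-shifted tone scores much closer to the reference than the tone an octave higher, which reflects the tolerance to phase differences described above. Changing `window`, `win_sizes`, `overlap`, `log_weight`, or `eps` alters how the loss responds to the same pair of signals, which is why these settings deserve explicit consideration.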