Skip to main content

Automatic Leaderboard: Evaluation of Singing Quality Without a Standard Reference

Automatic evaluation of singing quality can be done with the help of a reference singing or the digital sheet music of the song. However, such a standard reference is not always available. In this article, we propose a framework to rank a large pool of singers according to their singing quality without any standard reference.

Rate-Constrained Noise Reduction in Wireless Acoustic Sensor Networks

Wireless acoustic sensor networks (WASNs) can be used for centralized multi-microphone noise reduction, where the processing is done in a fusion center (FC). To perform the noise reduction, the data needs to be transmitted to the FC. Considering the limited battery life of the devices in a WASN, the total data rate at which the FC can communicate with the different network devices should be constrained.

Distortion-Adaptive Salient Object Detection in 360∘ Omnidirectional Images

Panoramic videos are becoming more and more easily obtained for common users. Although these videos have 360 field of view, they are usually displayed with perspective views, which needs the saliency informations for viewing angle selection. In this paper, we propose a saliency prediction network for 360 videos. Our network takes video frames and optical flows in cube map format as input, thus it does not suffer from image distorations of panoramic frames. 

Saliency Prediction Network for 360∘ Videos

Panoramic videos are becoming more and more easily obtained for common users. Although these videos have 360 field of view, they are usually displayed with perspective views, which needs the saliency informations for viewing angle selection. In this paper, we propose a saliency prediction network for 360 videos. Our network takes video frames and optical flows in cube map format as input, thus it does not suffer from image distorations of panoramic frames. 

State-of-the-Art in 360° Video/Image Processing: Perception, Assessment and Compression

Nowadays, 360° video/image has been increasingly popular and drawn great attention. The spherical viewing range of 360° video/image accounts for huge data, which pose the challenges to 360° video/image processing in solving the bottleneck of storage, transmission, etc. Accordingly, the recent years have witnessed the explosive emergence of works on 360° video/image processing.

Introduction to the Issue on Perception-Driven 360° Video Processing

Recent years have witnessed the rapid development of virtual reality (VR). Above 90% of VR content is in the form of 360° video, also called omnidirectional video or panoramic video. Generally speaking, 360° video offers immersive and interactive viewing experience, as the viewers are able to freely move their heads in the range of 360° × 180° to access different viewports.

Adaptive Reverberation Absorption Using Non-Stationary Masking Components Detection for Intelligibility Improvement

This letter proposes a new time domain absorption approach designed to reduce masking components of speech signals under noisy-reverberant conditions. In this method, the non-stationarity of corrupted signal segments is used to detect masking distortions based on a defined threshold.