1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.
Visual cues such as lip movements, when available, play an important role in speech communication. They are especially helpful for the hearing impaired population or in noisy environments. When not available, having a system to automatically generate talking faces in sync with input speech would enhance speech communication and enable many novel applications.
Automatic evaluation of singing quality can be done with the help of a reference singing or the digital sheet music of the song. However, such a standard reference is not always available. In this article, we propose a framework to rank a large pool of singers according to their singing quality without any standard reference.
Wireless acoustic sensor networks (WASNs) can be used for centralized multi-microphone noise reduction, where the processing is done in a fusion center (FC). To perform the noise reduction, the data needs to be transmitted to the FC. Considering the limited battery life of the devices in a WASN, the total data rate at which the FC can communicate with the different network devices should be constrained.
Panoramic videos are becoming more and more easily obtained for common users. Although these videos have
Panoramic videos are becoming more and more easily obtained for common users. Although these videos have
Nowadays, 360° video/image has been increasingly popular and drawn great attention. The spherical viewing range of 360° video/image accounts for huge data, which pose the challenges to 360° video/image processing in solving the bottleneck of storage, transmission, etc. Accordingly, the recent years have witnessed the explosive emergence of works on 360° video/image processing.
Recent years have witnessed the rapid development of virtual reality (VR). Above 90% of VR content is in the form of 360° video, also called omnidirectional video or panoramic video. Generally speaking, 360° video offers immersive and interactive viewing experience, as the viewers are able to freely move their heads in the range of 360° × 180° to access different viewports.
This correspondence proposes the use of a real-only equalizer (ROE), which acts on real signals derived from the received offset quadrature amplitude modulation (OQAM) symbols. For the same fading channel, we prove that both ROE and the widely linear equalizer (WLE) yield equivalent outputs.
This letter presents a high resolution method which separates close components of a multi-component linear frequency modulated (LFM) signal and eliminates their Cross-Terms (CTs). We first investigate the energy distribution of the Auto-Terms (ATs) and CTs in ambiguity plane.
This letter proposes a new time domain absorption approach designed to reduce masking components of speech signals under noisy-reverberant conditions. In this method, the non-stationarity of corrupted signal segments is used to detect masking distortions based on a defined threshold.
Date: 6-11 April 2025
Location: Hyderabad, India
Date: 14-19 April 2024
Location: Seoul, Korea
Date: 8-11 October 2023
Location: Kuala Lumpur, Malaysia
Date: 4-10 June 2023
Location: Rhodes Island, Greece