Get Involved with Technical Committees via the Affiliate TC Membership
The Signal Processing Society (SPS) has 12 Technical Committees and 3 Special Interest Groups (SIGs) that support a broad selection of signal processing-related activities defined by the scope of the Society.
Upcoming Distinguished Lectures
Please visit the Conferences and Events page on the IEEE Signal Processing Society website for Upcoming Lectures by Distinguished Lecturers.
Multi-Correlation Filters With Triangle-Structure Constraints for Object Tracking
Correlation filters (CFs) have been extensively used in tracking tasks due to their high efficiency although most of them regard the tracked target as a whole and are minimally effective in handling partial occlusion. In this study, we incorporate a part-based strategy into the framework of CFs and propose a novel multipart correlation tracker with triangle-structure constraints. Specifically, we train multiple CFs for the global object and local parts, which are then jointly applied to obtain the correlation response of any candidate during tracking.
Self-Learning Super-Resolution Using Convolutional Principal Component Analysis and Random Matching
Self-learning super-resolution (SLSR) algorithms have the advantage of being independent of an external training database. This paper proposes an SLSR algorithm that uses convolutional principal component analysis (CPCA) and random matching. The technologies of CPCA and random matching greatly improve the efficiency of self-learning. There are two main steps in this algorithm: forming the training and testing the data sets and patch matching. In the data set forming step, we propose the CPCA to extract the low-dimensional features of the data set.
An Iterative Image Dehazing Method With Polarization
This paper presents a joint dehazing and denoising scheme for an image taken in hazy conditions. Conventional image dehazing methods may amplify the noise depending on the distance and density of the haze. To suppress the noise and improve the dehazing performance, an imaging model is modified by adding the process of amplifying the noise in hazy conditions. This model offers depth-chromaticity compensation regularization for the transmission map and chromaticity-depth compensation regularization for dehazing the image.
Statistical Model-Based Detector via Texture Weight Map: Application in Re-Sampling Authentication
The problem of authenticating a re-sampled image has been investigated over many years. Currently, however, little research proposes a statistical model-based test, resulting in that statistical performance of the resampling detector could not be completely analyzed. To fill the gap, we utilize a parametric model to expose the traces of resampling forgery, which is described with the distribution of residual noise. Afterward, we propose a statistical model describing the residual noise from a resampled image.
About TMM
The scope of the Periodical is the various aspects of research in multimedia technology and applications of multimedia, including, but not limited to, circuits, networking, signal processing, systems, software, and systems integration, as represented by the Fields of Interest of the sponsors.
A Geometric Model for Prediction of Spatial Aliasing in 2.5D Sound Field Synthesis
The avoidance of spatial aliasing is a major challenge in the practical implementation of sound field synthesis. Such methods aim at a physically accurate reconstruction of a desired sound field inside a target region using a finite ensemble of loudspeakers. In the past, different theoretical treatises of the inherent spatial sampling process led to anti-aliasing criteria for simple loudspeaker array arrangements, e.g., lines and circles, and fundamental sound fields, e.g., plane and spherical waves. Many criteria were independent of the listener's position inside the target region.
GlotNet—A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis
Recently, generative neural network models which operate directly on raw audio, such as WaveNet, have improved the state of the art in text-to-speech synthesis (TTS). Moreover, there is increasing interest in using these models as statistical vocoders for generating speech waveforms from various acoustic features. However, there is also a need to reduce the model complexity, without compromising the synthesis quality.
