1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.
The inverse synthetic aperture radar (ISAR) imaging technique of a moving target with sparse sampling data has attracted wide attention due to its ability to reduce the data collection burden. However, traditional low-rank or 2D compressive sensing (CS)-based ISAR imaging methods can handle the random sampling or the separable sampling data only.
In this paper, we investigate beamforming design for cooperative secure transmission in cognitive two-way relay networks, where the cognitive transmitter (CT) with multiple antennas helps to forward the signals of two primary transmitters (PTs) and tries to protect the PTs from wiretapping by a single-antenna eavesdropper.
We consider a decentralized detection network whose aim is to infer a public hypothesis of interest. However, the raw sensor observations also allow the fusion center to infer private hypotheses that we wish to protect. We consider the case where there are an uncountable number of private hypotheses belonging to an uncertainty set, and develop local privacy mappings at every sensor so that the sanitized sensor information minimizes the Bayes error of detecting the public hypothesis at the fusion center while achieving information privacy for all private hypotheses.
In this paper, an agile smart attacker model in spectrum sensing of cognitive radio network (CRN) is introduced. This smart attacker does not make the channel busy all the time, instead it senses spectrum and when a primary user leaves, it occupies the spectrum by mimicking the signal characteristics of the primary users.
Sparsity and low-rank models have been popular for reconstructing images and videos from limited or corrupted measurements. Dictionary or transform learning methods are useful in applications such as denoising, inpainting, and medical image reconstruction.
Good temporal representations are crucial for video understanding, and the state-of-the-art video recognition framework is based on two-stream networks. In such framework, besides the regular ConvNets responsible for RGB frame inputs, a second network is introduced to handle the temporal representation, usually the optical flow (OF).
Three-dimensional (3-D) radar imaging can provide additional information along elevation dimension about the target with respect to the conventional 2-D radar imaging, but usually requires a huge amount of data collected over 3-D frequency-azimuth-elevation space, which motivates us to perform 3-D imaging by using sparsely sampled data. Traditional compressive sensing (CS) based 3-D imaging methods with sparse data convert the 3-D data into a long vector, and then complete the sensing and recovery steps.
Due to the recent situation we decided to extend the deadline for our open position on emotion recognition:
The Signal Processing (SP) research group at the Universität Hamburg in Germany is hiring a Research Associate / PhD student.
WE HAVE AN IMMEDIATE OPENING IN OUR »AUDIO AND MEDIA TECHNOLOGIES« DIVISION OF FRAUNHOFER IIS IN ERLANGEN, GERMANY, FOR A
VIDEO CODING SYSTEM ARCHITECT*
You have a vision about future directions in 2D video coding? You like to design technologies that are ready to conquer the world?
WE HAVE AN IMMEDIATE OPENING IN OUR »AUDIO AND MEDIA TECHNOLOGIES« DIVISION OF FRAUNHOFER IIS IN ERLANGEN, GERMANY, FOR A
VIDEO CODING APPLICATION ENGINEER & PRODUCT MANAGER*
Your passion is to bring innovative technologies into the market? You like communicating to potential customers, identifying their needs and proposing them suitable solutions?
This work presents a method that persuades acoustic reflections to be a favorable property for sound source localization. Whilst most real world spatial audio applications utilize prior knowledge of sound source position, estimating such positions in reverberant environments is still considered to be a difficult problem due to acoustic reflections.
Differential microphone arrays (DMAs) often encounter white noise amplification, especially at low frequencies. If the array geometry and the number of microphones are fixed, one can improve the white noise amplification problem by reducing the DMA order. With the existing differential beamforming methods, the DMA order can only be a positive integer number.
Recurrent neural networks (RNNs) can predict fundamental frequency (F 0 ) for statistical parametric speech synthesis systems, given linguistic features as input. However, these models assume conditional independence between consecutive F 0 values, given the RNN state. In a previous study, we proposed autoregressive (AR) neural F 0 models to capture the causal dependency of successive F 0 values.
This article addresses the problem of distance estimation using binaural hearing aid microphones in reverberant rooms. Among several distance indicators, the direct-to-reverberant energy ratio (DRR) has been shown to be more effective than other features. Therefore, we present two novel approaches to estimate the DRR of binaural signals.
In this letter, we propose a new approach to tracking a target that maneuvers based on the multiple-constant-turns model. Usually, the interactive-multiple-model (IMM) algorithm based on the extended Kalman filter (IMM-EKF) is employed for this problem with successful tracking performance.
Developing Semi-Supervised Seq2Seq (S4) learning for sequence transduction tasks in natural language processing (NLP), e.g. semantic parsing, is challenging, since both the input and the output sequences are discrete. This discrete nature makes trouble for methods which need gradients either from the input space or from the output space.
Utilizing a human-perception-related objective function to train a speech enhancement model has become a popular topic recently. The main reason is that the conventional mean squared error (MSE) loss cannot represent auditory perception well. One of the typical human-perception-related metrics, which is the perceptual evaluation of speech quality (PESQ), has been proven to provide a high correlation to the quality scores rated by humans.
Speech analysis could provide an indicator of Alzheimer's disease and help develop clinical tools for automatically detecting and monitoring disease progression. While previous studies have employed acoustic (speech) features for characterisation of Alzheimer's dementia, these studies focused on a few common prosodic features, often in combination with lexical and syntactic features which require transcription.
Clinical literature provides convincing evidence that language deficits in Alzheimer's disease (AD) allow for distinguishing patients with dementia from healthy subjects. Currently, computational approaches have widely investigated lexicosemantic aspects of discourse production, while pragmatic aspects like cohesion and coherence, are still mostly unexplored.
Obstructive sleep apnea (OSA) is a sleep disorder in which pharyngeal collapse during sleep causes complete (apnea) or partial (hypopnea) airway obstruction. OSA is common and can have severe implications, but often remains undiagnosed. The most widely used objective measure of OSA severity is the number of obstructive events per hour of sleep, known as the apnea-hypopnea index (AHI).