IEEE SPL Article

You are here

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.

IEEE SPL Article

Recent years have witnessed remarkable success of Graph Fourier Transform (GFT) in point cloud attribute compression. Existing researches mainly utilize geometry distance to define graph structure for coding attribute (e.g., color), which may distribute high weights to the edges connecting points across texture boundaries. 

This paper addresses the target localization problem using time-of-arrival (TOA)-based technique under the non-line-of-sight (NLOS) environment. To alleviate the adverse effect of the NLOS error on localization, a total least square framework integrated with a regularization term (RTLS) is utilized, and with which the localization problem can get rid of the ill-posed issue. However, it is challenging to figure out the exact solution for the considered localization problem.

Deep neural networks in deep learning have been widely demonstrated to have higher accuracy and distinct advantages over traditional machine learning methods in extracting data features. While convolutional neural networks (CNNs) have shown great success in feature extraction and audio classification, it is important to note that real-time audios are dependent on previous scenes. Also, the main drawback of deep learning algorithms is that they need a huge number of datasets to indicate their efficient performance.

In this letter, we propose a novel linguistic steganographic method that directly conceals a token-level secret message in a seemingly-natural steganographic text generated by the off-the-shelf BERT model equipped with Gibbs sampling. Compared with all modification based linguistic steganographic methods, the proposed method does not modify a given cover text. Instead, the proposed method utilizes the secret message to directly generate the steganographic text.

Discriminative correlation filter (DCF)-based methods applied for UAV object tracking have received widespread attention due to their high efficiency. However, these methods are usually troubled by the boundary effect. Besides, the violent environment variations severely confuse trackers that neglect temporal environmental changes among consecutive frames, leading to unwanted tracking drift. In this letter, we propose a novel DCF-based tracking method to promote the insensitivity of the tracker under uncertain environmental changes.

Audio-guided face reenactment aims to generate authentic target faces that have matched facial expression of the input audio, and many learning-based methods have successfully achieved this. However, most methods can only reenact a particular person once trained or suffer from the low-quality generation of the target images. Also, nearly none of the current reenactment works consider the model size and running speed that are important for practical use.

In this paper, we propose an enhancing steganographic scheme by random generation and ensemble stego selection. Different from existing steganography that only focuses on distortion function designing, our scheme considers both distortion model and optimized stego generation. In specific, for given cover, we firstly train an universal steganalyzer to calculate its gradient map, which is referenced to randomly adjust cost distribution of this cover. 

Visual place recognition is one of the essential and challenging problems in the fields of robotics. In this letter, we for the first time explore the use of multi-modal fusion of semantic and visual modalities in dynamics-invariant space to improve place recognition in dynamic environments. We achieve this by first designing a novel deep learning architecture to generate the static semantic segmentation and recover the static image directly from the corresponding dynamic image. 

In this letter, we consider Bayesian parameterestimation using mixed-resolution data consisting of both analog and 1-bit quantized measurements. We investigate the use of the partially linear minimum mean-squared-error (PL-MMSE) estimator for this mixed-resolution scheme. The use of the PL-MMSE estimator, proposed for general models with “straightforward” and “complicated” parts, has not been demonstrated for quantized data. 

With the wide vision and high flexibility, unmanned aerial vehicle (UAV) has been widely used into object tracking in recent years. However, its limited computing capability poses a great challenges to tracking algorithms. On the other hand, Discriminative Correlation Filter (DCF) based trackers have attracted great attention due to their computational efficiency and superior accuracy. Many studies introduce spatial and temporal regularization into the DCF framework to achieve a more robust appearance model and further enhance the tracking performance. However, such algorithms generally set fixed spatial or temporal regularization parameters, which lack flexibility and adaptability under cluttered and challenging scenarios.


SPS on Twitter

  • On Wednesday, 26 October, join Dr. DeLiang Wang for a new SPS webinar, "Neural Spectrospatial Filter" - register no…
  • Join Dr. Peilan Wang and Dr Jun Fang for "Channel State Information Acquisition for Intelligent Reflecting Surface-…
  • The SPS Webinar Series continues on Monday, 10 October when Dr. Luisa Verdoliva presents "Media Forensics and DeepF…
  • DEADLINE EXTENDED: The IEEE Transactions on Multimedia is accepting submissions for a Special Issue on Point Cloud…
  • Short courses return to ! Register for live and remote sessions, "A Hands-on Approach for Implementing Sto…

SPS Videos

Signal Processing in Home Assistants


Multimedia Forensics

Careers in Signal Processing             


Under the Radar