1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.
Context
Date: 21 January 2025
Time: 10:00 AM ET (New York time)
Presenter(s): Boris Ivanovic
Date: 5 February 2025
Time: 9:30 AM ET (New York Time)
Speaker(s): Dr. Sinan Gezici
This is our sixth and final issue of 2024. It is hard to believe that a year has gone by since our term as the new editorial team started in January. In our first year, in addition to our usual array of technical overviews and Society news, we addressed a number of topics of significance for our community in the hopes of starting a discussion.
I am writing this short note as I am about to board a plane to Abu Dhabi to join those of you who are attending the 2024 edition of the International Conference on Image Processing (ICIP 2024). The team organizing ICIP 2024 has put together an outstanding technical program that includes world-class plenary speakers discussing research and industrial trends.
Multichannel acoustic signal processing is a well-established and powerful tool to exploit the spatial diversity between a target signal and nontarget or noise sources for signal enhancement. However, the textbook solutions for optimal data-dependent spatial filtering rest on the knowledge of second-order statistical moments of the signals, which have traditionally been difficult to acquire.
“All models are wrong, but some are useful” - understanding “models” as analytical mathematical models, this aphorism, originating from George Box in 1976, motivates the synthesis of model-based and data-driven audio signal processing as the leitmotif of this special issue.
Multichannel acoustic signal processing is a well-established and powerful tool to exploit the spatial diversity between a target signal and nontarget or noise sources for signal enhancement. However, the textbook solutions for optimal data-dependent spatial filtering rest on the knowledge of second-order statistical moments of the signals, which have traditionally been difficult to acquire.
Communication base stations can achieve high-precision tracking and accurate classification for multiple extended targets in the context of integrated communication and sensing by transmitting wideband signal. However, the time resources of the base stations are often limited. In the time-division operation mode, part of the time resources must be reserved to guarantee communication performance, while the rest of the resources must be properly allocated for better multi-target sensing performance.
We consider a least absolute deviation (LAD) approach to the robust phase retrieval problem that aims to recover a signal from its absolute measurements corrupted with sparse noise. To solve the resulting non-convex optimization problem, we propose a robust alternating minimization (Robust-AM) derived as an unconstrained Gauss-Newton method.
This paper explores constrained non-convex personalized federated learning (PFL), in which a group of workers train local models and a global model, under the coordination of a server. To address the challenges of efficient information exchange and robustness against the so-called Byzantine workers, we propose a projected stochastic gradient descent algorithm for PFL that simultaneously ensures Byzantine-robustness and communication efficiency.
Steganography is the art of covert communication that pursues the secrecy of concealment. In adaptive steganography, the most commonly used framework of steganography, the sender embeds a “secret message” signal within another “cover” signal with respect to a certain adaptive distortion function that measures the distortion incurred, contributing to the composite “stego” signal that resembles the cover, and the receiver extracts the “secret message” signal from the stego.
This article proposes a distributed time-varying optimization approach to address the dynamic resource allocation problem, leveraging a sliding mode technique. The algorithm integrates a fixed-time sliding mode component to ensure that the global equality constraints are met, and is coupled with a fixed-time distributed control mechanism involving the nonsmooth consensus idea for attaining the system's optimal state.
In this paper, a memory-enhanced distributed accelerated algorithm is proposed for solving large-scale systems of linear equations within the context of multi-agent systems. By employing a local predictor consisting of a linear combination of the nodes' current and previous values, the inclusion of two memory taps can be characterized such that the convergence of the distributed solution algorithm for coordinated computation is accelerated.
Deep matrix factorization (DMF) has the capability to discover hierarchical structures within raw data by factorizing matrices layer by layer, allowing it to utilize latent information for superior clustering performance. However, DMF-based approaches face limitations when dealing with complex and nonlinear raw data.
Graph Neural Networks (GNNs) have shown great promise in modeling relationships between nodes in a graph, but capturing higher-order relationships remains a challenge for large-scale networks. Previous studies have primarily attempted to utilize the information from higher-order neighbors in the graph, involving the incorporation of powers of the shift operator, such as the graph Laplacian or adjacency matrix.
In the domain of 3D Human Pose Estimation, which finds widespread daily applications, the requirement for convenient acquisition equipment continues to grow. To satisfy this demand, we focus on a short-baseline binocular setup that offers both portability and a geometric measurement capability that significantly reduces depth ambiguity.
Static meshes with texture maps have attracted considerable attention in both industrial manufacturing and academic research, leading to an urgent requirement for effective and robust objective quality evaluation. However, current model-based static mesh quality metrics (i.e., metrics that directly use the raw data of the static mesh to extract features and predict the quality) have obvious limitations: most of them only consider geometry information, while color information is ignored, and they have strict constraints for the meshes’ geometrical topology.
Pyramid Temporal Hierarchy Network (PTH-Net) is a new paradigm for dynamic facial expression recognition, applied directly to raw videos, without face detection and alignment. Unlike the traditional paradigm, which focus only on facial areas and often overlooks valuable information like body movements, PTH-Net preserves more critical information.