IEEE TIP Article

In the domain of 3D Human Pose Estimation, which finds widespread daily applications, the requirement for convenient acquisition equipment continues to grow. To satisfy this demand, we focus on a short-baseline binocular setup that offers both portability and a geometric measurement capability that significantly reduces depth ambiguity.

GeodesicPSIM: Predicting the Quality of Static Mesh With Texture Map via Geodesic Patch Similarity

TIP Volume 34 | 2025

TIP Articles

Static meshes with texture maps have attracted considerable attention in both industrial manufacturing and academic research, leading to an urgent requirement for effective and robust objective quality evaluation. However, current model-based static mesh quality metrics (i.e., metrics that directly use the raw data of the static mesh to extract features and predict the quality) have obvious limitations: most of them only consider geometry information, while color information is ignored, and they have strict constraints for the meshes’ geometrical topology.

PTH-Net: Dynamic Facial Expression Recognition Without Face Detection and Alignment

TIP Volume 34 | 2025

TIP Articles

Pyramid Temporal Hierarchy Network (PTH-Net) is a new paradigm for dynamic facial expression recognition, applied directly to raw videos, without face detection and alignment. Unlike the traditional paradigm, which focus only on facial areas and often overlooks valuable information like body movements, PTH-Net preserves more critical information.

Saliency Segmentation Oriented Deep Image Compression With Novel Bit Allocation

TIP Volume 34 | 2025

TIP Articles

Image compression distortion can cause performance degradation of machine analysis tasks, therefore recent years have witnessed fast progress in developing deep image compression methods optimized for machine perception. However, the investigation still lacks for saliency segmentation. First, in this paper we propose a deep compression network increasing local signal fidelity of important image pixels for saliency segmentation, which is different from existing methods utilizing the analysis network loss for backward propagation.

A Study of Subjective and Objective Quality Assessment of HDR Videos

TIP Volume 33 | 2024

TIP Articles

As compared to standard dynamic range (SDR) videos, high dynamic range (HDR) content is able to represent and display much wider and more accurate ranges of brightness and color, leading to more engaging and enjoyable visual experiences. HDR also implies increases in data volume, further challenging existing limits on bandwidth consumption and on the quality of delivered content.

Robust Remote Photoplethysmography Estimation With Environmental Noise Disentanglement

TIP Volume 33 | 2024

TIP Articles

Remote Photoplethysmography (rPPG) has been attracting increasing attention due to its potential in a wide range of application scenarios such as physical training, clinical monitoring, and face anti-spoofing. On top of conventional solutions, deep-learning approach starts to dominate in rPPG estimation and achieves top-level performance.

A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding

TIP Volume 33 | 2024

TIP Articles

Cross-component prediction is an important intra-prediction tool in the modern video coders. Existing prediction methods to exploit cross-component correlation include cross-component linear model and its extension of multi-model linear model. These models are designed for camera captured content. For screen content coding, where videos exhibit different signal characteristics, a cross-component prediction model tailored to their characteristics is desirable.

Dynamic Dense Graph Convolutional Network for Skeleton-Based Human Motion Prediction

TIP Volume 33 | 2024

TIP Articles

Graph Convolutional Networks (GCN) which typically follows a neural message passing framework to model dependencies among skeletal joints has achieved high success in skeleton-based human motion prediction task. Nevertheless, how to construct a graph from a skeleton sequence and how to perform message passing on the graph are still open problems, which severely affect the performance of GCN.

Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images

TIP Volume 32 | 2023

TIP Articles

As an important yet challenging task in Earth observation, change detection (CD) is undergoing a technological revolution, given the broadening application of deep learning. Nevertheless, existing deep learning-based CD methods still suffer from two salient issues: 1) incomplete temporal modeling, and 2) space-time coupling. In view of these issues, we propose a more explicit and sophisticated modeling of time and accordingly establish a pair-to-video change detection (P2V-CD) framework. First, a pseudo transition video that carries rich temporal information is constructed from the input image pair, interpreting CD as a problem of video understanding.

State-Aware Compositional Learning Toward Unbiased Training for Scene Graph Generation

TIP Volume 32 | 2023

TIP Articles

How to avoid biased predictions is an important and active research question in scene graph generation (SGG). Current state-of-the-art methods employ debiasing techniques such as resampling and causality analysis. However, the role of intrinsic cues in the features causing biased training has remained under-explored. In this paper, for the first time, we make the surprising observation that object identity information, in the form of object label embeddings (e.g. GLOVE), is principally responsible for biased predictions.

webinar_cube.jpg

SPS BSI Webinar: NeuroAI: From HoloBrain to HoloGraph

close-up-of-fiber-optic-cables-2024-11-03-07-51-25-utc.jpg

Waveforms for Computing Over the Air: A groundbreaking approach that redefines data aggregation

book-background-old-books-in-the-library-bookshe-2025-03-10-11-04-10-utc.jpg

Ode to Masterfully Written Textbooks: And remembering Simon Haykin [From the Editor]

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

IEEE TIP Article

Top Reasons to Join SPS Today!

IEEE TIP Article

RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation With Occlusion Handling

GeodesicPSIM: Predicting the Quality of Static Mesh With Texture Map via Geodesic Patch Similarity

PTH-Net: Dynamic Facial Expression Recognition Without Face Detection and Alignment

Saliency Segmentation Oriented Deep Image Compression With Novel Bit Allocation

A Study of Subjective and Objective Quality Assessment of HDR Videos

Robust Remote Photoplethysmography Estimation With Environmental Noise Disentanglement

A Discrete-Mapping-Based Cross-Component Prediction Paradigm for Screen Content Coding

Dynamic Dense Graph Convolutional Network for Skeleton-Based Human Motion Prediction

Transition Is a Process: Pair-to-Video Change Detection Networks for Very High Resolution Remote Sensing Images

State-Aware Compositional Learning Toward Unbiased Training for Scene Graph Generation

Pages

SPS Social Media

IEEE SPS Educational Resources

Waveforms for Computing Over the Air: A groundbreaking approach that redefines data aggregation

Ode to Masterfully Written Textbooks: And remembering Simon Haykin [From the Editor]

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

IEEE TIP Article

Search form

You are here

Top Reasons to Join SPS Today!

IEEE TIP Article

Pages

SPS Social Media

IEEE SPS Educational Resources