IEEE Transactions on Multimedia

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.

Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching

TMM Volume 25 | 2023

TMM Articles

Image-text matching, as a fundamental cross-modal task, bridges the gap between vision and language. The core is to accurately learn semantic alignment to find relevant shared semantics in image and text. Existing methods typically attend to all fragments with word-region similarity greater than empirical threshold zero as relevant shared semantics, e.g. , via a ReLU operation that forces the negative to zero and maintains the positive.

Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement

TMM Volume 25 | 2023

TMM Articles

Recent advances in unsupervised domain adaptation (UDA) techniques have witnessed great success in cross-domain computer vision tasks, enhancing the generalization ability of data-driven deep learning architectures by bridging the domain distribution gaps.

Block Division Convolutional Network With Implicit Deep Features Augmentation for Micro-Expression Recognition

TMM Volume 25 | 2023

TMM Articles

Despite the development of computer vision techniques, the micro-expression (ME) recognition task still remains a great challenge because MEs have very low intensity and short duration. However, the ME recognition is of great significance since it provides important clues for real affective states detection. This paper proposes a novel Block Division Convolutional Network (BDCNN) with the implicit deep features augmentation.

Deep Margin-Sensitive Representation Learning for Cross-Domain Facial Expression Recognition

TMM Volume 25 | 2023

TMM Articles

Cross-domain Facial Expression Recognition (FER) aims to safely transfer the learned knowledge from labeled source data to unlabeled target data, which is challenging due to the subtle difference between various expressions and the large discrepancy between domains. Existing methods mainly focus on reducing the domain shift for transferable features but fail to learn discriminative representations for recognizing facial expression, which may result in negative transfer under cross-domain settings.

3D Holoscopic Image Compression Based on Gaussian Mixture Model

TMM Volume 25 | 2023

TMM Articles

We introduce a Gaussian Mixture Model (GMM) framework for 3D holoscopic image compression in this paper. The elemental-images of the 3D holoscopic image are predicted using GMM and the parameters of GMM are estimated using the common Expectation-Maximization (EM) algorithm. GMM Model Optimization (GMO) is used in this framework to select the optimal number of distributions and avoid local optimum of EM at the same time.

Fast Human Pose Estimation in Compressed Videos

TMM Volume 25 | 2023

TMM Articles

Current approaches for human pose estimation in videos can be categorized into per-frame and warping-based methods. Both approaches have their pros and cons. For example, per-frame methods are generally more accurate, but they are often slow. Warping-based approaches are more efficient, but the performance is usually not good. To bridge the gap, in this paper, we propose a novel fast framework for human pose estimation to meet the real-time inference with controllable accuracy degradation in compressed video domain.

01 Jul

TMM_SI_2.jpg

Special Issue Deadlines

IEEE TMM Special Issue on When Multimedia Meets Food: Multimedia Computing for Food Data Analysis and Applications

Manuscript Due: August 1, 2023
Publication Date: TBD

30 Jan

TMM_SI_2.jpg

Special Issue Deadlines

IEEE TMM Special Issue on Pre-trained Models for Multi-modality Understanding

Manuscript Due: 30 January 2023
Publication Date: 30 September 2023
CFP Document

Raw Image Deblurring

TMM Volume 24 | 2022

TMM Articles

Deep learning-based blind image deblurring plays an essential role in solving image blur since all existing kernels are limited in modeling the real world blur. Thus far, researchers focus on powerful models to handle the deblurring problem and achieve decent results. For this work, in a new aspect, we discover the great opportunity for image enhancement (e.g., deblurring) directly from RAW images and investigate novel neural network structures benefiting RAW-based learning.

Correlation Graph Convolutional Network for Pedestrian Attribute Recognition

TMM Volume 24 | 2022

TMM Articles

The pedestrian attribute recognition aims at generating the structured description of pedestrian, which plays an important role in surveillance. However, it is difficult to achieve accurate recognition results due to diverse illumination, partial body occlusion and limited resolutions. Therefore, this paper proposes a comprehensive relationship framework for comprehensively describing and utilizing relations among attributes, describing different type of relations in the same dimension, and implementing complex transfers of relations in a GCN manner.

Pages

SPS Social Media

IEEE SPS Facebook Page https://www.facebook.com/ieeeSPS
IEEE SPS X Page https://x.com/IEEEsps
IEEE SPS Instagram Page https://www.instagram.com/ieeesps/?hl=en
IEEE SPS LinkedIn Page https://www.linkedin.com/company/ieeesps/
IEEE SPS YouTube Channel https://www.youtube.com/ieeeSPS

IEEE SPS Educational Resources

IEEE SPS Resource Center

IEEE SPS YouTube Channel

© Copyright 2025 IEEE - All rights reserved. Use of this website signifies your agreement to the IEEE Terms and Conditions.
A public charity, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.

MLSP-2027.jpg

2027 IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2027)

ISPA-2025.jpg

2025 14th International Symposium on Image and Signal Processing and Analysis (ISPA)

ASILOMAR.jpg

2025 59th Asilomar Conference on Signals, Systems, and Computers

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

IEEE Transactions on Multimedia

Top Reasons to Join SPS Today!

Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching

Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement

Block Division Convolutional Network With Implicit Deep Features Augmentation for Micro-Expression Recognition

Deep Margin-Sensitive Representation Learning for Cross-Domain Facial Expression Recognition

3D Holoscopic Image Compression Based on Gaussian Mixture Model

Fast Human Pose Estimation in Compressed Videos

TMM_SI_2.jpg

IEEE TMM Special Issue on When Multimedia Meets Food: Multimedia Computing for Food Data Analysis and Applications

TMM_SI_2.jpg

IEEE TMM Special Issue on Pre-trained Models for Multi-modality Understanding

Raw Image Deblurring

Correlation Graph Convolutional Network for Pedestrian Attribute Recognition

Pages

SPS Social Media

IEEE SPS Educational Resources

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

IEEE Transactions on Multimedia

Search form

You are here

Top Reasons to Join SPS Today!

Pages

SPS Social Media

IEEE SPS Educational Resources