Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.

TMM Volume 23 | 2021

Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval

TMM Featured Articles

By:

Xu Lu; Li Liu; Liqiang Nie; Xiaojun Chang; Huaxiang Zhang

Multi-modal hashing focuses on fusing different modalities and exploring the complementarity of heterogeneous multi-modal data for compact hash learning. However, existing multi-modal hashing methods still suffer from several problems, including: 1) Almost all existing methods generate unexplainable hash codes. They roughly assume that the contribution of each hash code bit to the retrieval results is the same, ignoring the discriminative information embedded in hash learning and semantic similarity in hash retrieval. Moreover, the length of hash code is empirically set, which will cause bit redundancy and affect retrieval accuracy. 2) Most existing methods exploit shallow models which fail to fully capture higher-level correlation of multi-modal data. 3) Most existing methods adopt online hashing strategy based on immutable direct projection, which generates query codes for new samples without considering the differences of semantic categories. In this paper, we propose a Semantic-driven Interpretable Deep Multi-modal Hashing (SIDMH) method to generate interpretable hash codes driven by semantic categories within a deep hashing architecture, which can solve all these three problems in an integrated model. The main contributions are: 1) A novel deep multi-modal hashing network is developed to progressively extract hidden representations of heterogeneous modality features and deeply exploit the complementarity of multi-modal data. 2) Learning interpretable hash codes, with discriminant information of different categories distinctively embedded into hash codes and their different impacts on hash retrieval intuitively explained. Besides, the code length depends on the number of categories in the dataset, which can reduce the bit redundancy and improve the retrieval accuracy. 3) The semantic-driven online hashing strategy encodes the significant branches and discards the negligible branches of each query sample according to the semantics contained in it, therefore it co...

Read on IEEE Xplore

Tags:

IEEE TMM Article

SPS Social Media

IEEE SPS Facebook Page https://www.facebook.com/ieeeSPS
IEEE SPS X Page https://x.com/IEEEsps
IEEE SPS Instagram Page https://www.instagram.com/ieeesps/?hl=en
IEEE SPS LinkedIn Page https://www.linkedin.com/company/ieeesps/
IEEE SPS YouTube Channel https://www.youtube.com/ieeeSPS

IEEE SPS Educational Resources

IEEE SPS Resource Center

IEEE SPS YouTube Channel

© Copyright 2025 IEEE - All rights reserved. Use of this website signifies your agreement to the IEEE Terms and Conditions.
A public charity, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.

webinar_cube.jpg

SPS BSI Webinar: NeuroAI: From HoloBrain to HoloGraph

close-up-of-fiber-optic-cables-2024-11-03-07-51-25-utc.jpg

Waveforms for Computing Over the Air: A groundbreaking approach that redefines data aggregation

book-background-old-books-in-the-library-bookshe-2025-03-10-11-04-10-utc.jpg

Ode to Masterfully Written Textbooks: And remembering Simon Haykin [From the Editor]

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval

Transactions on Multimedia

Publications & Resources

For Authors

Meet the Candidates (870 x 350 px).png

Election_Results.jpg

TMM.png

Top Reasons to Join SPS Today!

Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval

SPS Social Media

IEEE SPS Educational Resources

webinar_cube.jpg

SPS BSI Webinar: NeuroAI: From HoloBrain to HoloGraph

close-up-of-fiber-optic-cables-2024-11-03-07-51-25-utc.jpg

Waveforms for Computing Over the Air: A groundbreaking approach that redefines data aggregation

book-background-old-books-in-the-library-bookshe-2025-03-10-11-04-10-utc.jpg

Ode to Masterfully Written Textbooks: And remembering Simon Haykin [From the Editor]

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval

Search form

You are here

Transactions on Multimedia

Publications & Resources

For Authors

Meet the Candidates (870 x 350 px).png

Election_Results.jpg

TMM.png

Top Reasons to Join SPS Today!

Semantic-Driven Interpretable Deep Multi-Modal Hashing for Large-Scale Multimedia Retrieval

SPS Social Media

IEEE SPS Educational Resources