Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.

OJSP Volume 4 | 2023

Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval

OJSP Articles

By:

Rintaro Yanagi; Ren Togo; Takahiro Ogawa; Miki Haseyama

Question answering (QA)-based re-ranking methods for cross-modal retrieval have been recently proposed to further narrow down similar candidate images. The conventional QA-based re-ranking methods provide questions to users by analyzing candidate images, and the initial retrieval results are re-ranked based on the user's feedback. Contrary to these developments, only focusing on performance improvement makes it difficult to efficiently elicit the user's retrieval intention. To realize more useful QA-based re-ranking, considering the user interaction for eliciting the user's retrieval intention is required. In this paper, we propose a QA-based re-ranking method with considering two important factors for eliciting the user's retrieval intention: query-image relevance and recallability. Considering the query-image relevance enables to only focus on the candidate images related to the provided query text, while, focusing on the recallability enables users to easily answer the provided question. With these procedures, our method can efficiently and effectively elicit the user's retrieval intention. Experimental results using Microsoft Common Objects in Context and computationally constructed dataset including similar candidate images show that our method can improve the performance of the cross-modal retrieval methods and the QA-based re-ranking methods.

Introduction

Multimedia information, especially images, has become familiar with the recent spread of wearable cameras and smartphones. We frequently record our lives as images, and the opportunity for sharing these depicted images has been increasing [1]. On the other hand, with these opportunities, manually managing and finding images on personal devices becomes taking a lot of effort [2]. Recently, to support such a situation, cross-modal retrieval methods that use a text as a query have been proposed as an effective image retrieval method [3], [4], [5], [6], [7], [8]. Since we use texts in our daily life, using them as the query is convenient and has a wide range of applications [9]. Specifically, the cross-modal retrieval methods embed the provided text query and each candidate image in a shared space, and the embedded features are used for retrieving the relevant images. By especially focusing on the refinement of the embedding procedures, the conventional methods have improved the image retrieval performance.

Read on IEEE Xplore

Tags:

IEEE OJSP Article

SPS Social Media

IEEE SPS Facebook Page https://www.facebook.com/ieeeSPS
IEEE SPS X Page https://x.com/IEEEsps
IEEE SPS Instagram Page https://www.instagram.com/ieeesps/?hl=en
IEEE SPS LinkedIn Page https://www.linkedin.com/company/ieeesps/
IEEE SPS YouTube Channel https://www.youtube.com/ieeeSPS

IEEE SPS Educational Resources

IEEE SPS Resource Center

IEEE SPS YouTube Channel

© Copyright 2025 IEEE - All rights reserved. Use of this website signifies your agreement to the IEEE Terms and Conditions.
A public charity, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.

ISBI_Logo_CFP.jpg

Call for Proposals: 2028 IEEE International Symposium on Biomedical Imaging (ISBI)

ICASSP 2026

Call for Papers for ICASSP 2026 Now Open!

ICASSP 2026

Call for Papers for ICASSP 2026 Now Open!

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval

Open Journal of Signal Processing

Publications & Resources

For Authors

ICASSP 2026

ICASSP 2026

general_get_involved_tc_article_full.jpg

Top Reasons to Join SPS Today!

Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval

Introduction

SPS Social Media

IEEE SPS Educational Resources

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval

Search form

You are here

Open Journal of Signal Processing

Publications & Resources

For Authors

Top Reasons to Join SPS Today!

Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval

Introduction

SPS Social Media

IEEE SPS Educational Resources