Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection

TIP Volume 32 | 2023

By:

Fangfang Wang; Xiaogang Xu; Yifeng Chen; Xi Li

To robustly detect arbitrary-shaped scene texts, bottom-up methods are widely explored for their flexibility. Because scene texts have highly homogeneous texture and are often densely cluttered, it is nontrivial for segmentation-based methods to discover the separatrixes between adjacent instances. To separate nearby texts, many methods adopt a seed expansion strategy that segments shrunken text regions as seed areas and then iteratively expands the seed areas into intact text regions. In search of a more straightforward approach that does not rely on seed area segmentation and avoids the error accumulation that iterative processing can introduce, we propose a redundancy removal strategy. In this work, we directly explore two types of fuzzy semantics, text and separatrix, that do not possess specific boundaries, and separate cluttered instances by excluding the separatrix pixels from text regions. To deal with the fuzzy semantic boundaries, we also conduct reliability analysis in both the optimization and inference stages to suppress false positive pixels at ambiguous locations. Experiments on benchmark datasets demonstrate the effectiveness of our method.

Arbitrary-shaped scene text detection aims to accurately locate tight text regions of arbitrary shapes in natural scene images. It has a wide range of applications, such as text recognition, scene parsing and autonomous driving. The main challenge of robust scene text detection lies in the complex appearance of texts, such as arbitrary shape, skewed viewpoint and large aspect ratio.

To deal with arbitrary shapes, mainstream methods seek bottom-up solutions for their flexibility and treat text detection as a segmentation problem. However, as mixtures of stroke and background pixels, text regions are highly homogeneous textures that do not possess natural and clear boundaries. Besides, as shown in Figure 1 (a), scene texts are often densely cluttered and sometimes even contiguous due to the coarse polygon annotations. Thus, effectively separating cluttered instances becomes the most intractable problem in segmentation-based methods. False positive pixels along the instance separatrix areas often merge adjacent instances, which can dramatically affect the detection results even though these pixels make up only a very small proportion of the whole image. A typical solution is two-stage processing [1], [2], [3], which avoids directly discovering text separatrixes. These methods first segment shrunken text regions to find separated instance seeds, and then expand the seed areas iteratively and exhaustively to recover the intact text regions. Though this seed area extraction and iterative region expansion strategy can help separate cluttered instances, its performance relies heavily on the seed area segmentation, and errors may accumulate throughout the iterative expansion procedure. Given that, we seek a more straightforward strategy that discovers the specific instance separatrixes by directly modeling their unique semantics.
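The redundancy removal idea described above (excluding separatrix pixels from text regions so that adjacent instances fall apart) can be illustrated with a toy sketch. This is not the authors' implementation: the per-pixel probability maps, the fixed 0.5 thresholds, the 4-connectivity and the function names `label_components` and `separate_instances` are all assumptions made for illustration, and the reliability analysis mentioned in the abstract is not modeled here.

```python
from collections import deque


def label_components(mask):
    """Label 4-connected components of a binary grid.

    Returns (labels, count), where labels[r][c] is 0 for background
    and 1..count for each connected foreground region.
    """
    h, w = len(mask), len(mask[0])
    labels = [[0] * w for _ in range(h)]
    count = 0
    for r in range(h):
        for c in range(w):
            if mask[r][c] and labels[r][c] == 0:
                count += 1
                labels[r][c] = count
                queue = deque([(r, c)])
                # Breadth-first flood fill from the newly found pixel.
                while queue:
                    y, x = queue.popleft()
                    for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                        if (0 <= ny < h and 0 <= nx < w
                                and mask[ny][nx] and labels[ny][nx] == 0):
                            labels[ny][nx] = count
                            queue.append((ny, nx))
    return labels, count


def separate_instances(text_prob, sep_prob, text_thr=0.5, sep_thr=0.5):
    """Keep pixels that look like text and do NOT look like separatrix,
    then group the remaining pixels into instances.

    The thresholds are hypothetical values chosen for this toy example.
    """
    mask = [
        [t > text_thr and s <= sep_thr for t, s in zip(t_row, s_row)]
        for t_row, s_row in zip(text_prob, sep_prob)
    ]
    return label_components(mask)


# Toy maps: two words side by side, touching in the text map (column 2),
# with a high separatrix response along the column between them.
text_prob = [
    [0.9, 0.9, 0.8, 0.9, 0.9],
    [0.9, 0.9, 0.7, 0.9, 0.9],
    [0.1, 0.1, 0.1, 0.1, 0.1],
]
sep_prob = [
    [0.1, 0.1, 0.9, 0.1, 0.1],
    [0.1, 0.1, 0.8, 0.1, 0.1],
    [0.0, 0.0, 0.0, 0.0, 0.0],
]

# Thresholding the text map alone merges both words into one region.
_, merged_count = label_components([[t > 0.5 for t in row] for row in text_prob])

# Removing separatrix pixels first yields two separate instances.
labels, separated_count = separate_instances(text_prob, sep_prob)
print(merged_count, separated_count)
```

In this toy case the text map alone produces a single merged region, while removing the separatrix column leaves two regions, and no seed segmentation or iterative expansion is involved.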

