Region-aware Arbitrary-shaped Text Detection with Progressive Fusion

dc.contributor.authorWang, Qitong
dc.contributor.authorFu, Bin
dc.contributor.authorLi, Ming
dc.contributor.authorHe, Junjun
dc.contributor.authorPeng, Xi
dc.contributor.authorQiao, Yu
dc.date.accessioned2022-08-17T14:23:29Z
dc.date.available2022-08-17T14:23:29Z
dc.date.issued2022-08-04
dc.description© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. This article was originally published in IEEE Transactions on Multimedia. The version of record is available at: https://doi.org/10.1109/TMM.2022.3181448.en_US
dc.description.abstractSegmentation-based text detectors are flexible to capture arbitrary-shaped text regions. Due to large geometry variance, it is necessary to construct effective and robust representations to identify text regions with various shapes and scales. In this paper, we focus on designing effective multi-scale contextual features for locating text instances. Specially, we develop a Region Context Module (RCM) to summarize the semantic response and adaptively extract text-region-aware information in a limited local area. To construct complementary multi-scale contextual representations, multiple RCM branches with different scales are employed and integrated via Progressive Fusion Module (PFM). Our proposed RCM and PFM serve as the plug-and-play modules which can be incorporated into existing scene text detection platforms to further boost detection performance. Extensive experiments show that our methods achieve state-of-the-art performances on Total-Text, SCUT-CTW1500 and MSRA-TD500 datasets. The code with models will become publicly available at https://github.com/wqtwjt1996/RP-Text.en_US
dc.description.sponsorshipThis work is partially supported by the Joint Lab of CAS-HK, the Shenzhen Research Program (JSGG20191129141212311, RCJC20200714114557087), the Shanghai Committee of Science and Technology (Grant No. 21DZ1100100).en_US
dc.identifier.citationQ. Wang, B. Fu, M. Li, J. He, X. Peng and Y. Qiao, "Region-aware Arbitrary-shaped Text Detection with Progressive Fusion," in IEEE Transactions on Multimedia, 2022, doi: 10.1109/TMM.2022.3181448.en_US
dc.identifier.issn1941-0077
dc.identifier.urihttps://udspace.udel.edu/handle/19716/31202
dc.language.isoen_USen_US
dc.publisherIEEE Transactions on Multimediaen_US
dc.subjectscene text detectionen_US
dc.subjectscene understandingen_US
dc.subjectdeep learningen_US
dc.titleRegion-aware Arbitrary-shaped Text Detection with Progressive Fusionen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Region-aware Arbitrary-shaped Text Detection with.pdf
Size:
18.58 MB
Format:
Adobe Portable Document Format
Description:
Main article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.22 KB
Format:
Item-specific license agreed upon to submission
Description: