Research Article | Open Access

Scene text removal via cascaded text stroke detection and erasing

National Laboratory of Pattern Recognition, Institute of Automation, Beijing 100049, China
School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100084, China

*Xuewei Bian and Chaoqun Wang contributed equally to this work.

Recent learning-based approaches show promising improvements on the scene text removal task, but they often leave text remnants and produce visually unpleasant results. In this work, a novel end-to-end framework is proposed based on accurate text stroke detection. Specifically, the text removal problem is decoupled into two subproblems, text stroke detection and stroke removal, and a separate network is designed for each; the stroke removal network is generative. The two networks are combined into a processing unit, and this unit is cascaded to obtain our final model for text removal. Experimental results demonstrate that the proposed method substantially outperforms the state of the art in locating and erasing scene text. A new large-scale real-world dataset of 12,120 images has been constructed and is being made available to facilitate research, since current publicly available datasets are mainly synthetic and so cannot properly measure the performance of different methods.
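The decoupled, cascaded structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the two learned networks are replaced by toy stand-ins (a darkness threshold for stroke detection and a mean-background fill for erasing), and all names (`detect_strokes`, `erase_strokes`, `unit`, `cascade`) as well as the per-stage thresholds are hypothetical, chosen only to show how a detect-then-erase unit might be chained.

```python
from typing import List, Sequence, Tuple

Image = List[List[float]]  # grayscale image, values in [0, 1]
Mask = List[List[int]]     # 1 = text stroke pixel, 0 = background


def detect_strokes(image: Image, threshold: float) -> Mask:
    """Toy stand-in for a stroke detection network (assumption):
    pixels darker than `threshold` are treated as text strokes."""
    return [[1 if px < threshold else 0 for px in row] for row in image]


def erase_strokes(image: Image, mask: Mask) -> Image:
    """Toy stand-in for a generative erasing network (assumption):
    stroke pixels are filled with the mean of the non-stroke pixels."""
    background = [
        px
        for row, mask_row in zip(image, mask)
        for px, m in zip(row, mask_row)
        if m == 0
    ]
    fill = sum(background) / len(background) if background else 0.0
    return [
        [fill if m else px for px, m in zip(row, mask_row)]
        for row, mask_row in zip(image, mask)
    ]


def unit(image: Image, threshold: float) -> Tuple[Image, Mask]:
    """One processing unit: detect strokes, then erase them."""
    mask = detect_strokes(image, threshold)
    return erase_strokes(image, mask), mask


def cascade(image: Image, thresholds: Sequence[float]) -> Tuple[Image, List[Mask]]:
    """Apply several units in sequence; each unit receives the previous output.
    A different threshold per stage loosely mimics separately trained units."""
    masks: List[Mask] = []
    for threshold in thresholds:
        image, mask = unit(image, threshold)
        masks.append(mask)
    return image, masks


# Bright background (0.8), one dark stroke (0.1), one fainter remnant (0.4).
example: Image = [
    [0.8, 0.1, 0.8],
    [0.8, 0.4, 0.8],
]
result, stage_masks = cascade(example, thresholds=(0.3, 0.5))
```

In this toy example the first unit removes only the dark stroke, and the second unit picks up the fainter remnant that the first one missed, which is the intuition behind cascading the unit rather than applying it once.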


Computational Visual Media
Pages 273-287
Cite this article:
Bian X, Wang C, Quan W, et al. Scene text removal via cascaded text stroke detection and erasing. Computational Visual Media, 2022, 8(2): 273-287.








Received: 22 February 2021
Accepted: 26 May 2021
Published: 06 December 2021
© The Author(s) 2021.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.


