Scene Text Recognition Based on Improved CRNN

Wenhua Yu

Mayire Ibrayim and Askar Hamdulla

Resumen

Text recognition is an important research topic in computer vision. Scene text, which refers to the text in real scenes, sometimes needs to meet the requirement of attracting attention, and there is the situation such as deformation. At the same time, the image acquisition process is affected by factors such as occlusion, noise, and obstruction, making scene text recognition tasks more challenging. In this paper, we improve the CRNN model for text recognition, which has relatively low accuracy, poor performance in recognizing irregular text, and only considers obtaining text sequence information from a single aspect, resulting in incomplete information acquisition. Firstly, to address the problems of low text recognition accuracy and poor recognition of irregular text, we add label smoothing to ensure the model?s generalization ability. Then, we introduce the smoothing loss function from speech recognition into the field of text recognition, and add a language model to increase information acquisition channels, ultimately achieving the goal of improving text recognition accuracy. This method was experimentally verified on six public datasets and compared with other advanced methods. The experimental results show that this method performs well in most benchmark tests, and the improved model outperforms the original model in recognition performance.

Palabras claves

CRNN - text recognition - label smoothing - language model - deep learning

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 14 Parte: 7 (2023)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Information
Applied Sciences

DOI

https://doi.org/10.3390/info14070369

Artículos similares

CSFF-Net: Scene Text Detection Based on Cross-Scale Feature Fusion

Acceso

Yuan Li, Mayire Ibrayim and Askar Hamdulla

In the last years, methods for detecting text in real scenes have made significant progress with an increase in neural networks. However, due to the limitation of the receptive field of the central nervous system and the simple representation of text by ... ver más

Revista: Information

Compact and Accurate Scene Text Detector

Acceso

Minjun Jeon and Young-Seob Jeong

Scene text detection is the task of detecting word boxes in given images. The accuracy of text detection has been greatly elevated using deep learning models, especially convolutional neural networks. Previous studies commonly aimed at developing more ac... ver más

Revista: Applied Sciences

A Novel Approach to Wearable Image Recognition Systems to Aid Visually Impaired People

Acceso

Shiwei Chen, Dayue Yao, Huiliang Cao and Chong Shen

Action and identification problems are the challenges that visually impaired people often encounter in their lives. The high price of existing commercial intelligent auxiliary equipment has placed enormous economic pressure on most visually impaired peop... ver más

Revista: Applied Sciences

Multimedia Storytelling in Journalism: Exploring Narrative Techniques in Snow Fall

Acceso

Kobie Van Krieken

News stories aim to create an immersive reading experience by virtually transporting the audience to the described scenes. In print journalism, this experience is facilitated by text-linguistic narrative techniques, such as detailed scene reconstructions... ver más

Revista: Information

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas