Abstract
Text detection in natural scene images becomes highly demanded for unstructured data in banking. In this paper, we propose a new deep learning algorithm called MSER, Hu-moment and Deep learning for Text detection (MHDT) based on Maximum Stable Extremal Regions (MSER) and Hu-moment features. Firstly, we extract MSERs as candidate characters. Secondly, a character classifier is introduced with Hu-moment features to reduce the number of input for clustering. After single linkage clustering, a text classifier trained from a Deep Brief Network is used to delete non-text. The proposed algorithm is evaluated on the ICDAR database, and the experimental results show that the proposed algorithm yields high precision and recall rate.
Original language | English |
---|---|
Title of host publication | ICMLC '19: Proceedings of the 2019 11th International Conference on Machine Learning and Computing |
Publisher | Association for Computing Machinery (ACM) |
Pages | 295-300 |
Number of pages | 6 |
ISBN (Print) | 9781450366007 |
DOIs | |
Publication status | Published - 22 Feb 2019 |
Event | 11th International Conference on Machine Learning and Computing, ICMLC 2019 - Zhuhai, China Duration: 22 Feb 2019 → 24 Feb 2019 https://dl.acm.org/doi/proceedings/10.1145/3318299 |
Conference
Conference | 11th International Conference on Machine Learning and Computing, ICMLC 2019 |
---|---|
Country/Territory | China |
City | Zhuhai |
Period | 22/02/19 → 24/02/19 |
Internet address |
Scopus Subject Areas
- Software
- Human-Computer Interaction
- Computer Vision and Pattern Recognition
- Computer Networks and Communications
User-Defined Keywords
- Deep learning
- Text detection
- Unstructured data