In the field of computer vision, large-scale image classification tasks are both
important and highly challenging. With the ongoing advances in deep learning and
optical character recognition (OCR) technologies, neural networks designed to perform
large-scale classification play an essential role in facilitating OCR systems.
In this study, we developed an automatic OCR system designed to identify up to 13,070
large-scale printed Chinese characters by using deep learning neural
networks and fine-tuning techniques. The proposed framework comprises four components,
including training dataset synthesis and background simulation,
image preprocessing and data augmentation, the process of training the model, and
transfer learning. The training data synthesis procedure is composed of a
character font generation step and a background simulation process. Three background
models are proposed to simulate the factors of the background noise and
anti-counterfeiting patterns on ID cards. To expand the diversity of the synthesized
training dataset, rotation and zooming data augmentation are applied. A
massive dataset comprising more than 19.6 million images was thus created to accommodate
the variations in the input images and improve the learning capacity
of the CNN model. Subsequently, we modified the GoogLeNet neural architecture by
replacing the FC layer with a global average pooling layer to avoid
overfitting caused by a massive amount of training data. Consequently, the number of
model parameters was reduced. Finally, we employed the transfer learning
technique to further refine the CNN model using a significantly small number of real
data samples. Experimental results show that the overall recognition
performance of the proposed approach is significantly better than that of prior methods
and thus demonstrate the effectiveness of proposed framework, which
exhibited a recognition accuracy as high as 99.39% on the constructed real ID card
dataset.
Full text available:
PDF Download Link.
Download synthesized dataset:
Download Link.
(SIZE: 8.3GB. You can find class names here:
Download Link.)
Download software package:
Download Link.)
(SIZE: 970MB. Include source code for generating
synthesis datasets and evaluation tool.
)
Notices:
We only provide the dataset of output classes 1812
because of the limit of storage. Please Mail to box15975316@gmail.com for
more help.