You have already added 0 works in your ORCID record related to the merged Research product.
You have already added 0 works in your ORCID record related to the merged Research product.
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
Content and Style Aware Generation of Text-Line Images for Handwriting Recognition
pmid: 34699351
Content and Style Aware Generation of Text-Line Images for Handwriting Recognition
Handwritten Text Recognition has achieved an impressive performance in public benchmarks. However, due to the high inter- and intra-class variability between handwriting styles, such recognizers need to be trained using huge volumes of manually labeled training data. To alleviate this labor-consuming problem, synthetic data produced with TrueType fonts has been often used in the training loop to gain volume and augment the handwriting style variability. However, there is a significant style bias between synthetic and real data which hinders the improvement of recognition performance. To deal with such limitations, we propose a generative method for handwritten text-line images, which is conditioned on both visual appearance and textual content. Our method is able to produce long text-line samples with diverse handwriting styles. Once properly trained, our method can also be adapted to new target data by only accessing unlabeled text-line images to mimic handwritten styles and produce images with any textual content. Extensive experiments have been done on making use of the generated samples to boost Handwritten Text Recognition performance. Both qualitative and quantitative results demonstrate that the proposed approach outperforms the current state of the art.
Accepted to TPAMI
- Autonomous University of Barcelona Spain
- Shantou University China (People's Republic of)
- Computer Vision Center Spain
Microsoft Academic Graph classification: Computer science computer.software_genre Synthetic data Style (sociolinguistics) Handwriting business.industry Volume (computing) Visual appearance Handwriting recognition State (computer science) Artificial intelligence Line (text file) business computer Natural language processing
FOS: Computer and information sciences, Handwriting, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Pattern Recognition, Automated, Artificial Intelligence, Applied Mathematics, Computational Theory and Mathematics, Computer Vision and Pattern Recognition, Algorithms, Software
FOS: Computer and information sciences, Handwriting, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Pattern Recognition, Automated, Artificial Intelligence, Applied Mathematics, Computational Theory and Mathematics, Computer Vision and Pattern Recognition, Algorithms, Software
Microsoft Academic Graph classification: Computer science computer.software_genre Synthetic data Style (sociolinguistics) Handwriting business.industry Volume (computing) Visual appearance Handwriting recognition State (computer science) Artificial intelligence Line (text file) business computer Natural language processing
50 references, page 1 of 5
[1] P. Krishnan and C. Jawahar, “Hwnet v2: An efficient word image representation for handwritten documents,” International Journal on Document Analysis and Recognition, vol. 22, no. 4, pp. 387-405, 2019.
[2] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, “Generative adversarial nets,” in Proceedings of the Conference on Neural Information Processing Systems, 2014.
[3] T. Karras, S. Laine, M. Aittala, J. Hellsten, J. Lehtinen, and T. Aila, “Analyzing and improving the image quality of stylegan,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2020, pp. 8110-8119.
[4] M. Mirza and S. Osindero, “Conditional generative adversarial nets,” arXiv preprint arXiv:1411.1784, 2014.
[5] Y. Choi, M. Choi, M. Kim, J.-W. Ha, S. Kim, and J. Choo, “StarGAN: Unified generative adversarial networks for multi-domain image-to-image translation,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018.
[6] L. Yu, W. Zhang, J. Wang, and Y. Yu, “SeqGAN: Sequence generative adversarial nets with policy gradient,” in Proceedings of the AAAI Conference on Artificial Intelligence, 2017.
[7] D. Ha and D. Eck, “A neural representation of sketch drawings,” in Proceedings of the International Conference on Learning Representations, 2018.
[8] N. Zheng, Y. Jiang, and D. Huang, “Strokenet: A neural painting environment,” in Proceedings of the International Conference on Learning Representations, 2019.
[9] H.-W. Dong, W.-Y. Hsiao, L.-C. Yang, and Y.-H. Yang, “MuseGAN: Multi-track sequential generative adversarial networks for symbolic music generation and accompaniment,” in Proceedings of the AAAI Conference on Artificial Intelligence, 2018.
[10] S. Tulyakov, M.-Y. Liu, X. Yang, and J. Kautz, “MoCoGAN: Decomposing motion and content for video generation,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018.
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).5 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 10% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Average impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Average citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).5 popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.Top 10% influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).Average impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.Average Powered byBIP!