Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Character spotting and autonomous tagging: offline handwriting recognition for Bangla, Korean and other alphabetic scripts
Department of Physics and Engineering, Fort Lewis College, Durango, Colorado, USA.ORCID iD: 0000-0001-5445-5252
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab. Department of Electrical and Computer Engineering, Boise State University, Boise, Idaho, USA.
2022 (English)In: International Journal on Document Analysis and Recognition, ISSN 1433-2833, E-ISSN 1433-2825, Vol. 25, no 4, p. 245-263Article in journal (Refereed) Published
Abstract [en]

This paper demonstrates a framework for offline handwriting recognition using character spotting and autonomous tagging which works for any alphabetic script. Character spotting builds on the idea of object detection to find character elements in unsegmented word images. An autonomous tagging approach is introduced which automates the production of a character image training set by estimating character locations in a word based on typical character size. Although scripts can vary vividly from each other, our proposed approach provides a simple and powerful workflow for unconstrained offline recognition that should work for any alphabetic script with few adjustments. Here we demonstrate this approach with handwritten Bangla, obtaining a character recognition accuracy (CRA) of 94.8% and 91.12% with precision and autonomous tagging, respectively. Furthermore, we explained how character spotting and autonomous tagging can be implemented for other alphabetic scripts. We demonstrated that with handwritten Hangul/Korean obtaining a Jamo recognition accuracy (JRA) of 93.16% using a tiny fraction of the PE92 training set. The combination of character spotting and autonomous tagging takes away one of the biggest frustrations-data annotation by hand, and thus, we believe this has the potential to revolutionize the growth of offline recognition development.

Place, publisher, year, edition, pages
Springer Nature, 2022. Vol. 25, no 4, p. 245-263
Keywords [en]
Offline handwriting recognition, Bangla handwriting recognition, Korean handwriting recognition, Character spotting, Autonomous tagging
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-93346DOI: 10.1007/s10032-022-00410-xISI: 000854727900001Scopus ID: 2-s2.0-85138292541OAI: oai:DiVA.org:ltu-93346DiVA, id: diva2:1708810
Conference
18th International Conference on Frontiers of Handwriting Recognition (ICFHR 2022), Hyderabad, India, December 4-7, 2022
Note

Godkänd;2022;Nivå 0;2022-12-01 (marisr);Konferensartikel i tidskrift

Available from: 2022-11-07 Created: 2022-11-07 Last updated: 2022-12-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Barney Smith, Elisa H.

Search in DiVA

By author/editor
Majid, NishatulBarney Smith, Elisa H.
By organisation
Embedded Internet Systems Lab
In the same journal
International Journal on Document Analysis and Recognition
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 206 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf