Amharic OCR: An End-to-End Learning
Department of Computer Science, University of Kaiserslautern, Germany; Faculty of Computing, Bahir Dar Institute of Technology, Ethiopia.
Department of Computer Science, University of Kaiserslautern, Germany.
School of Information Science, Addis Ababa University, Ethiopia.
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab. ORCID iD: 0000-0003-4029-6574
2020 (English). In: Applied Sciences, E-ISSN 2076-3417, Vol. 10, no 3, article id 1117. Article in journal (Refereed). Published.
Abstract [en]

In this paper, we introduce an end-to-end Amharic text-line image recognition approach based on recurrent neural networks. Amharic is an indigenous Ethiopic script that follows a unique syllabic writing system adopted from the ancient Geez script. The script uses 34 consonant characters, each with seven vowel variants (called basic characters), plus labialized characters derived by adding diacritical marks to and/or removing parts of the basic characters. The diacritics on basic characters are relatively small, visually similar, and challenging to distinguish from the derived characters. Motivated by the recent success of end-to-end learning in pattern recognition, we propose a model that integrates a feature extractor, a sequence learner, and a transcriber in a unified module, which is then trained in an end-to-end fashion. Experimental results on the printed and synthetic parts of ADOCR, a benchmark Amharic Optical Character Recognition (OCR) database, demonstrate that the proposed model outperforms state-of-the-art methods by 6.98% and 1.05%, respectively.
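
The abstract names a transcriber stage, and the keywords list CTC, so the transcription step can be illustrated with a minimal sketch of greedy (best-path) CTC decoding. This is an illustration under stated assumptions, not the authors' implementation: the function name `ctc_greedy_decode`, the three-character toy alphabet, the blank index 0, and the per-frame scores are all hypothetical, and the record does not say which decoding strategy the paper uses.

```python
from itertools import groupby

# Assumption: index 0 is the CTC blank symbol (a common convention, not
# something stated in the abstract).
BLANK = 0

# Hypothetical toy alphabet: the blank plus three Amharic characters.
ALPHABET = ["<blank>", "ሀ", "ለ", "መ"]


def ctc_greedy_decode(frame_scores, alphabet, blank=BLANK):
    """Best-path CTC decoding.

    frame_scores: one list of per-class scores for each time step
    (for example, the outputs of the sequence learner).
    Returns the decoded string.
    """
    # 1. Pick the highest-scoring class index at every time step.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]
    # 2. Merge runs of identical consecutive indices into one.
    collapsed = [index for index, _ in groupby(best_path)]
    # 3. Drop blanks; a blank between two equal labels keeps both.
    return "".join(alphabet[index] for index in collapsed if index != blank)


# Hypothetical scores for six frames; the best path is 1, 1, 0, 1, 2, 2.
frames = [
    [0.10, 0.70, 0.10, 0.10],
    [0.20, 0.60, 0.10, 0.10],
    [0.80, 0.10, 0.05, 0.05],
    [0.10, 0.70, 0.10, 0.10],
    [0.10, 0.10, 0.70, 0.10],
    [0.10, 0.10, 0.60, 0.20],
]
decoded = ctc_greedy_decode(frames, ALPHABET)
```

With these toy scores, the repeated first label is merged, the blank separates the two occurrences of the same character, and the final repeated label is merged, so `decoded` holds three characters. Greedy decoding is the simplest option; beam search over the same scores is a common alternative.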

Place, publisher, year, edition, pages
MDPI, 2020. Vol. 10, no 3, article id 1117
Keywords [en]
Amharic script, CNN, CTC, end-to-end learning, LSTM, OCR, pattern recognition, text-line image
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-78211
DOI: 10.3390/app10031117
ISI: 000525305900388
Scopus ID: 2-s2.0-85081261258
OAI: oai:DiVA.org:ltu-78211
DiVA, id: diva2:1416836
Note

Validated; 2020; Level 2; 2020-03-25 (alebob)

Available from: 2020-03-25. Created: 2020-03-25. Last updated: 2023-10-28. Bibliographically approved.

Authority records

Liwicki, Marcus