A Deep Learning based Arabic Script Recognition System: Benchmark on KHAT
Shaheed Banazir Bhutto University, Sheringal, Pakistan.
Computer Science Department, GGPGC No.1 Abbottabad, Pakistan.
Mindgarage, University of Kaiserslautern, Germany.
Al Khwarizmi Institute of Computer Science, UET Lahore, Pakistan.
2020 (English). In: The International Arab Journal of Information Technology, ISSN 1683-3198, Vol. 17, no. 3, pp. 299-305. Article in journal (Refereed). Published
Abstract [en]

This paper presents a deep learning benchmark on a complex dataset known as KFUPM Handwritten Arabic TexT (KHATT). The KHATT dataset consists of complex patterns of handwritten Arabic text-lines. The paper contributes in three main aspects: (1) pre-processing, (2) a deep learning based approach, and (3) data augmentation. The pre-processing step prunes extra white space and de-skews skewed text-lines. We deploy a deep learning approach based on Multi-Dimensional Long Short-Term Memory (MDLSTM) networks and Connectionist Temporal Classification (CTC). The MDLSTM has the advantage of scanning the Arabic text-lines in all directions (horizontal and vertical) to cover dots, diacritics, strokes, and fine inflections. Combining data augmentation with the deep learning approach improves results, reaching 80.02% Character Recognition (CR) against a baseline of 75.08%.
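
The white-space pruning part of the pre-processing step can be illustrated with a minimal sketch. It assumes a text-line image stored as a list of rows of grayscale values with a white background of 255. The function name `prune_white_margins` and the `background` parameter are hypothetical and are not taken from the paper, whose actual implementation is not described in this record.

```python
def prune_white_margins(image, background=255):
    """Crop the outer rows and columns that contain only background pixels.

    `image` is a list of equal-length rows of grayscale values.
    `background` is the assumed value of blank paper (an illustrative default).
    """
    # Rows that hold at least one non-background (ink) pixel.
    rows = [r for r, row in enumerate(image)
            if any(p != background for p in row)]
    if not rows:
        return []  # blank image: nothing to keep

    # Columns that hold at least one ink pixel.
    cols = [c for c in range(len(image[0]))
            if any(row[c] != background for row in image)]

    top, bottom = rows[0], rows[-1]
    left, right = cols[0], cols[-1]
    # Keep only the bounding box of the ink.
    return [row[left:right + 1] for row in image[top:bottom + 1]]


# A tiny 4x5 "text-line" with a white border around three ink pixels.
line = [
    [255, 255, 255, 255, 255],
    [255, 255,   0, 255, 255],
    [255,   0,   0, 255, 255],
    [255, 255, 255, 255, 255],
]
cropped = prune_white_margins(line)
```

Only the outer margins are removed here; white space inside the bounding box, such as gaps between words, is left untouched.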

Place, publisher, year, edition, pages
Zarqa University, Jordan, 2020. Vol. 17, no. 3, pp. 299-305
Keywords [en]
Handwritten Arabic text recognition, deep learning, data augmentation
National subject category
Computer Sciences
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-78876
DOI: 10.34028/iajit/17/3/3
ISI: 000529820700003
Scopus ID: 2-s2.0-85086443300
OAI: oai:DiVA.org:ltu-78876
DiVA, id: diva2:1430307
Note

Validated; 2020; Level 2; 2020-05-14 (alebob)

Available from: 2020-05-14. Created: 2020-05-14. Last updated: 2020-06-29. Bibliographically approved.

Person

Liwicki, Marcus
