Labeling, Cutting, Grouping: An Efficient Text Line Segmentation Method for Medieval ManuscriptsShow others and affiliations
2019 (English)In: The 15th IAPR International Conference on Document Analysis and Recognition: ICDAR 2019, IEEE, 2019, p. 1200-1206Conference paper, Published paper (Other academic)
Abstract [en]
This paper introduces a new way for text-line extraction by integrating deep-learning based pre-classification and state-of-the-art segmentation methods. Text-line extraction in complex handwritten documents poses a significant challenge, even to the most modern computer vision algorithms. Historical manuscripts are a particularly hard class of documents as they present several forms of noise, such as degradation, bleed-through, interlinear glosses, and elaborated scripts. In this work, we propose a novel method which uses semantic segmentation at pixel level as intermediate task, followed by a text-line extraction step. We measured the performance of our method on a recent dataset of challenging medieval manuscripts and surpassed state-of-the-art results by reducing the error by 80.7%. Furthermore, we demonstrate the effectiveness of our approach on various other datasets written in different scripts. Hence, our contribution is two-fold. First, we demonstrate that semantic pixel segmentation can be used as strong denoising pre-processing step before performing text line extraction. Second, we introduce a novel, simple and robust algorithm that leverages the high-quality semantic segmentation to achieve a text-line extraction performance of 99.42% line IU on a challenging dataset.
Place, publisher, year, edition, pages
IEEE, 2019. p. 1200-1206
Series
International Conference on Document Analysis and Recognition, ISSN 1520-5363, E-ISSN 2379-2140
Keywords [en]
textline segmentation, neural networks, document image analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-78685DOI: 10.1109/ICDAR.2019.00194Scopus ID: 2-s2.0-85074775033OAI: oai:DiVA.org:ltu-78685DiVA, id: diva2:1426631
Conference
The 15th IAPR International Conference on Document Analysis and Recognition (ICDAR 2019), 20-25 September, Sydney, Australia
Note
ISBN för värdpublikation: 978-1-7281-3014-9, 978-1-7281-3015-6
2020-04-272020-04-272020-04-27Bibliographically approved