Towards End-to-End Semi-Supervised Table Detection with Deformable Transformer
2023 (English). In: Document Analysis and Recognition - ICDAR 2023, Part II / [ed] Gernot A. Fink, Rajiv Jain, Koichi Kise & Richard Zanibbi, Springer, 2023, p. 51-76.
Conference paper, Published paper (Refereed)
Abstract [en]
Table detection is the task of classifying and localizing table objects within document images. With recent developments in deep learning methods, table detection has seen remarkable success. However, a significant amount of labeled data is required to train these models effectively. Many semi-supervised approaches have been introduced to reduce the need for large amounts of labeled data. These approaches use CNN-based detectors that rely on anchor proposals and post-processing stages such as NMS. To tackle these limitations, this paper presents a novel end-to-end semi-supervised table detection method that employs the deformable transformer for detecting table objects. We evaluate our semi-supervised method on the PubLayNet, DocBank, ICDAR-19 and TableBank datasets, and it achieves superior performance compared to previous methods. It outperforms the fully supervised method (Deformable transformer) by +3.4 points on 10% labels of the TableBank-both dataset and the previous CNN-based semi-supervised approach (Soft Teacher) by +1.8 points on 10% labels of the PubLayNet dataset. We hope this work opens new possibilities towards semi-supervised and unsupervised table detection methods.
Place, publisher, year, edition, pages
Springer, 2023. p. 51-76
Series
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN 0302-9743, E-ISSN 1611-3349 ; 14188
Keywords [en]
Deformable Transformer, Semi-Supervised Learning, Table Analysis, Table Detection
National Category
Computer Sciences; Computer graphics and computer vision
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-103377
DOI: 10.1007/978-3-031-41679-8_4
ISI: 001346405600004
Scopus ID: 2-s2.0-85173579777
OAI: oai:DiVA.org:ltu-103377
DiVA, id: diva2:1823703
Conference
17th International Conference on Document Analysis and Recognition (ICDAR 2023), San José, CA, United States, August 21-26, 2023
Note
ISBN for host publication: 978-3-031-41678-1 (print), 978-3-031-41679-8 (electronic);
Funder: the European project AIRISE (grant ID: 101092312)
2024-01-03, 2024-01-03, 2025-02-01. Bibliographically approved