Cross-Depicted Historical Motif Categorization and Retrieval with Deep LearningShow others and affiliations
2020 (English)In: Journal of Imaging, ISSN 2313-433X, Vol. 6, no 7, article id 71Article in journal (Refereed) Published
Abstract [en]
In this paper, we tackle the problem of categorizing and identifying cross-depicted historical motifs using recent deep learning techniques, with aim of developing a content-based image retrieval system. As cross-depiction, we understand the problem that the same object can be represented (depicted) in various ways. The objects of interest in this research are watermarks, which are crucial for dating manuscripts. For watermarks, cross-depiction arises due to two reasons: (i) there are many similar representations of the same motif, and (ii) there are several ways of capturing the watermarks, i.e., as the watermarks are not visible on a scan or photograph, the watermarks are typically retrieved via hand tracing, rubbing, or special photographic techniques. This leads to different representations of the same (or similar) objects, making it hard for pattern recognition methods to recognize the watermarks. While this is a simple problem for human experts, computer vision techniques have problems generalizing from the various depiction possibilities. In this paper, we present a study where we use deep neural networks for categorization of watermarks with varying levels of detail. The macro-averaged F1-score on an imbalanced 12 category classification task is88.3%, the multi-labelling performance (Jaccard Index) on a 622 label task is79.5%. To analyze the usefulness of an image-based system for assisting humanities scholars in cataloguing manuscripts, we also measure the performance of similarity matching on expert-crafted test sets of varying sizes (50 and 1000 watermark samples). A significant outcome is that all relevant results belonging to the same super-class are found by our system (Mean Average Precision of 100%), despite the cross-depicted nature of the motifs. This result has not been achieved in the literature so far.
Place, publisher, year, edition, pages
MDPI, 2020. Vol. 6, no 7, article id 71
Keywords [en]
historical document, watermarks, image retrieval, convolutional neural network, deep learning
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-80505DOI: 10.3390/jimaging6070071ISI: 000554281200001PubMedID: 34460664Scopus ID: 2-s2.0-85096723565OAI: oai:DiVA.org:ltu-80505DiVA, id: diva2:1459694
Note
Validerad;2020;Nivå 2;2020-08-20 (johcin)
2020-08-202020-08-202023-05-08Bibliographically approved