Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards Robust Object Detection in Floor Plan Images: A Data Augmentation Approach
Department of Computer Science, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany.ORCID iD: 0000-0003-2882-2789
Department of Computer Science, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany; Mindgarage, Department of Computer Science, Technical University of Kaiserslautern, 67663 Kaiserslautern, Germany; German Research Institute for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany.ORCID iD: 0000-0003-0456-6493
German Research Institute for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany.
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab.ORCID iD: 0000-0003-4029-6574
Show others and affiliations
2021 (English)In: Applied Sciences, E-ISSN 2076-3417, Vol. 11, no 23, article id 11174Article in journal (Refereed) Published
Abstract [en]

Object detection is one of the most critical tasks in the field of Computer vision. This task comprises identifying and localizing an object in the image. Architectural floor plans represent the layout of buildings and apartments. The floor plans consist of walls, windows, stairs, and other furniture objects. While recognizing floor plan objects is straightforward for humans, automatically processing floor plans and recognizing objects is challenging. In this work, we investigate the performance of the recently introduced Cascade Mask R-CNN network to solve object detection in floor plan images. Furthermore, we experimentally establish that deformable convolution works better than conventional convolutions in the proposed framework. Prior datasets for object detection in floor plan images are either publicly unavailable or contain few samples. We introduce SFPI, a novel synthetic floor plan dataset consisting of 10,000 images to address this issue. Our proposed method conveniently exceeds the previous state-of-the-art results on the SESYD dataset with an mAP of 98.1%. Moreover, it sets impressive baseline results on our novel SFPI dataset with an mAP of 99.8%. We believe that introducing the modern dataset enables the researcher to enhance the research in this domain.

Place, publisher, year, edition, pages
MDPI, 2021. Vol. 11, no 23, article id 11174
Keywords [en]
object detection, Cascade Mask R-CNN, floor plan images, deep learning, transfer learning, dataset augmentation, computer vision
National Category
Computer graphics and computer vision
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-88321DOI: 10.3390/app112311174ISI: 000741981700001Scopus ID: 2-s2.0-85120000933OAI: oai:DiVA.org:ltu-88321DiVA, id: diva2:1619396
Note

Validerad;2022;Nivå 2;2022-01-01 (beamah);

Funder: INFINITY (ID 883293)

Available from: 2021-12-13 Created: 2021-12-13 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Liwicki, Marcus

Search in DiVA

By author/editor
Mishra, ShashankHashmi, Khurram AzeemLiwicki, MarcusAfzal, Muhammad Zeshan
By organisation
Embedded Internet Systems Lab
In the same journal
Applied Sciences
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 353 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf