Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Quadbox: Quadrilateral bounding box based scene text detection using vector regression
Department of Computer Science and Engineering, Indian Institute of Technology Roorkee, India.
Department of Computer Science and Engineering, Indian Institute of Technology Roorkee, India.
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab.ORCID iD: 0000-0001-8532-0895
Department of Computer Science and Engineering, Indian Institute of Technology Roorkee, India.
2021 (English)In: IEEE Access, E-ISSN 2169-3536, Vol. 9, p. 36802-36818Article in journal (Refereed) Published
Abstract [en]

Scene text appears with a wide range of sizes and arbitrary orientations. For detecting such text in the scene image, the quadrilateral bounding boxes provide a much tight bounding box compared to the rotated rectangle. In this work, a vector regression method has been proposed for text detection in the wild to generate a quadrilateral bounding box. The bounding box prediction using direct regression requires predicting the vectors from each position inside the quadrilateral. It needs to predict four-vectors, and each varies drastically in its length and orientation. It makes the vector prediction a difficult problem. To overcome this, we have proposed a centroid-centric vector regression by utilizing the geometry of quadrilateral. In this work, we have added the philosophy of indirect regression to direct regression by shifting all points within the quadrilateral to the centroid and afterward performed vector regression from shifted points. The experimental results show the improvement of the quadrilateral approach over the existing direct regression approach. The proposed method shows good performance on many existing public datasets. The proposed method also demonstrates good results on the unseen dataset without getting trained on it, which validates the approach’s generalization ability.

Place, publisher, year, edition, pages
IEEE, 2021. Vol. 9, p. 36802-36818
Keywords [en]
Scene text detection, direct regression, indirect regression, quadrilateral bounding boxes, centroid of the quadrilateral
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-83173DOI: 10.1109/ACCESS.2021.3063030ISI: 000626496900001Scopus ID: 2-s2.0-85102236506OAI: oai:DiVA.org:ltu-83173DiVA, id: diva2:1534267
Funder
VinnovaEuropean Regional Development Fund (ERDF)
Note

Validerad;2021;Nivå 2;2021-03-22 (johcin);

Finansiär: Visvesvaraya Ph.D. fellowship

Available from: 2021-03-05 Created: 2021-03-05 Last updated: 2023-09-05Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Saini, Rajkumar

Search in DiVA

By author/editor
Saini, Rajkumar
By organisation
Embedded Internet Systems Lab
In the same journal
IEEE Access
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 255 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf