Quadbox: Quadrilateral bounding box based scene text detection using vector regression
2021 (English)In: IEEE Access, E-ISSN 2169-3536, Vol. 9, p. 36802-36818
Article in journal (Refereed) Published
Abstract [en]
Scene text appears with a wide range of sizes and arbitrary orientations. For detecting such text in the scene image, the quadrilateral bounding boxes provide a much tight bounding box compared to the rotated rectangle. In this work, a vector regression method has been proposed for text detection in the wild to generate a quadrilateral bounding box. The bounding box prediction using direct regression requires predicting the vectors from each position inside the quadrilateral. It needs to predict four-vectors, and each varies drastically in its length and orientation. It makes the vector prediction a difficult problem. To overcome this, we have proposed a centroid-centric vector regression by utilizing the geometry of quadrilateral. In this work, we have added the philosophy of indirect regression to direct regression by shifting all points within the quadrilateral to the centroid and afterward performed vector regression from shifted points. The experimental results show the improvement of the quadrilateral approach over the existing direct regression approach. The proposed method shows good performance on many existing public datasets. The proposed method also demonstrates good results on the unseen dataset without getting trained on it, which validates the approach’s generalization ability.
Place, publisher, year, edition, pages
IEEE, 2021. Vol. 9, p. 36802-36818
Keywords [en]
Scene text detection, direct regression, indirect regression, quadrilateral bounding boxes, centroid of the quadrilateral
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-83173DOI: 10.1109/ACCESS.2021.3063030ISI: 000626496900001Scopus ID: 2-s2.0-85102236506OAI: oai:DiVA.org:ltu-83173DiVA, id: diva2:1534267
Funder
VinnovaEuropean Regional Development Fund (ERDF)
Note
Validerad;2021;Nivå 2;2021-03-22 (johcin);
Finansiär: Visvesvaraya Ph.D. fellowship
2021-03-052021-03-052023-09-05Bibliographically approved