Change search
Link to record
Permanent link

Direct link
Barney Smith, Elisa H.ORCID iD iconorcid.org/0000-0003-2039-3844
Alternative names
Publications (10 of 23) Show all publications
Liu, C. & Barney Smith, E. H. (2026). Watch and Act: Multi-orientation Open-Set Scene Text Recognition via Dynamic Expert Routing. In: X-G. Yin; D.Karatzas; D. Lopresti (Ed.), Document Analysis and Recognition – ICDAR 2025: 19th International Conference, Wuhan, China, September 16–21, 2025, Proceedings. Paper presented at 19th International Conference, Wuhan, China, September 16–21, 2025 (pp. 595-612). Springer Science and Business Media Deutschland GmbH, 3
Open this publication in new window or tab >>Watch and Act: Multi-orientation Open-Set Scene Text Recognition via Dynamic Expert Routing
2026 (English)In: Document Analysis and Recognition – ICDAR 2025: 19th International Conference, Wuhan, China, September 16–21, 2025, Proceedings / [ed] X-G. Yin; D.Karatzas; D. Lopresti, Springer Science and Business Media Deutschland GmbH , 2026, Vol. 3, p. 595-612Conference paper, Published paper (Refereed)
Abstract [en]

Text samples captured in natural scenes often come from different scripts written in various directions, resulting in significant diversities in character set, orientation, and text length. Current methods handle such diversities by routing input images to corresponding (sub)modules with hardwired rules that are usually tied to image aspect-ratio ranges. However, setting hardwired rules requires extensive knowledge of the testing data and model-building experience, hence is often suboptimal. To resolve this, we propose an end-to-end trainable watch-and-act (WnA) framework, which first watches thumbnails to generate a routing plan and then routes the samples to corresponding experts to produce the recognition results. The framework shows a significant robustness improvement over the corresponding rule-based baselines by 2% of line accuracy on the recent multi-orientation open-set text recognition benchmark. The proposed framework, as a system, also shows 3–10% line accuracy advantages over previous open-set text recognition methods on horizontal samples. 

Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH, 2026
Series
Lecture Notes in Computer Science (LNCS), ISSN 0302-9743, E-ISSN 1611-3349
National Category
Computer Sciences Computer graphics and computer vision
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-115069 (URN)10.1007/978-3-032-04624-6_35 (DOI)2-s2.0-105017377350 (Scopus ID)
Conference
19th International Conference, Wuhan, China, September 16–21, 2025
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Note

ISBN for host publication: 978-3-032-04623-9, 978-3-032-04624-6

Available from: 2025-10-10 Created: 2025-10-10 Last updated: 2025-10-21Bibliographically approved
Corbillé, S. & Barney Smith, E. H. (2025). Applying Center Loss to Neural Networks for Sequence Prediction: A Study for Handwriting Recognition. In: 2025 International Joint Conference on Neural Networks (IJCNN): . Paper presented at International Joint Conference on Neural Networks (IJCNN), Rome, Italy, June 30 - July 05, 2025. IEEE
Open this publication in new window or tab >>Applying Center Loss to Neural Networks for Sequence Prediction: A Study for Handwriting Recognition
2025 (English)In: 2025 International Joint Conference on Neural Networks (IJCNN), IEEE, 2025Conference paper, Published paper (Refereed)
Abstract [en]

We propose a method to improve the overall accuracy of a neural network for predicting a sequence without using more training data nor adding more parameters. We apply a center loss at the sequence level as an auxiliary task. At every epoch we compute the center for each class, then we apply a center loss on each element of the sequence in order to reduce the intra-class distance. Center loss makes features more discriminative as well as compact in the feature space which increases the accuracy of the network and reduces overfitting. The network is trained jointly with the sequence prediction task and the center loss auxiliary task which increases the computation time only during training not in inference. We evaluate our method in a handwriting text recognition context on seven datasets. In addition to outperforming methods that do not use additional data for all datasets, our method achieves competitive results compared to those that do, with faster inference speed and fewer parameters. We also show that our method applied on a light neural network improves accuracy and is able to achieve competitive performance compared to deeper models. The advantage of using a light model is the processing speed needed for real applications. Code is available at https://github.com/simon-corbi/htr-ijcnn-2025.

Place, publisher, year, edition, pages
IEEE, 2025
National Category
Artificial Intelligence
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-115577 (URN)10.1109/IJCNN64981.2025.11227321 (DOI)
Conference
International Joint Conference on Neural Networks (IJCNN), Rome, Italy, June 30 - July 05, 2025
Funder
The Kempe Foundations, CSMK23-0109Knut and Alice Wallenberg Foundation
Note

ISBN for host publication:  979-8-3315-1042-8;

Available from: 2025-11-26 Created: 2025-11-26 Last updated: 2025-11-26Bibliographically approved
Pagliai, I., van Boven, G., Adewumi, T., Alkhaled, L., Gurung, N., Södergren, I. & Barney, E. (2024). Data Bias According to Bipol: Men are Naturally Right and It is the Role ofWomen to Follow Their Lead. In: Mourad Abbas; Abed Alhakim Freihat (Ed.), Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP-2024): . Paper presented at 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024), Trento, Italy, October 19-20, 2024 (pp. 34-46). Association for Computational Linguistics, Article ID 2024.icnlsp-1.5.
Open this publication in new window or tab >>Data Bias According to Bipol: Men are Naturally Right and It is the Role ofWomen to Follow Their Lead
Show others...
2024 (English)In: Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP-2024) / [ed] Mourad Abbas; Abed Alhakim Freihat, Association for Computational Linguistics , 2024, p. 34-46, article id 2024.icnlsp-1.5Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
Association for Computational Linguistics, 2024
National Category
Computer and Information Sciences General Language Studies and Linguistics
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110841 (URN)
Conference
7th International Conference on Natural Language and Speech Processing (ICNLSP 2024), Trento, Italy, October 19-20, 2024
Note

ISBN for host publication: 9798891761650;

Funder: Wallenberg AI, Autonomous Systems and Software Program (WASP); Knut and Alice Wallenberg Foundation; Luleå University of Technology (LTU);

Available from: 2024-11-27 Created: 2024-11-27 Last updated: 2025-10-21Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part I. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part I
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 490
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14804
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110207 (URN)10.1007/978-3-031-70533-5 (DOI)978-3-031-70532-8 (ISBN)978-3-031-70533-5 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2025-10-21Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part II. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part II
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 446
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14805
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110210 (URN)10.1007/978-3-031-70536-6 (DOI)978-3-031-70535-9 (ISBN)978-3-031-70536-6 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2025-10-21Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference Athens, Greece, August 30 – September 4, 2024 Proceedings, Part III. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference Athens, Greece, August 30 – September 4, 2024 Proceedings, Part III
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 412
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14806
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110212 (URN)10.1007/978-3-031-70543-4 (DOI)978-3-031-70542-7 (ISBN)978-3-031-70543-4 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2025-10-21Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part IV. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part IV
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 458
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14807
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110214 (URN)10.1007/978-3-031-70546-5 (DOI)978-3-031-70545-8 (ISBN)978-3-031-70546-5 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2025-10-21Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part V. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part V
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 440
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14808
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110215 (URN)10.1007/978-3-031-70549-6 (DOI)978-3-031-70548-9 (ISBN)978-3-031-70549-6 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2025-10-21Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part VI. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part VI
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 444
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14809
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110216 (URN)10.1007/978-3-031-70552-6 (DOI)978-3-031-70551-9 (ISBN)978-3-031-70552-6 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2025-10-21Bibliographically approved
Barney Smith, E. H., Liwicki, M., Peng, L. & Marinai, S. (2024). Editorial for special issue on “advanced topics in document analysis and recognition”. International Journal on Document Analysis and Recognition, 27(3), 209-211
Open this publication in new window or tab >>Editorial for special issue on “advanced topics in document analysis and recognition”
2024 (English)In: International Journal on Document Analysis and Recognition, ISSN 1433-2833, E-ISSN 1433-2825, Vol. 27, no 3, p. 209-211Article in journal, Editorial material (Other academic) Published
Place, publisher, year, edition, pages
Springer Nature, 2024
National Category
Computer Sciences Computer graphics and computer vision
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-109150 (URN)10.1007/s10032-024-00494-7 (DOI)001291555800001 ()2-s2.0-85201225969 (Scopus ID)
Note

Godkänd;2024;Nivå 0;2024-09-03 (hanlid);

Available from: 2024-09-03 Created: 2024-09-03 Last updated: 2025-10-21Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-2039-3844

Search in DiVA

Show all publications