System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
Link to record
Permanent link

Direct link
Barney Smith, Elisa H.ORCID iD iconorcid.org/0000-0003-2039-3844
Alternative names
Publications (10 of 21) Show all publications
Pagliai, I., van Boven, G., Adewumi, T., Alkhaled, L., Gurung, N., Södergren, I. & Barney, E. (2024). Data Bias According to Bipol: Men are Naturally Right and It is the Role ofWomen to Follow Their Lead. In: Mourad Abbas; Abed Alhakim Freihat (Ed.), Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP-2024): . Paper presented at 7th International Conference on Natural Language and Speech Processing (ICNLSP 2024), Trento, Italy, October 19-20, 2024 (pp. 34-46). Association for Computational Linguistics, Article ID 2024.icnlsp-1.5.
Open this publication in new window or tab >>Data Bias According to Bipol: Men are Naturally Right and It is the Role ofWomen to Follow Their Lead
Show others...
2024 (English)In: Proceedings of the 7th International Conference on Natural Language and Speech Processing (ICNLSP-2024) / [ed] Mourad Abbas; Abed Alhakim Freihat, Association for Computational Linguistics , 2024, p. 34-46, article id 2024.icnlsp-1.5Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
Association for Computational Linguistics, 2024
National Category
Computer and Information Sciences General Language Studies and Linguistics
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110841 (URN)
Conference
7th International Conference on Natural Language and Speech Processing (ICNLSP 2024), Trento, Italy, October 19-20, 2024
Note

ISBN for host publication: 9798891761650;

Funder: Wallenberg AI, Autonomous Systems and Software Program (WASP); Knut and Alice Wallenberg Foundation; Luleå University of Technology (LTU);

Available from: 2024-11-27 Created: 2024-11-27 Last updated: 2024-11-27Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part I. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part I
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 490
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14804
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110207 (URN)10.1007/978-3-031-70533-5 (DOI)978-3-031-70532-8 (ISBN)978-3-031-70533-5 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2024-10-15Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part II. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part II
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 446
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14805
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110210 (URN)10.1007/978-3-031-70536-6 (DOI)978-3-031-70535-9 (ISBN)978-3-031-70536-6 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2024-10-15Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference Athens, Greece, August 30 – September 4, 2024 Proceedings, Part III. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference Athens, Greece, August 30 – September 4, 2024 Proceedings, Part III
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 412
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14806
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110212 (URN)10.1007/978-3-031-70543-4 (DOI)978-3-031-70542-7 (ISBN)978-3-031-70543-4 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2024-10-15Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part IV. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part IV
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 458
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14807
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110214 (URN)10.1007/978-3-031-70546-5 (DOI)978-3-031-70545-8 (ISBN)978-3-031-70546-5 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2024-10-15Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part V. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part V
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 440
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14808
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110215 (URN)10.1007/978-3-031-70549-6 (DOI)978-3-031-70548-9 (ISBN)978-3-031-70549-6 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2024-10-15Bibliographically approved
Barney Smith, E. H., Liwicki, M. & Peng, L. (Eds.). (2024). Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part VI. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024. Springer Nature
Open this publication in new window or tab >>Document Analysis and Recognition - ICDAR 2024: 18th International Conference, Athens, Greece, August 30 – September 4, 2024 Proceedings, Part VI
2024 (English)Conference proceedings (editor) (Refereed)
Abstract [en]

This six-volume set LNCS 14804-14809 constitutes the proceedings of the 18th International Conference on Document Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–September 4, 2024.The total of 144 full papers presented in these proceedings were carefully selected from 263 submissions.The papers reflect topics such as: Document image processing; physical and logical layout analysis; text and symbol recognition; handwriting recognition; document analysis systems; document classification; indexing and retrieval of documents; document synthesis; extracting document semantics; NLP for document understanding; office automation; graphics recognition; human document interaction; document representation modeling and much more.

Place, publisher, year, edition, pages
Springer Nature, 2024. p. 444
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14809
Keywords
Document Analysis Systems, Handwriting Recognition, Scene Text Detection and Recognition, Document Image Processing, Historical Document Analysis, NLP for Document Understanding, Graphics, Diagram, and Math Recognition, Multimedia Document Analysis
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110216 (URN)10.1007/978-3-031-70552-6 (DOI)978-3-031-70551-9 (ISBN)978-3-031-70552-6 (ISBN)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2024-10-15Bibliographically approved
Barney Smith, E. H., Liwicki, M., Peng, L. & Marinai, S. (2024). Editorial for special issue on “advanced topics in document analysis and recognition”. International Journal on Document Analysis and Recognition, 27(3), 209-211
Open this publication in new window or tab >>Editorial for special issue on “advanced topics in document analysis and recognition”
2024 (English)In: International Journal on Document Analysis and Recognition, ISSN 1433-2833, E-ISSN 1433-2825, Vol. 27, no 3, p. 209-211Article in journal, Editorial material (Other academic) Published
Place, publisher, year, edition, pages
Springer Nature, 2024
National Category
Computer Sciences Computer graphics and computer vision
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-109150 (URN)10.1007/s10032-024-00494-7 (DOI)001291555800001 ()2-s2.0-85201225969 (Scopus ID)
Note

Godkänd;2024;Nivå 0;2024-09-03 (hanlid);

Available from: 2024-09-03 Created: 2024-09-03 Last updated: 2025-02-01Bibliographically approved
Adewumi, T., Habib, N., Alkhaled, L. & Barney, E. (2024). Instruction Makes a Difference. In: Giorgos Sfikas; George Retsinas (Ed.), Document Analysis Systems: 16th IAPR International Workshop, DAS 2024, Athens, Greece, August 30–31, 2024, Proceedings. Paper presented at 16th IAPR International Workshop on Document Analysis Systems (DAS 2024), Athens, Greece, August 30-31, 2024 (pp. 71-88). Springer Science and Business Media Deutschland GmbH
Open this publication in new window or tab >>Instruction Makes a Difference
2024 (English)In: Document Analysis Systems: 16th IAPR International Workshop, DAS 2024, Athens, Greece, August 30–31, 2024, Proceedings / [ed] Giorgos Sfikas; George Retsinas, Springer Science and Business Media Deutschland GmbH , 2024, p. 71-88Conference paper, Published paper (Refereed)
Abstract [en]

We introduce the Instruction Document Visual Question Answering (iDocVQA) dataset and the Large Language Document (LLaDoc) model, for training Language-Vision (LV) models for document analysis and predictions on document images, respectively. Usually, deep neural networks for the DocVQA task are trained on datasets lacking instructions. We show that using instruction-following datasets improves performance. We compare performance across document-related datasets using the recent state-of-the-art (SotA) Large Language and Vision Assistant (LLaVA)1.5 as the base model. We also evaluate the performance of the derived models for object hallucination using the Polling-based Object Probing Evaluation (POPE) dataset. The results show that instruction-tuning performance ranges from 11x to 32x of zero-shot performance and from 0.1% to 4.2% over non-instruction (traditional task) finetuning. Despite the gains, these still fall short of human performance (94.36%), implying there’s much room for improvement.

Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH, 2024
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14994
Keywords
DocVQA, instruction-tuning, LLM, LMM
National Category
Computer Sciences Computer graphics and computer vision
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110169 (URN)10.1007/978-3-031-70442-0_5 (DOI)001334866300005 ()2-s2.0-85204640516 (Scopus ID)
Conference
16th IAPR International Workshop on Document Analysis Systems (DAS 2024), Athens, Greece, August 30-31, 2024
Funder
Knut and Alice Wallenberg Foundation
Note

Funder: Wallenberg AI, AutonomousSystems and Software Program (WASP)

ISBN for host publication: 978-3-031-70441-3, 978-3-031-70442-0

Available from: 2024-10-04 Created: 2024-10-04 Last updated: 2025-02-01Bibliographically approved
Liu, C., Corbillé, S. & Barney Smith, E. H. (2024). MOoSE: Multi-Orientation Sharing Experts for Open-Set Scene Text Recognition. In: Elisa H. Barney Smith; Marcus Liwicki; Liangrui Peng (Ed.), Document Analysis and Recognition, ICDAR 2024: 18th International Conference, Athens, Greece, August 30–September 4, 2024, Proceedings, Part V. Paper presented at 18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024 (pp. 93-110). Springer Science and Business Media Deutschland GmbH, 5
Open this publication in new window or tab >>MOoSE: Multi-Orientation Sharing Experts for Open-Set Scene Text Recognition
2024 (English)In: Document Analysis and Recognition, ICDAR 2024: 18th International Conference, Athens, Greece, August 30–September 4, 2024, Proceedings, Part V / [ed] Elisa H. Barney Smith; Marcus Liwicki; Liangrui Peng, Springer Science and Business Media Deutschland GmbH , 2024, Vol. 5, p. 93-110Conference paper, Published paper (Refereed)
Abstract [en]

Open-set text recognition, which aims to address both novel characters and previously seen ones, is one of the rising subtopics in the text recognition field. However, the current open-set text recognition solutions only focuses on horizontal text, which fail to model the real-life challenges posed by the variety of writing directions in real-world scene text. Multi-orientation text recognition, in general, faces challenges from diverse image aspect ratios, significant imbalance in data amount, and domain gaps between orientations. In this work, we first propose a Multi-Oriented Open-Set Text Recognition task (MOOSTR) to model the challenges of both novel characters and writing direction variety. We then propose a Multi-Orientation Sharing Experts (MOoSE) framework as a strong baseline solution. MOoSE uses a mixture-of-experts scheme to alleviate the domain gaps between orientations, while exploiting common structural knowledge among experts to alleviate the data scarcity that some experts face. The proposed MOoSE framework is validated by ablative experiments, and also tested for feasibility on an existing open-set text recognition benchmark. Code, models, and documents are available at: https://github.com/lancercat/Moose/

Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH, 2024
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14808
Keywords
Open-set text recognition, multi-orientation text recognition, incremental learning
National Category
Computer Systems
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-110170 (URN)10.1007/978-3-031-70549-6_6 (DOI)001336397200006 ()2-s2.0-85204644262 (Scopus ID)
Conference
18th International Conference on Document Analysis and Recognition (ICDAR 2024), Athens, Greece, August 30–September 4, 2024
Funder
The Kempe Foundations, CSMK23-0109
Note

Funder: Wallenberg AI, Autonomous Systems and Software Program (WASP);

ISBN for host publication: 978-3-031-70548-9, 978-3-031-70549-6

Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2024-12-17Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-2039-3844

Search in DiVA

Show all publications