Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards Deep-learning-based Autonomous Navigation in the Short-loading Cycle
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab.ORCID iD: 0000-0002-4716-9765
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Earth-moving machines, such as wheel loaders, are a type of heavy-duty machinery used within the construction industry to perform vital tasks, such as digging, transporting, and mining applications. One of these tasks is the short-loading cycle, where an operator manoeuvres the wheel loader to move material from a pile to the tipping body of a dump truck, through navigation, scooping, and dumping. The short-loading cycle is a repetitive task performed in high quantities, often as part of a larger refinement process, making it interesting for automation.

The main objective of this thesis work is to investigate challenges facing the automation of the short-loading cycle, focusing in particular on subtasks that can be efficiently addressed with deep learning methods. A secondary objective is to examine how alternative development paths, such as scale models, or simulations, can be used to enable data-driven automation of the short-loading cycle, as directly experimenting on real vehicles has a high associated cost when large numbers of timesteps are needed to gather enough data.

To investigate the two objectives, the literature is systematically reviewed to identify research gaps, challenges, and the usage of deep learning techniques. Secondly, a set of deep learning techniques is investigated to address perception and actuation problems identified as challenging and important for the automation of the short-loading cycle.

The investigation of deep learning techniques involves training and validating a realtime object detector neural network to identify key components (wheels, tipping body, and cab) on a scale model dump truck while testing on a real vehicle. This resulted in a localisation and classification degradation of only 14% between the scale model and the real dump truck, with no additional training. In addition, an examination to minimize the annotation workload of humans found that it is possible to decrease the workload by 95% while still retaining similar detection performance by leveraging linear interpolation.

Lastly, this thesis presents an investigation regarding the usage of reinforcement learning for navigation during the short-loading cycle. The results indicate that training the agent in simulation is currently required as the agent obtains the maximum reward after timesteps in the order of millions before being capable of performing the task. The results suggest that the trained agent is capable of bridging the gap between simulation and reality to complete a simplified version of the navigation task during the short-loading cycle.

The experiments presented in this thesis provide proof of concept that indicates deep learning techniques can aid in the realisation of an autonomous solution. Moreover, the results show that development paths allowing for experiments providing large numbers of timesteps can facilitate the practical use of such techniques.

Place, publisher, year, edition, pages
Luleå: Luleå University of Technology, 2023.
Series
Doctoral thesis / Luleå University of Technology 1 jan 1997 → …, ISSN 1402-1544
National Category
Computer Sciences Robotics and automation
Research subject
Cyber-Physical Systems
Identifiers
URN: urn:nbn:se:ltu:diva-102486ISBN: 978-91-8048-442-8 (print)ISBN: 978-91-8048-443-5 (electronic)OAI: oai:DiVA.org:ltu-102486DiVA, id: diva2:1812757
Public defence
2024-01-30, E632, Luleå tekniska universitet, Luleå, 09:00 (English)
Opponent
Supervisors
Available from: 2023-11-17 Created: 2023-11-17 Last updated: 2025-02-05Bibliographically approved
List of papers
1. Machine Vision for Construction Equipment by Transfer Learning with Scale Models
Open this publication in new window or tab >>Machine Vision for Construction Equipment by Transfer Learning with Scale Models
2020 (English)In: 2020 International Joint Conference on Neural Networks (IJCNN), IEEE, 2020, article id 21108Conference paper, Published paper (Refereed)
Abstract [en]

Machine vision is required by autonomous heavy construction equipment to navigate and interact with the environment. Wheel loaders need the ability to identify different objects and other equipment to perform the task of automatically loading and dumping material on dump trucks, which can be achieved using deep neural networks. Training such networks from scratch requires the iterative collection of potentially large amounts of video data, which is challenging at construction sites because of the complexity of safely operating heavy equipment in realistic environments. Transfer learning, for which pretrained neural networks can be retrained for use at construction sites, is thus attractive, especially if data can be acquired without full-scale experiments. We investigate the possibility of using scalemodel data for training and validating two different pretrained networks and use real-world test data to examine their generalization capability. A dataset containing 268 images of a 1:16 scale model of a Volvo A60H dump truck is provided, as well as 64 test images of a full-size Volvo A25G dump truck. The code and dataset are publicly available 1 . The networks, both pretrained on the MS-COCO dataset, were fine-tuned to the created dataset, and the results indicate that both networks can learn the features of the scale-model dump truck (validation mAP of 0.82 for YOLOv3 and 0.95 for RetinaNet). Both networks can transfer these learned features to detect objects on a full-size dump truck with no additional training (test mAP of 0.70 for YOLOv3 and 0.79 for RetinaNet).

Place, publisher, year, edition, pages
IEEE, 2020
Series
International Joint Conference on Neural Networks (IJCNN), ISSN 2161-4393, E-ISSN 2161-4407
Keywords
construction equipment, automation, computer vision, deep learning, machine learning
National Category
Embedded Systems
Research subject
Cyber-Physical Systems
Identifiers
urn:nbn:se:ltu:diva-81008 (URN)10.1109/IJCNN48605.2020.9207577 (DOI)000626021407089 ()2-s2.0-85093872275 (Scopus ID)
Conference
2020 International Joint Conference on Neural Networks (IJCNN), 19-24 July, 2020, Glasgow, United Kingdom
Note

ISBN för värdpublikation: 978-1-7281-6926-2, 978-1-7281-6927-9

Available from: 2020-10-02 Created: 2020-10-02 Last updated: 2023-11-17Bibliographically approved
2. Deep-learning-based vision for earth-moving automation
Open this publication in new window or tab >>Deep-learning-based vision for earth-moving automation
2022 (English)In: Automation in Construction, ISSN 0926-5805, E-ISSN 1872-7891, Vol. 133, article id 104013Article, review/survey (Refereed) Published
Abstract [en]

Earth-moving machines are heavy-duty vehicles designed for construction operations involving earthworks. The tasks performed by such machines typically involve navigation and interaction with materials such as soil, gravel, and blasted rock. Skilled operators use a combination of visual, sound, tactile and possibly motion feedback to perform tasks efficiently. We survey the literature in this research area and analyse the relative importance of different sensor system modalities focusing on deep-learning-based vision and automation for the short-cycle loading task. This is a common and repetitive task that is attractive to automate. The analysis indicates that computer vision, in combination with onboard sensors, is more critical than coordinate-based positioning. Furthermore, we find that data-driven approaches, in general, have high potential in terms of productivity, adaptability, versatility and wear and tear with respect to automation system solutions. The main knowledge gaps identified relate to loading non-fine heterogeneous material and navigation during loading and unloading.

Place, publisher, year, edition, pages
Elsevier, 2022
Keywords
Automation, Short-cycle Loading, Construction equipment, Computer vision, Machine learning
National Category
Computer Sciences
Research subject
Cyber-Physical Systems; Machine Learning
Identifiers
urn:nbn:se:ltu:diva-87710 (URN)10.1016/j.autcon.2021.104013 (DOI)000716642800006 ()2-s2.0-85118161462 (Scopus ID)
Note

Validerad;2021;Nivå 2;2021-11-01 (johcin)

Available from: 2021-11-01 Created: 2021-11-01 Last updated: 2023-11-17Bibliographically approved
3. Semi-Automatic Video Frame Annotation for Construction Equipment Automation Using Scale-Models
Open this publication in new window or tab >>Semi-Automatic Video Frame Annotation for Construction Equipment Automation Using Scale-Models
2021 (English)In: IECON 2021 – 47th Annual Conference of the IEEE Industrial Electronics Society, IEEE, 2021Conference paper, Published paper (Refereed)
Abstract [en]

Data collection and annotation is a time consuming and costly process, yet necessary for machine vision. Automation of construction equipment relies on seeing and detecting different objects in the vehicle’s surroundings. Construction equipment is commonly used to perform frequent repetitive tasks, which are interesting to automate. An example of such a task is the short-loading cycle, where the material is moved from a pile into the tipping body of a dump truck for transport. To complete this task, the wheel loader needs to have the capability to locate the tipping body of the dump truck. The machine vision system also allows the vehicle to detect unforeseen dangers such as other vehicles and more importantly human workers. In this work, we investigate the viability to perform semi-automatic annotation of video data using linear interpolation. The data is collected using scale-models mimicking a wheel-loaders approach towards a dump truck during the short-loading cycle. To measure the viability of this type of solution, the workload is compared to the accuracy of the model, YOLOv3. The results indicate that it is possible to maintain the performance while decreasing the annotation workload by about 95%. This is an interesting result for this application domain, as safety is critical and retaining the vision system performance is more important than decreasing the annotation workload. The fact that the performance seems to retain with a large workload decrease is an encouraging sign.

Place, publisher, year, edition, pages
IEEE, 2021
Keywords
Autonomous construction equipment, semi-automatic annotation, video-stream data, object detection, linear interpolation
National Category
Computer graphics and computer vision
Research subject
Cyber-Physical Systems; Machine Learning
Identifiers
urn:nbn:se:ltu:diva-87866 (URN)10.1109/iecon48115.2021.9589255 (DOI)000767230601030 ()2-s2.0-85119470296 (Scopus ID)
Conference
47th Annual Conference of the IEEE Industrial Electronics Society (IECON 2021), 13-16 Oct. 2021,Toronto, ON, Canada
Note

ISBN för värdpublikation:978-1-6654-3554-3, 978-1-6654-0256-9

Available from: 2021-11-11 Created: 2021-11-11 Last updated: 2025-02-07Bibliographically approved
4. Autonomous Navigation of Wheel Loaders using Task Decomposition and Reinforcement Learning
Open this publication in new window or tab >>Autonomous Navigation of Wheel Loaders using Task Decomposition and Reinforcement Learning
2023 (English)In: 2023 IEEE 19th International Conferenceon Automation Science and Engineering (CASE), IEEE, 2023Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
IEEE, 2023
Series
IEEE International Conference on Automation Science and Engineering (CASE), ISSN 2161-8070, E-ISSN 2161-8089
National Category
Robotics and automation
Research subject
Cyber-Physical Systems; Machine Learning
Identifiers
urn:nbn:se:ltu:diva-101844 (URN)10.1109/CASE56687.2023.10260481 (DOI)2-s2.0-85174394497 (Scopus ID)979-8-3503-2069-5 (ISBN)979-8-3503-2070-1 (ISBN)
Conference
2023 IEEE 19th International Conference on Automation Science and Engineering (CASE), 26-30 August 2023, Auckland, New Zealand
Note

Funder: Sweden’s Innovation Agency and the VALD project (no. 2021-05035);

Available from: 2023-10-30 Created: 2023-10-30 Last updated: 2025-02-09Bibliographically approved
5. Automating the Short-Loading Cycle: Survey and Integration Framework
Open this publication in new window or tab >>Automating the Short-Loading Cycle: Survey and Integration Framework
2024 (English)In: Applied Sciences, ISSN 2076-3417, Vol. 14, no 11, article id 4674Article in journal (Refereed) Published
Abstract [en]

The short-loading cycle is a construction task where a wheel loader scoops material from a nearby pile in order to move that material to the tipping body of a dump truck. The short-loading cycle is a vital task performed in high quantities and is often part of a more extensive never-ending process to move material for further refinement. This, together with the highly repetitive nature of the short-loading cycle, makes it a suitable candidate for automation. However, the short-loading cycle is a complex task where the mechanics of the wheel loader together with the interaction between the wheel loader and the environment needs to be considered. This must be achieved while maintaining some productivity goal and, concurrently, minimizing the used energy. The main objective of this work is to analyze the short-loading cycle, assess the current state of research in this field, and discuss the steps required to progress towards a minimal viable product consisting of individual automation solutions that can perform the short-loading cycle well enough to be used by early adopters. This is achieved through a comprehensive literature study and consequent analysis of the review results. From this analysis, the requirements of an MVP are defined and some gaps which are currently hindering the realization of the MVP are presented.

Place, publisher, year, edition, pages
MDPI, 2024
Keywords
short-loading cycle, automation, wheel loader, construction, data-driven approaches
National Category
Robotics and automation Computer Sciences
Research subject
Cyber-Physical Systems; Machine Learning
Identifiers
urn:nbn:se:ltu:diva-101849 (URN)10.3390/app14114674 (DOI)001245643100001 ()2-s2.0-85195976956 (Scopus ID)
Note

Validerad;2024;Nivå 2;2024-07-05 (joosat);

Full text: CC BY License;

Funder: Sweden’s Innovation Agency (grant number 2021-05035);

This article has previously appeared as a manuscript in a thesis.

Available from: 2023-10-30 Created: 2023-10-30 Last updated: 2025-02-05Bibliographically approved
6. Learning the Approach During the Short-loading Cycle Using Reinforcement Learning
Open this publication in new window or tab >>Learning the Approach During the Short-loading Cycle Using Reinforcement Learning
(English)Manuscript (preprint) (Other academic)
Abstract [en]

The short-loading cycle is a repetitive task performed in high quantities, making it a great alternative for automation. In the short-loading cycle, an expert operator navigates towards a pile, fills the bucket with material, navigates to a dump truck, and dumps the material into the tipping body. The operator has to balance the productivity goal while minimising the fuel usage, to maximise the overall efficiency of the cycle. In addition, difficult interactions, such as the tyre-to-surface interaction further complicate the cycle. These types of hard-to-model interactions that can be difficult to address with rule-based systems, together with the efficiency requirements, motivate us to examine the potential of data-driven approaches. In this paper, the possibility of teaching an agent through reinforcement learning to approach a dump truck's tipping body and get in position to dump material in the tipping body is examined. The agent is trained in a 3D simulated environment to perform a simplified navigation task. The trained agent is directly transferred to a real vehicle, to perform the same task, with no additional training. The results indicate that the agent can successfully learn to navigate towards the dump truck with a limited amount of control signals in simulation and when transferred to a real vehicle, exhibits the correct behaviour. 

National Category
Robotics and automation Computer Sciences
Research subject
Robotics and Artificial Intelligence
Identifiers
urn:nbn:se:ltu:diva-101851 (URN)
Projects
Visionsbaserad Automatisk Lastning och Dumpning
Available from: 2023-10-30 Created: 2023-10-30 Last updated: 2025-02-05

Open Access in DiVA

fulltext(2704 kB)393 downloads
File information
File name FULLTEXT01.pdfFile size 2704 kBChecksum SHA-512
e5fc33ff94daea2cbe152f03597dcc7be314abeb97fab6b94499e40b8c137e02d852cad96241f8afba2b53fc2de6b5b8b07fad64be469479f51dc66888b5c180
Type fulltextMimetype application/pdf

Authority records

Borngrund, Carl

Search in DiVA

By author/editor
Borngrund, Carl
By organisation
Embedded Internet Systems Lab
Computer SciencesRobotics and automation

Search outside of DiVA

GoogleGoogle Scholar
Total: 393 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1283 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf