System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Learning the Approach During the Short-loading Cycle Using Reinforcement Learning
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab.ORCID iD: 0000-0002-4716-9765
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Computer Science. Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab.ORCID iD: 0000-0001-5408-0008
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab.ORCID iD: 0000-0001-5662-825x
(English)Manuscript (preprint) (Other academic)
Abstract [en]

The short-loading cycle is a repetitive task performed in high quantities, making it a great alternative for automation. In the short-loading cycle, an expert operator navigates towards a pile, fills the bucket with material, navigates to a dump truck, and dumps the material into the tipping body. The operator has to balance the productivity goal while minimising the fuel usage, to maximise the overall efficiency of the cycle. In addition, difficult interactions, such as the tyre-to-surface interaction further complicate the cycle. These types of hard-to-model interactions that can be difficult to address with rule-based systems, together with the efficiency requirements, motivate us to examine the potential of data-driven approaches. In this paper, the possibility of teaching an agent through reinforcement learning to approach a dump truck's tipping body and get in position to dump material in the tipping body is examined. The agent is trained in a 3D simulated environment to perform a simplified navigation task. The trained agent is directly transferred to a real vehicle, to perform the same task, with no additional training. The results indicate that the agent can successfully learn to navigate towards the dump truck with a limited amount of control signals in simulation and when transferred to a real vehicle, exhibits the correct behaviour. 

National Category
Robotics and automation Computer Sciences
Research subject
Robotics and Artificial Intelligence
Identifiers
URN: urn:nbn:se:ltu:diva-101851OAI: oai:DiVA.org:ltu-101851DiVA, id: diva2:1808156
Projects
Visionsbaserad Automatisk Lastning och DumpningAvailable from: 2023-10-30 Created: 2023-10-30 Last updated: 2025-02-05
In thesis
1. Towards Deep-learning-based Autonomous Navigation in the Short-loading Cycle
Open this publication in new window or tab >>Towards Deep-learning-based Autonomous Navigation in the Short-loading Cycle
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Earth-moving machines, such as wheel loaders, are a type of heavy-duty machinery used within the construction industry to perform vital tasks, such as digging, transporting, and mining applications. One of these tasks is the short-loading cycle, where an operator manoeuvres the wheel loader to move material from a pile to the tipping body of a dump truck, through navigation, scooping, and dumping. The short-loading cycle is a repetitive task performed in high quantities, often as part of a larger refinement process, making it interesting for automation.

The main objective of this thesis work is to investigate challenges facing the automation of the short-loading cycle, focusing in particular on subtasks that can be efficiently addressed with deep learning methods. A secondary objective is to examine how alternative development paths, such as scale models, or simulations, can be used to enable data-driven automation of the short-loading cycle, as directly experimenting on real vehicles has a high associated cost when large numbers of timesteps are needed to gather enough data.

To investigate the two objectives, the literature is systematically reviewed to identify research gaps, challenges, and the usage of deep learning techniques. Secondly, a set of deep learning techniques is investigated to address perception and actuation problems identified as challenging and important for the automation of the short-loading cycle.

The investigation of deep learning techniques involves training and validating a realtime object detector neural network to identify key components (wheels, tipping body, and cab) on a scale model dump truck while testing on a real vehicle. This resulted in a localisation and classification degradation of only 14% between the scale model and the real dump truck, with no additional training. In addition, an examination to minimize the annotation workload of humans found that it is possible to decrease the workload by 95% while still retaining similar detection performance by leveraging linear interpolation.

Lastly, this thesis presents an investigation regarding the usage of reinforcement learning for navigation during the short-loading cycle. The results indicate that training the agent in simulation is currently required as the agent obtains the maximum reward after timesteps in the order of millions before being capable of performing the task. The results suggest that the trained agent is capable of bridging the gap between simulation and reality to complete a simplified version of the navigation task during the short-loading cycle.

The experiments presented in this thesis provide proof of concept that indicates deep learning techniques can aid in the realisation of an autonomous solution. Moreover, the results show that development paths allowing for experiments providing large numbers of timesteps can facilitate the practical use of such techniques.

Place, publisher, year, edition, pages
Luleå: Luleå University of Technology, 2023
Series
Doctoral thesis / Luleå University of Technology 1 jan 1997 → …, ISSN 1402-1544
National Category
Computer Sciences Robotics and automation
Research subject
Cyber-Physical Systems
Identifiers
urn:nbn:se:ltu:diva-102486 (URN)978-91-8048-442-8 (ISBN)978-91-8048-443-5 (ISBN)
Public defence
2024-01-30, E632, Luleå tekniska universitet, Luleå, 09:00 (English)
Opponent
Supervisors
Available from: 2023-11-17 Created: 2023-11-17 Last updated: 2025-02-05Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

https://arxiv.org/abs/2406.13366

Authority records

Bodin, UlfSandin, Fredrik

Search in DiVA

By author/editor
Borngrund, CarlBodin, UlfSandin, FredrikAndreasson, Henrik
By organisation
Embedded Internet Systems LabComputer Science
Robotics and automationComputer Sciences

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 175 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf