Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Seagrass classification using unsupervised curriculum learning (UCL)
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab.ORCID iD: 0000-0002-5922-7889
Centre for AI and ML, School of Science, Edith Cowan University, Joondalup, WA, Australia.
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab.ORCID iD: 0000-0002-0546-116X
Centre for AI and ML, School of Science, Edith Cowan University, Joondalup, WA, Australia.
Show others and affiliations
2024 (English)In: Ecological Informatics, ISSN 1574-9541, E-ISSN 1878-0512, Vol. 83, article id 102804Article in journal (Refereed) Published
Abstract [en]

Seagrass ecosystems are pivotal in marine environments, serving as crucial habitats for diverse marine species and contributing significantly to carbon sequestration. Accurate classification of seagrass species from underwater images is imperative for monitoring and preserving these ecosystems. This paper introduces Unsupervised Curriculum Learning (UCL) to seagrass classification using the DeepSeagrass dataset. UCL progressively learns from simpler to more complex examples, enhancing the model's ability to discern seagrass features in a curriculum-driven manner. Experiments employing state-of-the-art deep learning architectures, convolutional neural networks (CNNs), show that UCL achieved overall 90.12 % precision and 89 % recall, which significantly improves classification accuracy and robustness, outperforming some traditional supervised learning approaches like SimCLR, and unsupervised approaches like Zero-shot CLIP. The methodology of UCL involves four main steps: high-dimensional feature extraction, pseudo-label generation through clustering, reliable sample selection, and fine-tuning the model. The iterative UCL framework refines CNN's learning of underwater images, demonstrating superior accuracy, generalization, and adaptability to unseen seagrass and background samples of undersea images. The findings presented in this paper contribute to the advancement of seagrass classification techniques, providing valuable insights into the conservation and management of marine ecosystems. The code and dataset are made publicly available and can be assessed here: https://github.com/nabid69/Unsupervised-Curriculum-Learning—UCL.

 

Place, publisher, year, edition, pages
Elsevier B.V. , 2024. Vol. 83, article id 102804
Keywords [en]
Seagrass, Deep learning, Unsupervised classification, Curriculum learning, Unsupervised curriculum learning, Underwater digital imaging
National Category
Computer Sciences Computer Vision and Robotics (Autonomous Systems)
Research subject
Machine Learning
Identifiers
URN: urn:nbn:se:ltu:diva-109778DOI: 10.1016/j.ecoinf.2024.102804ISI: 001307982900001Scopus ID: 2-s2.0-85202895926OAI: oai:DiVA.org:ltu-109778DiVA, id: diva2:1896065
Note

Validerad;2024;Nivå 2;2024-09-09 (hanlid);

Full text license: CC BY

Available from: 2024-09-09 Created: 2024-09-09 Last updated: 2024-12-20Bibliographically approved
In thesis
1. Unsupervised Curriculum Learning Case Study: Earth Observation UCL4EO
Open this publication in new window or tab >>Unsupervised Curriculum Learning Case Study: Earth Observation UCL4EO
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Earth Observation (EO) data is crucial for understanding, managing, and conserving our planet's ecosystem and its natural resources. This data enables humanity to monitor environmental changes, such as natural disasters, urban growth, and climate shifts, assisting informed decisions and proactive measures. Early EO heavily relied on statistical methods and expert domain knowledge, but the advent of machine learning has revolutionized EO data processing, enhancing efficiency and accuracy. Conventional ML models require expensive and labor-intensive data labeling. In contrast, unsupervised ML techniques can learn features from data without the need for manual labeling, making the process more efficient and cost-effective.

 

This thesis presents a UCL approach utilizing advanced DL models to classify EO data, referred to as UCL4EO. This approach eliminates the need for manual data labeling in training the DL model. The UCL framework comprises i) a DL model tailored for feature extraction from image data, ii) a clustering method to group deep features, and iii) a selection operation to capture representative samples from these clusters. The CNN extracts meaningful features from images, subjected to a clustering algorithm to create pseudo-labels. After identifying the initial clusters, representative samples from each cluster are chosen using the UCL selection operation to fine-tune the feature extractor. The stated process is repeated iteratively until convergence. The proposed UCL approach progressively learns and incorporates salient data features in an unsupervised manner by utilizing pseudo-labels.

 

UCL started as a proof of concept to show the viability of the method for binary classification on RS and aerial imagery. Specifically, the UCL framework is employed to identify water bodies using three RGB datasets, encompassing both low and high-resolution RS and aerial imagery. While UCL has been extensively examined with RGB imagery, it has been adapted to benefit from the enhanced capabilities of multi-spectral satellite imagery. This adaptation enables UCL to generalize to multi-spectral imagery from Sentinel-2 to detect forest fires in Australia. UCL undergoes subsequent improvements and is further investigated to identify utility poles in high-resolution UAV images. These gray-scale images of utility poles pose computer vision challenges, including issues like occlusion and cropping, where a significant portion of the image contains the background and only a slight appearance of the utility pole. Extensive experimentation on the mentioned tasks effectively showcases UCL's adaptive learning capabilities, producing promising results. The achieved accuracy surpassed those of supervised methods in cross-domain adaptation on similar tasks, underscoring the effectiveness of the proposed algorithm.

 

The scope of UCL has been extended to encompass multi-class classification tasks in the domain of RS data, referred to as Multi-class UCL. Multi-class UCL progressively acquires knowledge about various categories on multi-scale resolution. To investigate Multi-class UCL, we have used four publicly available datasets of Sentinel-2 and aerial imagery: EuroSAT, SAT-6, UCMerced, and RSSCN7. Comprehensive experiments conducted on the above-mentioned datasets revealed better cross-domain adaptation capabilities compared to supervised methods, thereby demonstrating the effectiveness of Multi-class UCL.

 

In these investigations, two datasets are generated using Sentinel-2 satellite imagery: one for water bodies - PakSAT and the other for Australian forest fires. However, cloud cover poses a significant challenge by obstructing the satellite's ability to capture clear images of the Earth's surface. To address this issue, available cloud masking techniques are employed to filter out images affected by cloud cover, ensuring the datasets contain only clear and usable data. Later, this thesis examines cloud detection and Cloud Optical Thickness (COT) estimation from Sentinel-2 imagery. We employed machine-learning techniques, achieving better performance than SCL designed by ESA for cloud cover tasks.

 

In addition to the application in RS data, UCL has been investigated in other domains of EO, such as undersea imagery. Furthermore, UCL has also been used for tasks like natural scene classification, medical imaging, and document analysis, demonstrating its versatility and broad applicability. Further exploration of UCL could involve improving the process of generating pseudo-labels through deep learning techniques.

Place, publisher, year, edition, pages
Luleå: Luleå University of Technology, 2024
Series
Doctoral thesis / Luleå University of Technology 1 jan 1997 → …, ISSN 1402-1544
Keywords
UCL, Earth Observation, EO, Remote Sensing, RS, Computer VIsion, Deep Learning, Unsupervised Learning
National Category
Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-109974 (URN)978-91-8048-632-3 (ISBN)978-91-8048-633-0 (ISBN)
Public defence
2024-11-08, E632, Luleå University of Technology, Luleå, 09:00 (English)
Opponent
Supervisors
Available from: 2024-09-16 Created: 2024-09-14 Last updated: 2024-10-18Bibliographically approved

Open Access in DiVA

fulltext(8067 kB)61 downloads
File information
File name FULLTEXT01.pdfFile size 8067 kBChecksum SHA-512
50fae0a6496dda54f677f26e390e908932ddbcb922a99790b5ff791319e31e94249e70c81e44419e2a09a471654c276fa33329ac0dd572d5bab2a36ee25eea2a
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Abid, NosheenKovács, GyörgyAdewumi, TosinLiwicki, Marcus

Search in DiVA

By author/editor
Abid, NosheenKovács, GyörgyAdewumi, TosinLiwicki, Marcus
By organisation
Embedded Internet Systems Lab
In the same journal
Ecological Informatics
Computer SciencesComputer Vision and Robotics (Autonomous Systems)

Search outside of DiVA

GoogleGoogle Scholar
Total: 61 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 259 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf