System disruptions
We are currently experiencing disruptions on the search portals due to high traffic. We are working to resolve the issue, you may temporarily encounter an error message.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Early Onset Yellow Rust Detection Guided By Remote Sensing Indices
BASF Digital Farming GmbH.
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Embedded Internet Systems Lab. BASF Digital Farming GmbH.ORCID iD: 0000-0002-8294-6056
BASF Digital Farming GmbH.ORCID iD: 0000-0003-2529-8782
2022 (English)In: Remote Sensing, E-ISSN 2072-4292Article in journal (Refereed) Submitted
Abstract [en]

Early warning systems help combat crop diseases and enable sustainable plant protection by optimizing the use of resources. The application of remote sensing to detect plant diseases like Wheat stripe rust, commonly known as yellow rust is based on the presumption, that the presence of a disease has a direct link with the photosynthesis capability and physical structure of a plant at both canopy and tissue level. This causes changes to the solar radiation absorption capability and thus alters the reflectance spectrum. In comparison to existing methods and technologies, remote sensing offers access to near real time information at both field and regional scale to build robust disease models. This study shows the capability of multispectral images along with weather, in-situ  and phenology data to detect the onset of yellow rust disease.  Crop details and disease observation data from field trials across the globe spanning four years (2015-2018) are combined with weather data to model disease severity over time as a value between 0-1 with 0 being no disease and 1 being the highest infestation level. Various tree based ensemble algorithms like CatBoost, Random Forest and XGBoost has been experimented with. The XGBoost model performs best with a mean absolute error of 0.1568 and root mean square error of 0.2081. Being a fast-spreading disease and having caused epidemics in the past, the above model alone is not enough. It is important to detect yellow rust disease early so farmers can be warned in advance and favorable management practices can be implemented. Vegetation indices like NDVI, NDRE and NDWI from remote sensing images have been used as auxiliary features along with disease severity predictions over time derived from the previous step to detect early onset of the disease. A rule-based approach is presented that uses a combination of both model output and changes in vegetation indices to predict an early disease progression window. Analysis on test trials shows that in 80% of the cases, the predicted progression window was ahead of the first disease observation on the field offering an opportunity to take timely action that could save yield. 

Place, publisher, year, edition, pages
MDPI, 2022.
National Category
Other Earth Sciences Information Systems
Identifiers
URN: urn:nbn:se:ltu:diva-89078OAI: oai:DiVA.org:ltu-89078DiVA, id: diva2:1634207
Available from: 2022-02-01 Created: 2022-02-01 Last updated: 2025-02-01
In thesis
1. Data driven crop disease modeling
Open this publication in new window or tab >>Data driven crop disease modeling
2022 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The concept of precision farming deals with the creation and use of data from machinery and sensors on and off the field to optimize resources and sustainably intensify food production to keep up with increasing demand. However, in the face of a growing amount of data being collected, smarter data processing and analysis techniques are needed and have prompted the evaluation and incorporation of artificial intelligence (AI) and machine learning (ML) techniques for multiple use cases right from seeding to harvesting. One such use case that has yet to fully gauge the propositions of AI and ML is crop disease prediction. Since multiple biotic and abiotic factors could be responsible for the occurrence of a disease, modeling requires finding suitable data associated with these factors from multiple farms for an extended time frame and developing smarter models able to capture underlying relationships between them.

 

This thesis presents research conducted to develop data-driven methodologies and optimization approaches for building crop disease models. The objective is realized by breaking down the task into three modules: (i) data collection; (ii) data processing and model building; and finally, (iii) the maintenance of models in production.

 

The traditional data collection approach for disease modeling is through setting up of trials which is expensive and labor-intensive which prompted the evaluation of other novel and free to access data sources. Therefore, in module one two studies were conducted to assess the suitability of social media platforms and remote sensing products. The results show that social media is not a viable option yet due to limited geo-referenced data and ambiguity in categorizing the discussions. On the other hand, vegetation indices derived from multispectral satellite imagery, despite their high spatial granularity, are an interesting addition to the modeling pipeline.

 

Moving on to module two, a study was conducted to demonstrate the process of fusing and preparing data from multiple sources with different formats collected in an extended time frame to be used for model building. The study establishes the relevance of using advanced machine learning models such as deep learning in the prediction of crop diseases. The results show that given the appropriate data preparation process at the right data granularity and the use of some smart tricks, neural network-based models hold the potential to outperform widely used models such as XGBoost. Since neural networks offer advantages such as multimodal learning, transfer learning, and automated feature engineering, which are crucial in building scalable models with heterogeneous data and reduced human effort, the observations of this study led to a follow-up study. This study investigates neural network-based algorithms specifically designed for tabular data and compares them against popular tree ensemble-based models. Apart from acting as a comprehensive analysis of the two families of techniques the results showed that although neural network-based models were not able to outperform tree-based models, they achieved comparable results and allowed for the creation of easier and more accurate models for new diseases by application of transfer learning.

 

Climate change leads to unexpected weather events and modified disease occurrence patterns that cause static models to drift rapidly. Models need to be maintained to ensure they are performing as required. Capturing real-time data and triggering retraining when enough new data has been collected can help maintain models by acting as a feedback loop for model improvement. This was attempted by collecting crowd-sourced data from a disease recognition app, but it was not usable in its current form and required further annotation. Since annotations are expensive and time-consuming, a study for real-life agricultural data retrieval and large-scale annotation flow optimization based on similarity search technique is presented which significantly optimizes the annotation process.

 

The results derived from these studies are highly relevant for progressing the United Nations Sustainable Development Goal of Zero Hunger. It is also expected to ease farmers' anxiety related to yield loss due to crop diseases and enhance their capability of planning and scheduling management practices by giving them an early warning of disease occurrence. The results have been verified through comparison with traditional crop disease prediction methods and interaction with experienced agronomists working for a major AgTech company.

Place, publisher, year, edition, pages
Luleå: Luleå University of Technology, 2022
Series
Doctoral thesis / Luleå University of Technology 1 jan 1997 → …, ISSN 1402-1544
Keywords
Crop Disease Prediction, Precision Agriculture, Digital Farming, Artificial Intelligence, Machine learning
National Category
Computer Sciences Agricultural Science
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-89086 (URN)978-91-8048-024-6 (ISBN)978-91-8048-025-3 (ISBN)
Public defence
2022-03-29, A 109, Luleå, 09:00 (English)
Opponent
Supervisors
Available from: 2022-02-02 Created: 2022-02-01 Last updated: 2022-10-28Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records

Shankar, Priyamvada

Search in DiVA

By author/editor
Shankar, PriyamvadaMukhopadhaya, Sayan
By organisation
Embedded Internet Systems Lab
In the same journal
Remote Sensing
Other Earth SciencesInformation Systems

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 232 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf