Advanced Data Analytics Modelling for Air Quality Assessment
Luleå tekniska universitet, Institutionen för system- och rymdteknik.
2023 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis
Abstract [en]

Air quality assessment plays a crucial role in understanding the impact of air pollution on human health and the environment. With the increasing demand for accurate assessment and prediction of air quality, advanced data analytics modelling techniques offer promising solutions. This thesis focuses on leveraging advanced data analytics to assess and analyse air pollution concentration levels in Italy at a 4 km resolution using the FORAIR_IT dataset simulated at ENEA on the CRESCO6 infrastructure, aiming to uncover valuable insights and identify the most appropriate AI models for predicting air pollution levels. The data collection, understanding, and pre-processing procedures are discussed, followed by the application of big data training and forecasting using Apache Spark MLlib. The research also encompasses different phases, including descriptive and inferential analysis to understand the air pollution concentration dataset, hypothesis testing to examine the relationship between various pollutants, machine learning prediction using several regression models and an ensemble machine learning approach, and time series analysis on the entire dataset as well as three major regions in Italy (Northern Italy – Lombardy, Central Italy – Lazio and Southern Italy – Campania). The computation time for these regression models is also evaluated and a comparative analysis is done on the results obtained. The evaluation process and the experimental setup involve the usage of the ENEAGRID/CRESCO6 HPC Infrastructure and Apache Spark. This research has provided valuable insights into understanding air pollution patterns and improving prediction accuracy. The findings of this study have the potential to drive positive change in environmental management and decision-making processes, ultimately leading to healthier and more sustainable communities.
As we continue to explore the vast possibilities offered by advanced data analytics, this research serves as a foundation for future advancements in air quality assessment in Italy, and the models are transferable to other regions and provinces in Italy, paving the way for a cleaner and greener future.

Place, publisher, year, edition, pages
2023. p. 156
Keywords [en]
Air quality assessment, Advanced Data Analytics, Artificial Intelligence (AI), Machine Learning (ML), Big Data, Regression Models, Time Series Models, High Performance Computing (HPC), Air Pollution
National Category
Identifiers
URN: urn:nbn:se:ltu:diva-101490
OAI: oai:DiVA.org:ltu-101490
DiVA, id: diva2:1801320
External cooperation
ENEA Casaccia Research Center, Italy; Leeds Beckett University, United Kingdom
Subject / course
Student thesis, at least 30 credits
Educational program
Master Programme in Green Networking and Cloud Computing
Presentation
2023-06-14, Municipality City Hall, Anacapri, Italy, Anacapri, 15:00 (English)
Supervisors
Examiner
Available from: 2023-10-06 Created: 2023-09-29 Last updated: 2023-10-09. Bibliographically approved
