Planned maintenance
A system upgrade is planned for 10/12-2024, at 12:00-13:00. During this time DiVA will be unavailable.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
AutoDiagn: An Automated Real-time Diagnosis Framework for Big Data Systems
Newcastle University, United Kingdom; Bartin University, Turkey.
Newcastle University, United Kingdom.
Newcastle University, United Kingdom; Taibah University, Saudi Arabia.
Luleå University of Technology, Department of Computer Science, Electrical and Space Engineering, Computer Science.ORCID iD: 0000-0003-3489-7429
Show others and affiliations
2022 (English)In: IEEE Transactions on Computers, ISSN 0018-9340, E-ISSN 1557-9956, Vol. 71, no 5, p. 1035-1048Article in journal (Refereed) Published
Abstract [en]

Big data processing systems, such as Hadoop and Spark, usually work on large-scale, highly-concurrent, and multi-tenant environments that can easily cause hardware and software malfunctions or failures, thereby leading to performance degradation. Several systems and methods exist to detect big data processing systems' performance degradation, perform root-cause analysis, and even overcome the issues causing such degradation. However, these solutions focus on specific problems such as straggler and inefficient resource utilization. There is a lack of a generic and extensible framework to support the real-time diagnosis of big data systems. In this paper, we propose, develop and validate AutoDiagn. This generic and flexible framework provides holistic monitoring of a big data system while detecting performance degradation and enabling root-cause analysis. We present the implementation and evaluation of AutoDiagn that interacts with a Hadoop cluster deployed on a public cloud and tested with real-world benchmark applications. Experimental results show that AutoDiagn has a small resource footprint, high throughput and low latency.

Place, publisher, year, edition, pages
USA: IEEE, 2022. Vol. 71, no 5, p. 1035-1048
Keywords [en]
Root-cause analysis, Big data systems, QoS, Hadoop, Performance
National Category
Computer Systems
Research subject
Pervasive Mobile Computing
Identifiers
URN: urn:nbn:se:ltu:diva-83436DOI: 10.1109/TC.2021.3070639ISI: 000778905700004Scopus ID: 2-s2.0-85103783969OAI: oai:DiVA.org:ltu-83436DiVA, id: diva2:1540405
Note

Validerad;2022;Nivå 2;2022-04-19 (sofila);

Funder: Turkish Ministry of National Education; UKRI (EP/T021985/1, EP/R033293/1, EP/T022582/1); National Natural Science Foundation of China (62072408); Zhejiang Provincial Natural Science Foundation ofChina (LY20F020030)

Available from: 2021-03-29 Created: 2021-03-29 Last updated: 2023-03-28Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Mitra, Karan

Search in DiVA

By author/editor
Mitra, Karan
By organisation
Computer Science
In the same journal
IEEE Transactions on Computers
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 353 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf