Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Characterizing Failure and Repair Time of Servers in a Hyper-scale Data Center
Luleå University of Technology, Department of Engineering Sciences and Mathematics, Energy Science.ORCID iD: 0000-0001-5870-0112
Luleå University of Technology, Department of Engineering Sciences and Mathematics, Energy Science.ORCID iD: 0000-0003-4443-7653
Luleå University of Technology, Department of Engineering Sciences and Mathematics, Energy Science.ORCID iD: 0000-0003-4074-9529
2020 (English)In: Proceedings of 2020 IEEE PES Innovative Smart Grid Technologies Europe (ISGT-Europe) 26-28 October, 2020, IEEE, 2020, p. 660-664Conference paper, Published paper (Refereed)
Abstract [en]

Hyper-scale data centers are used to host cloud computing interfaces to support the increasing demand for storage and computational resources. For achieving specific service level agreements (SLA), this infrastructure demands highly available cloud computing systems. It is necessary to analyze the server failure incidents to determine the way of improving the reliability of the system since the computational interruption causes financial losses for the data center owners. Regarding the reliability analysis, it is important to characterize the time to failure and time to repair of the servers. In this paper, a publicly available data set from Google cloud-cluster data center will be analyzed to find the distribution function for the time to failure and the time to repair for the servers in a cloud based data centers.

Place, publisher, year, edition, pages
IEEE, 2020. p. 660-664
Keywords [en]
availability, reliability, cloud clustering, data center
National Category
Other Electrical Engineering, Electronic Engineering, Information Engineering
Research subject
Electric Power Engineering
Identifiers
URN: urn:nbn:se:ltu:diva-80260DOI: 10.1109/ISGT-Europe47291.2020.9248891Scopus ID: 2-s2.0-85097336258OAI: oai:DiVA.org:ltu-80260DiVA, id: diva2:1455303
Conference
10th IEEE PES Innovative Smart Grid Technologies Europe (ISGT-Europe 2020), 26-28 October, 2020, Delft, The Netherlands (Virtual)
Note

ISBN för värdpublikation: 978-1-7281-7100-5

Available from: 2020-07-23 Created: 2020-07-23 Last updated: 2023-09-05Bibliographically approved
In thesis
1. On the Energy Efficiency and Reliability of Data Centers in Operation
Open this publication in new window or tab >>On the Energy Efficiency and Reliability of Data Centers in Operation
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The new generation information technology (IT) services like mobile Internet, Internet of things (IoT), cloud computing, processing of big data, applications of artificial intelligence, etc. are becoming popular with the development of the information and communication technology (ICT) industry. In this industry, the dependency on the data centers is also increasing to ensure the quality of services (QoS). Thus, the energy consumption of the data centers is increasing with the increasing demand for computational resources in it because the load sections of the data center with sensitive equipment run $24$ hours a day, $365$ days of the year. Regarding data center operation, it is becoming a technical challenge to make a trade-off between reducing the energy consumption to limit the operational costs and ensuring higher reliability of the data center.

A way to help data center operators to cope with the posed challenges is by identifying the ``right size of the computational resource'', considering the power losses and service availability of the data center. This endeavor requires power consumption models that can consider different load sections with different types of equipment. The power consumption models of the load sections can address the electrical load demand and the power losses, especially losses in the internal power conditioning system (IPCS). On the other hand, the service availability of the data center mainly depends on the availability of the computational resources like servers and on the availability of the power supply through the IPCS. It is important to characterize the servers' failure and repair times to develop the stochastic model of the server unavailability in operation. The availability of adequate power supply through the IPCS depends on its component failures and the power supply capacity of its components. The bottleneck of the power supply capacity of the IPCS is subjected to the power losses of the equipment in the IPCS. Additionally, the voltage disturbances like voltage dips and swells in the IPCS also interrupt the power supply units (PSUs) of the servers, which also degrades the QoS of the data center.

The outcomes of this thesis can be synthesized as follows: 1) A comparative analysis of the energy consumption models of the major load sections in the data center, and an analysis of the impact of the power losses in the IPCS on the outage probability of the servers. 2) Reliability indices to assess the adequacy of the computational resources in the data center considering the outages of power supplies and the servers in operation. 3) The impacts of voltages disturbances in the IPCS on the power supply outages, hence on the interruptions of servers. 4) An analysis of the trade-off between the energy efficiency and reliability in operational planning of the data center.

Place, publisher, year, edition, pages
Luleå: Luleå tekniska universitet, 2023. p. 210
Series
Doctoral thesis / Luleå University of Technology 1 jan 1997 → …, ISSN 1402-1544
National Category
Other Electrical Engineering, Electronic Engineering, Information Engineering
Research subject
Electric Power Engineering
Identifiers
urn:nbn:se:ltu:diva-96599 (URN)978-91-8048-307-0 (ISBN)978-91-8048-308-7 (ISBN)
Public defence
2023-06-16, Hörsal A, Luleå tekniska universitet, Skellefteå, 09:00 (English)
Opponent
Supervisors
Funder
Swedish Energy Agency, 24559
Available from: 2023-04-17 Created: 2023-04-16 Last updated: 2023-09-05Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Ahmed, Kazi Main UddinAlvarez, ManuelBollen, Math H. J.

Search in DiVA

By author/editor
Ahmed, Kazi Main UddinAlvarez, ManuelBollen, Math H. J.
By organisation
Energy Science
Other Electrical Engineering, Electronic Engineering, Information Engineering

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 601 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf