Operational message
There are currently operational disruptions. Troubleshooting is in progress.
Change search
Link to record
Permanent link

Direct link
Alternative names
Publications (10 of 12) Show all publications
Brännvall, R. (2024). The Inhibitor: ReLU and Addition-Based Attention for Efficient Transformers (Student Abstract). In: Michael Wooldridge; Jennifer Dy; Sriraam Natarajan (Ed.), Proceedings of the 38th AAAI Conference on Artificial Intelligence: . Paper presented at 38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, February 20-27, 2024 (pp. 23445-23446). AAAI Press, 38
Open this publication in new window or tab >>The Inhibitor: ReLU and Addition-Based Attention for Efficient Transformers (Student Abstract)
2024 (English)In: Proceedings of the 38th AAAI Conference on Artificial Intelligence / [ed] Michael Wooldridge; Jennifer Dy; Sriraam Natarajan, AAAI Press, 2024, Vol. 38, p. 23445-23446Conference paper, Oral presentation with published abstract (Refereed)
Place, publisher, year, edition, pages
AAAI Press, 2024
National Category
Computer and Information Sciences Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-105094 (URN)10.1609/aaai.v38i21.30422 (DOI)2-s2.0-85189627116 (Scopus ID)
Conference
38th AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, February 20-27, 2024
Note

ISBN for host publication: 978-1-57735-887-9; 

Available from: 2024-04-15 Created: 2024-04-15 Last updated: 2025-10-21Bibliographically approved
Brännvall, R., Gustafsson, J. & Sandin, F. (2023). Modular and Transferable Machine Learning for Heat Management and Reuse in Edge Data Centers. Energies, 16(5), Article ID 2255.
Open this publication in new window or tab >>Modular and Transferable Machine Learning for Heat Management and Reuse in Edge Data Centers
2023 (English)In: Energies, E-ISSN 1996-1073, Vol. 16, no 5, article id 2255Article in journal (Refereed) Published
Abstract [en]

This study investigates the use of transfer learning and modular design for adapting a pretrained model to optimize energy efficiency and heat reuse in edge data centers while meeting local conditions, such as alternative heat management and hardware configurations. A Physics-Informed Data-Driven Recurrent Neural Network (PIDD RNN) is trained on a small scale-model experiment of a six-server data center to control cooling fans and maintain the exhaust chamber temperature within safe limits. The model features a hierarchical regularizing structure that reduces the degrees of freedom by connecting parameters for related modules in the system. With a RMSE value of 1.69, the PIDD RNN outperforms both a conventional RNN (RMSE: 3.18), and a State Space Model (RMSE: 2.66). We investigate how this design facilitates transfer learning when the model is fine-tuned over a few epochs to small dataset from a second set-up with a server located in a wind tunnel. The transferred model outperforms a model trained from scratch over hundreds of epochs.

Place, publisher, year, edition, pages
MDPI, 2023
Keywords
edge data center, heat management, heat reuse, meta-learning, modular machine learning, recurrent neural network, transfer learning, transferable machine learning
National Category
Other Mechanical Engineering
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-96185 (URN)10.3390/en16052255 (DOI)000948094200001 ()2-s2.0-85149781307 (Scopus ID)
Funder
Vinnova
Note

Validerad;2023;Nivå 2;2023-03-20 (joosat);

Licens fulltext: CC BY License

Available from: 2023-03-20 Created: 2023-03-20 Last updated: 2025-10-21Bibliographically approved
Adewumi, O., Brännvall, R., Abid, N., Pahlavan, M., Sabah Sabry, S., Liwicki, F. & Liwicki, M. (2022). Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning. In: Sigurd Løkse, Benjamin Ricaud (Ed.), Proceedings of the Northern Lights Deep Learning Workshop 2022: . Paper presented at Northern Lights Deep Learning Conference, (NLDL 2022), Tromsø, Norway, January 10-12, 2022. Septentrio Academic Publishing, 3
Open this publication in new window or tab >>Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning
Show others...
2022 (English)In: Proceedings of the Northern Lights Deep Learning Workshop 2022 / [ed] Sigurd Løkse, Benjamin Ricaud, Septentrio Academic Publishing , 2022, Vol. 3Conference paper, Published paper (Refereed)
Abstract [en]

Building open-domain conversational systems (or chatbots) that produce convincing responses is a recognized challenge. Recent state-of-the-art (SoTA) transformer-based models for the generation of natural language dialogue have demonstrated impressive performance in simulating human-like, single-turn conversations in English.This work investigates, by an empirical study, the potential for transfer learning of such models to Swedish language. DialoGPT, an English language pre-trained model, is adapted by training on three different Swedish language conversational datasets obtained from publicly available sources: Reddit, Familjeliv and the GDC. Perplexity score (an automated intrinsic metric) and surveys by human evaluation were used to assess the performances of the fine-tuned models. We also compare the DialoGPT experiments with an attention-mechanism-based seq2seq baseline model, trained on the GDC dataset. The results indicate that the capacity for transfer learning can be exploited with considerable success. Human evaluators asked to score the simulated dialogues judged over 57% of the chatbot responses to be human-like for the model trained on the largest (Swedish) dataset. The work agrees with the hypothesis that deep monolingual models learn some abstractions which generalize across languages. We contribute the codes, datasets and model checkpoints and host the demos on the HuggingFace platform.

Place, publisher, year, edition, pages
Septentrio Academic Publishing, 2022
Series
Proceedings of the Northern Lights Deep Learning Workshop, ISSN 2703-6928
Keywords
Conversational Systems, Chatbots, Dialogue, DialoGPT, Swedish
National Category
Natural Language Processing Computer Sciences
Research subject
Machine Learning
Identifiers
urn:nbn:se:ltu:diva-90163 (URN)10.7557/18.6231 (DOI)
Conference
Northern Lights Deep Learning Conference, (NLDL 2022), Tromsø, Norway, January 10-12, 2022
Available from: 2022-04-13 Created: 2022-04-13 Last updated: 2025-10-21Bibliographically approved
Rizk, A., Seidelin, C., Kovács, G., Liwicki, M. & Brännvall, R. (2021). Defining Beneficiaries of Emerging Data Infrastructures Towards Effective Data Appropriation: Insights from the Swedish Space Data Lab. In: Audrius Lopata; Daina Gudonienė; Rita Butkienė (Ed.), Information and Software Technologies: . Paper presented at 27th International Conference on Information and Software Technologies (ICIST 2021), Kaunas, Lithuania, October 14-16, 2021 (pp. 32-47). Springer
Open this publication in new window or tab >>Defining Beneficiaries of Emerging Data Infrastructures Towards Effective Data Appropriation: Insights from the Swedish Space Data Lab
Show others...
2021 (English)In: Information and Software Technologies / [ed] Audrius Lopata; Daina Gudonienė; Rita Butkienė, Springer, 2021, p. 32-47Conference paper, Published paper (Refereed)
Abstract [en]

The increasing collection and usage of data and data analytics has prompted development of Data Labs. These labs are (ideally) a way for multiple beneficiaries to make use of the same data in ways that are value-generating for all. However, establishing data labs requires the mobilization of various infrastructural elements, such as beneficiaries, offerings and needed analytics talent, all of which are ambiguous and uncertain. The aim of this paper is to examine how such beneficiaries can be identified and understood for the nascent Swedish space data lab. The paper reports on the development of persona descriptions that aim to support and represent the needs of key beneficiaries of earth observation data. Our main results include three thorough persona descriptions that represent the lab’s respective beneficiaries and their distinct characteristics. We discuss the implications of the personas on addressing the infrastructural challenges, as well as the lab’s design. We conclude that personas provide emerging data labs with relatively stable beneficiary archetypes that supports the further development of the other infrastructure components. More research is needed to better understand how these persona descriptions may evolve, as well as how they may influence the continuous development process of the space data lab.

Place, publisher, year, edition, pages
Springer, 2021
Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 1486
Keywords
Beneficiary, Data appropriation, Data infrastructure, Data lab, Persona, Data Analytics, Earth observation data, Mobilisation, Space data, Swedishs, Laboratories
National Category
Computer Sciences
Research subject
Information systems; Machine Learning
Identifiers
urn:nbn:se:ltu:diva-88259 (URN)10.1007/978-3-030-88304-1_3 (DOI)000869711400003 ()2-s2.0-85118139059 (Scopus ID)
Conference
27th International Conference on Information and Software Technologies (ICIST 2021), Kaunas, Lithuania, October 14-16, 2021
Note

ISBN för värdpublikation: 978-3-030-88303-4, 978-3-030-88304-1

Available from: 2021-12-09 Created: 2021-12-09 Last updated: 2025-10-21Bibliographically approved
Brännvall, R., Öhman, J., Kovács, G. & Liwicki, M. (2020). Cross-Encoded Meta Embedding towards Transfer Learning. In: ESANN 2020 - Proceedings: . Paper presented at 28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning,2-4 October, 2020, Bruges, Belgium (Online) (pp. 631-636). ESANN
Open this publication in new window or tab >>Cross-Encoded Meta Embedding towards Transfer Learning
2020 (English)In: ESANN 2020 - Proceedings, ESANN , 2020, p. 631-636Conference paper, Published paper (Refereed)
Abstract [en]

In this paper we generate word meta-embeddings from already existing embeddings using cross-encoding. Previous approaches can only work with words that exist in each source embedding, while the architecture presented here drops this requirement. We demonstrate the method using two pre-trained embeddings, namely GloVE and FastText. Furthermore, we propose additional improvements to the training process of the meta-embedding. Results on six standard tests for word similarity show that the meta-embedding trained outperforms the original embeddings. Moreover, this performance can be further increased with the proposed improvements, resulting in a competitive performance with those reported earlier.

Place, publisher, year, edition, pages
ESANN, 2020
National Category
Computer Sciences Other Electrical Engineering, Electronic Engineering, Information Engineering Applied Mechanics
Research subject
Machine Learning; Electronic systems; Experimental Mechanics
Identifiers
urn:nbn:se:ltu:diva-80637 (URN)2-s2.0-85099006558 (Scopus ID)
Conference
28th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning,2-4 October, 2020, Bruges, Belgium (Online)
Note

ISBN för värdpublikation: 978-2-87587-074-2

Available from: 2020-08-31 Created: 2020-08-31 Last updated: 2025-10-22Bibliographically approved
Brännvall, R., Mattson, L., Lundmark, E. & Vesterlund, M. (2020). Data Center Excess Heat Recovery: A Case Study of Apple Drying. In: Ryohei Yokoyama, Yoshiharu Amano (Ed.), ECOS 2020: Proceedings of the 33rd International Conference on Efficiency, Cost, Optimization, Simulation and Enviromental Impact of Energy Systems. Paper presented at 33rd International Conference on Efficiency, Cost, Optimization, Simulation and Enviromental Impact of Energy Systems, 29 June - 3 July, 2020, Osaka, Japan (pp. 2165-2174). ECOS 2020 Local Organizing Committee
Open this publication in new window or tab >>Data Center Excess Heat Recovery: A Case Study of Apple Drying
2020 (English)In: ECOS 2020: Proceedings of the 33rd International Conference on Efficiency, Cost, Optimization, Simulation and Enviromental Impact of Energy Systems / [ed] Ryohei Yokoyama, Yoshiharu Amano, ECOS 2020 Local Organizing Committee , 2020, p. 2165-2174Conference paper, Published paper (Refereed)
Abstract [en]

Finding synergies between heat producing and heat consuming actors in an economy provides opportunity for more efficient energy utilization and reduction of overall power consumption. We propose to use low-grade heat recovered from data centers directly in food processing industries, for example for the drying of fruit and berries. This study analyses how the heat output of industrial IT-load on servers can dry apples in a small-scale experimental set up.To keep the temperatures of the server exhaust airflow near a desired set-point we use a model predictive controller (MPC) re-purposed to the drying experiment set-up from a previous work that used machine learning models for cluster thermal management. Thus, conditions with for example 37 C for 8 hours drying can be obtained with results very similar to conventional drying of apples.The proposed solution increases the value output of the electricity used in a data center by capturing and using the excess heat that would otherwise be exhausted. The results from our experiments show that drying foods with excess heat from data center is possible with potential of strengthening the food processing industry and contribute to food self-sufficiency in northern Sweden.

Place, publisher, year, edition, pages
ECOS 2020 Local Organizing Committee, 2020
Keywords
Data center, Waste heat recovery, Industrial symbiosis, Drying process, Self-sufficiency
National Category
Energy Systems
Research subject
Electronic systems
Identifiers
urn:nbn:se:ltu:diva-78336 (URN)2-s2.0-85095775160 (Scopus ID)
Conference
33rd International Conference on Efficiency, Cost, Optimization, Simulation and Enviromental Impact of Energy Systems, 29 June - 3 July, 2020, Osaka, Japan
Available from: 2020-06-23 Created: 2020-06-23 Last updated: 2025-10-22Bibliographically approved
Brännvall, R., Siltala, M., Gustafsson, J., Sarkinen, J., Vesterlund, M. & Summers, J. (2020). EDGE: Microgrid Data Center with Mixed Energy Storage. In: e-Energy '20: Proceedings of the Eleventh ACM International Conference on Future Energy Systems. Paper presented at 11th ACM International Conference on Future Energy Systems (ACM e-Energy 2020), 22-26 June, 2020, Virtual Event, Australia (pp. 466-473). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>EDGE: Microgrid Data Center with Mixed Energy Storage
Show others...
2020 (English)In: e-Energy '20: Proceedings of the Eleventh ACM International Conference on Future Energy Systems, Association for Computing Machinery (ACM), 2020, p. 466-473Conference paper, Published paper (Refereed)
Abstract [en]

Low latency requirements are expected to increase with 5G telecommunications driving data and compute to EDGE data centers located in cities near to end users.

This article presents a testbed for such data centers that has been built at RISE ICE Datacenter in northern Sweden in order to perform full stack experiments on load balancing, cooling, micro-grid interactions and the use of renewable energy sources. This system is described with details on both hardware components and software implementations used for data collection and control. A use case for off-grid operation is presented to demonstrate how the test lab can be used for experiments on edge data center design, control and autonomous operation.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2020
National Category
Computer Sciences Other Electrical Engineering, Electronic Engineering, Information Engineering
Research subject
Electronic Systems
Identifiers
urn:nbn:se:ltu:diva-79951 (URN)10.1145/3396851.3402656 (DOI)001555676100060 ()2-s2.0-85088503483 (Scopus ID)
Conference
11th ACM International Conference on Future Energy Systems (ACM e-Energy 2020), 22-26 June, 2020, Virtual Event, Australia
Note

ISBN för värdpublikation: 978-1-4503-8009-6

Available from: 2020-06-23 Created: 2020-06-23 Last updated: 2025-10-22Bibliographically approved
Sarkinen, J., Brännvall, R., Gustafsson, J. & Summers, J. (2020). Experimental Analysis of Server Fan Control Strategies for Improved Data Center Air-based Thermal Management. In: Proceedings of the Nineteenth InterSociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems: ITherm 2020. Paper presented at 19th InterSociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems (ITherm 2020), 21-23 July, 2020, Virtual conference (pp. 341-349). IEEE
Open this publication in new window or tab >>Experimental Analysis of Server Fan Control Strategies for Improved Data Center Air-based Thermal Management
2020 (English)In: Proceedings of the Nineteenth InterSociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems: ITherm 2020, IEEE, 2020, p. 341-349Conference paper, Published paper (Other academic)
Abstract [en]

This paper analyzes the prospects of a holistic air-cooling strategy that enables synchronisation of data center facility fans and server fans to minimize data center energy use. Each server is equipped with a custom circuit board which controls the fans using a proportional, integral and derivative (PID) controller running on the servers operating system to maintain constant operating temperatures, irrespective of environmental conditions or workload. Experiments are carried out in a server wind tunnel which is controlled to mimic data center environmental conditions. The wind tunnel fan, humidifier and heater are controlled via separate PID controllers to maintain a prescribed pressure drop across the server with air entering at a defined temperature and humidity. The experiments demonstrate server operating temperatures which optimally trade off power losses versus server fan power, while examining the effect on the temperature difference, ∆T. Furthermore the results are theoretically applied to a direct fresh air cooled data center to obtain holistic sweet spots for the servers, revealing that the minimum energy use is already attained by factory control. Power consumption and Power Usage Effectiveness (PUE) are also compared, confirming that decreasing the PUE can increase the overall data center power consumption. Lastly the effect of decreased server inlet temperatures is examined showing that lower inlet temperatures can reduce both energy consumption and PUE.

Place, publisher, year, edition, pages
IEEE, 2020
Series
IEEE Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems (ITherm), ISSN 1087-9870, E-ISSN 2577-0799
Keywords
server thermal management, holistic data center cooling control, energy efficiency, current leakage, data center heat reuse, power usage effectiveness (PUE), server fan control
National Category
Control Engineering
Research subject
Control Engineering
Identifiers
urn:nbn:se:ltu:diva-78335 (URN)10.1109/ITherm45881.2020.9190337 (DOI)000701365300047 ()2-s2.0-85091784050 (Scopus ID)
Conference
19th InterSociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems (ITherm 2020), 21-23 July, 2020, Virtual conference
Note

Konferensbidraget har tidigare förekommit som manuskript i avhandling.

Available from: 2020-06-23 Created: 2020-06-23 Last updated: 2026-02-11Bibliographically approved
Brännvall, R. (2020). Machine learning based control of small-scale autonomous data centers. (Licentiate dissertation). Luleå: Luleå University of Technology
Open this publication in new window or tab >>Machine learning based control of small-scale autonomous data centers
2020 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

The low-latency requirements of 5G are expected to increase the demand for distributeddata storage and computing capabilities in the form of small-scale data centers (DC)located at the edge, near the interface between mobile and wired networks. These edgeDC will likely be of modular and standardized designs, although configurations, localresource constraints, environments and load profiles will vary and thereby increase theDC infrastructure diversity. Autonomy and energy efficiency are key objectives for thedesign, configuration and control of such data centers. Edge DCs are (by definition)decentralized and should continue operating without human intervention in the presenceof disturbances, such as intermittent power failures, failing components and overheating.Automatic control is also required for efficient use of renewable energy, batteries and theavailable communication, computing and data storage capacity.

These objectives demand data-driven models of the internal thermal and electricprocesses of an autonomous edge DC, since the resources required to manually defineand optimize the models for each DC would be prohibitive. In this thesis machinelearning methods that are implemented in a modular design are evaluated for thermalcontrol of such modular DCs. Experiments with small server clusters are presented, whichwere performed in order to investigate what parameters that are important in the designof advanced control strategies for autonomous edge DC. Furthermore, recent transferlearning results are discussed to understand how to develop data driven models thatcan be deployed to modular DC in varying configurations and environmental contextswithout training from scratch.

The first study demonstrates how a data driven thermal model for a small clusterof servers can be calibrated to sensor data and used for constructing a model predictivecontroller for the server cooling fan. The experimental investigations of cooling fancontrol continues in the next study which explores operational sweet-spots and energyefficient holistic control strategies. The machine learning based controller from the firststudy is then re-purposed to maintain environmental conditions in an exhaust chamberfavourable for drying apples, as part of a practical study how excess heat produced bycomputation can be used in the food processing industry. A fourth study describes theRISE EDGE lab - a test bed for small data centers - built with the intention to exploreand evaluate related technologies for micro-grids with renewable energy and batteries,5G connectivity and coolant storage. Finally the last work presented develops the modelfrom the first study towards an application for thermal based load balancing.

Place, publisher, year, edition, pages
Luleå: Luleå University of Technology, 2020
Series
Licentiate thesis / Luleå University of Technology, ISSN 1402-1757
National Category
Computer Systems Control Engineering
Research subject
Control Engineering
Identifiers
urn:nbn:se:ltu:diva-78337 (URN)978-91-7790-623-0 (ISBN)978-91-7790-624-7 (ISBN)
Presentation
2020-09-03, A109, Luleå tekniska universitet, Luleå, 10:00 (English)
Opponent
Supervisors
Available from: 2020-06-29 Created: 2020-06-23 Last updated: 2025-10-22Bibliographically approved
Siltala, M., Brännvall, R., Gustafsson, J. & Zhou, Q. (2020). Physical and Data-Driven Models for Edge Data Center Cooling System. In: 2020 Swedish Workshop on Data Science (SweDS): . Paper presented at 8th Swedish Workshop on Data Science (SweDS20), Luleå, Sweden, October 29-30, 2020. IEEE
Open this publication in new window or tab >>Physical and Data-Driven Models for Edge Data Center Cooling System
2020 (English)In: 2020 Swedish Workshop on Data Science (SweDS), IEEE, 2020Conference paper, Published paper (Refereed)
Place, publisher, year, edition, pages
IEEE, 2020
National Category
Other Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
urn:nbn:se:ltu:diva-81882 (URN)10.1109/SweDS51247.2020.9275588 (DOI)2-s2.0-85099112779 (Scopus ID)
Conference
8th Swedish Workshop on Data Science (SweDS20), Luleå, Sweden, October 29-30, 2020
Funder
Vinnova, 17002
Note

ISBN för värdpublikation: 978-1-7281-9204-8

Available from: 2020-12-07 Created: 2020-12-07 Last updated: 2025-10-22Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-4293-6408

Search in DiVA

Show all publications