EUTHERMO

Scalable deep reinforcement learning algorithms for building climate control and energy management

Started
November 1, 2021
Status
Completed
Share this project

Abstract

Controlling indoor comfort and optimizing energy flows in modern buildings has become an increasingly complex, integrating photovoltaics, batteries or electric vehicle chargers. Yet, effective building energy management remains a critical challenge, as buildings contribute to 32% of global primary energy use and a significant share of greenhouse gas emissions.

While traditional rule-based controllers are limited by their manual tuning and performance, advanced methods like Model Predictive Control (MPC) require costly and complex modeling, limiting their widespread adoption. Reinforcement learning (RL), particularly deep RL, has emerged as a promising alternative for building control due to its ability to handle system complexity and adapt to dynamic environments. However, RL applications remain largely confined to simulations, with limited real-world deployments.

To bridge this gap, Euthermo project focuses on developing scalable and transferable deep RL algorithms for building energy management. These methods are validated on the NEST demonstration building at the Empa campus. Additionally, the integration of linked data to enhance transfer learning and scalability is explored.

People

Collaborators

SDSC Team:
Carl Remlinger
Benjamín Béjar Haro

PI | Partners:

Empa, Urban Energy Systems Laboratory:

  • Dr. Bratislav Svetozarevic
  • Dr. James Allan
  • Philipp Heer

More info

description

Motivation

This project addresses the inadequacy of existing control strategies, both conventional (i.e. industrial) ones and advanced ones based on physical models, to provide optimal control performance of modern buildings with integrated energy generation, storage, and EV charging capacities. The project aspires to deliver practical solutions with high technology readiness while contributing to the broader fields of machine learning and automatic control.

  • Objective 1: performance of the learning control algorithms.
  • Objective 2: scalability and transferability of learning control algorithms.

Proposed Approach / Solution

The proposed solution involves a modular reinforcement learning (RL) agent designed to handle the complexity of modern building energy systems and ensure easy transferability across buildings. To train the agent effectively, a physically consistent neural network is employed to model the building environment and dynamics. This environment model serves as a training simulator, allowing the RL agent to safely explore and learn optimal control strategies.  Ultimately, the approach is validated under real-world conditions on the NEST demonstration building at Empa (see Figure 1). An illustrative example over a two-day episode showing room temperature, with heating control signals and weather conditions is provided Figure 2.

Impact

The outcomes of this project can accelerate the adoption of technologies that enhance occupant comfort while reducing energy consumption in buildings. They also facilitate the integration and management of renewable energy sources and electric vehicles within building systems.

Figure 1: UMAR unit, a residential unit of the Empa building NEST in Dübendorf, Switzerland. (a) NEST building. (b) Plan of UMAR unit. (c) Internal view of UMAR unit.
Figure 2: Temperature of room 274 over a 48-hour period, with heating panel valve openings as control signals and weather conditions as external inputs. The black dashed line indicates the comfort threshold.

Gallery

Annexe

Cover image source: Empa.ch

Additional resources

Bibliography

  1. Svetozarevic, B., Baumann, C., Muntwiler, S., Di Natale, L., Zeilinger, M. N., & Heer, P. (2022). Data-driven control of room temperature and bidirectional EV charging using deep reinforcement learning: Simulations and experiments. Applied Energy, 307, 118127.
  2. Di Natale, L., Svetozarevic, B., Heer, P., & Jones, C. N. (2021, November). Deep Reinforcement Learning for room temperature control: a black-box pipeline from data to policies. In Journal of Physics: Conference Series (Vol. 2042, No. 1, p. 012004). IOP Publishing.
  3. Di Natale, L., Svetozarevic, B., Heer, P., & Jones, C. N. (2022). Physically consistent neural networks for building thermal modeling: theory and analysis. Applied Energy, 325, 119806.
  4. Di Natale, L., Svetozarevic, B., Heer, P., & Jones, C. N. (2022, June). Near-optimal deep reinforcement learning policies from data for zone temperature control. In 2022 IEEE 17th International Conference on Control & Automation (ICCA) (pp. 698-703). IEEE.

Publications

Related Pages

More projects

MAGNIFY

In Progress
Machine learning Assisted larGe scale quaNtIfication of building energy FlexibilitY
Energy, Climate & Environment

SPI-GreenFjord

In Progress
Energy, Climate & Environment

SPI-PAMIR

In Progress
Energy, Climate & Environment

TREMA

Completed
Transforming real estate management with AI
Engineering

News

Latest news

First National Calls: 50 selected projects to start in 2025
March 12, 2025

First National Calls: 50 selected projects to start in 2025

First National Calls: 50 selected projects to start in 2025

50 proposals were selected through the review processes of the SDSC's first National Calls.
AIXD | Generative AI toolbox for architects and engineers
January 22, 2025

AIXD | Generative AI toolbox for architects and engineers

AIXD | Generative AI toolbox for architects and engineers

Introducing AIXD (AI-eXtended Design), a toolbox for forward and inverse modeling for exhaustive design exploration.
Smartair | An active learning algorithm for real-time acquisition and regression of flow field data
May 1, 2024

Smartair | An active learning algorithm for real-time acquisition and regression of flow field data

Smartair | An active learning algorithm for real-time acquisition and regression of flow field data

We’ve developed a smart solution for wind tunnel testing that learns as it works, providing accurate results faster. It provides an accurate mean flow field and turbulence field reconstruction while shortening the sampling time.

Contact us

Let’s talk Data Science

Do you need our services or expertise?
Contact us for your next Data Science project!