Andmebaasi logo
Valdkonnad ja kollektsioonid
Kogu ADA
Eesti
English
Deutsch
  1. Esileht
  2. Sirvi autori järgi

Sirvi Autor "Mykhailenko, Yaroslava" järgi

Tulemuste filtreerimiseks trükkige paar esimest tähte
Nüüd näidatakse 1 - 1 1
  • Tulemused lehekülje kohta
  • Sorteerimisvalikud
  • Laen...
    Pisipilt
    listelement.badge.dso-type Kirje ,
    Optimization of Battery Energy Storage System in the Estonian Energy Markets using Reinforcement Learning
    (Tartu Ülikool, 2025) Mykhailenko, Yaroslava; Cabral Pinheiro, Victor Henrique, juhendaja; Matiisen, Tambet, juhendaja; Scellier, Jean-Baptiste, juhendaja; Tartu Ülikool. Loodus- ja täppisteaduste valdkond; Tartu Ülikool. Arvutiteaduse instituut
    The integration of renewable energy sources into electricity markets has increased the need for efficient energy management solutions. Battery Energy Storage Systems (BESS) help balance fluctuating supply and demand by storing excess energy and supplying it during shortages, so determining an optimal charge-discharge schedule becomes a core optimization task for market participants. Currently, Eesti Energia addresses this task with linear optimization techniques that are effective yet limited, because they cannot fully capture nonlinear battery dynamics or adapt to complex market patterns. This thesis explores the potential of model-free reinforcement learning (RL) algorithms, specifically Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization (PPO), to optimize battery trading strategies in Estonia's day-ahead electricity market. The primary objective is to assess whether RL models can potentially surpass traditional linear optimization benchmarks in terms of profitability and decision quality. Results demonstrate that in a six-hour trading horizon the optimized DDPG agent consistently outperformed PPO and closely approached the performance of the linear optimizer, capturing 85.5% of its profit. When extended to a full 24-hour horizon, it relative performance fell to 65%. Qualitative analysis of evaluation logs confirmed market-aware behavior, with the agent charging when prices were low, discharging near peaks, and preserving capacity for anticipated high-price periods. Overall, the findings suggest that, when carefully tuned, model-free RL can provide a competitive alternative to linear optimization for battery trading in volatile electricity markets, with the potential to account for nonlinear battery aging and integrate multi-market signals. This lays the groundwork for future applications in even more dynamic settings such as balancing markets.

DSpace tarkvara autoriõigus © 2002-2025 LYRASIS

  • Teavituste seaded
  • Saada tagasisidet