EAt electrical enerpower distribution networks, energy is consumed at the same time as it is generated, since its storage is unfeasible on large scale. Therefore, short-term lload forecasting is required by Transmission System Operators (TSO) to manage energy grid operations and supplies. Load forecasting is needed to manage energy operationss are required very often everyday, then a lot of prediction calculations are executed during every forecasting interval. In Europe, owing to directives and new technologies, prediction systems will be on achange from hour to quarter-hour basiintervals. Therefore, a predictive system may not dispose of have sufficient time to compute all future forecasts. Prediction systems perform calculations throughout the day, calculating the same forecasts repeatedly as the predictedThis article shows a tested solution to the computation time approaches. However, there are forecasts that are no more problem, while increasing accurate than others that have already been made. If previous focy of the forecasts are used preferentially over these, then computational burden will be saved while accuracy increases. In this way, it will be possible to optimize the sing system. The solution is based on making a schedule of future quarter-hour systems and fulfill the execution time limits. A proposed algorithm estimates, which establishes which forecasts provide greater accuracy than previous ones, and then it makes a forecasting schedulecalculations must be executed at each moment of the day.
Short-term load forecasting (STLF) is requsired to manage the production and, distribution of electricity, and and economic operateions of electricity markets energy from the current hour to the following days. If the electricity oEvery Transmission System Operator overestimates future electricity load, there(TSO) needs accurate energy forecasts, otherwise it will besuffer extra production costs that will lead to economic losses. On the other hand, if the electricity demand is underestimated, power plants may not have sufficient reserves for their generators to meet the energy demanded by the grid, compromising its stability and risking the possibility of a blackoutwith corresponding economic losses. In addition, an accurate demand forecast indirectly facilitates the management of s allow for managing electrical energy from renewable energies. In addition to electricity operators, otOther entities benefit from accurate electricity load forecastrequire load forecasts to manage future operations, such as power marketers, independent system operators, or load aggregators.
Forecasting electricity load is a complex problem, which has been approached through various methods and from different points of view over the last few decades, which has led to a great variety of forecasting algorithms implemented in different electricity networks.
Many techniques are based on neural networks [1] [2] [3] [4] [5]. Other algorithms use statistical methods [6] [7]. Hybrid systems that combine neural networks with other techniques are also common [8] [9] [10] [11].
TSTLF systems calculate forecastsOs require fast and frequently most likely hourly. The system must obtain fast f forecasts, so the operator can to read the results and manage the actions that adjust to future load. The latest measurement systems tend to use quarter-hour intervals. The STLF systems that currently work wWhen operators with hourly intervals will considerably increase their computational burden, whenever they chan change to quarter-hour intervals. T, they will have to forecast four times more values due to increased granularity and of intervals per day, they will also do this four times more often, due to an increase in frequency because of the reduced available time. This paperarticle addresses the problem of computational burden while attempting to increase the increasing accuracy of the STLF systems already implemented.
The Spanish transmission system operator (TSO) is TSO, Red Eléctrica de España (REE) and it , is working on hourly intervals. It needs forecasts for 19 electrical regions, which is 19 times more calculations than a single STLF, with a total of 2.5 s for every hour that is predicted nationwide system. Due to time limits for submitting predictions, it is not always feasible to calculate all future hours. Therefore, there is a schedule that determines which future intervals are predicted during each hour of the day. However, this schedule was not made with a reasoned basis.
The REE forecasting system cannot keep the previous forecasting schedule with quarter-hour intervals, since it is too computationally heavy to work within the new time restriction. Therefore, there is need for it needs a new schedule to forecast only the most useful intervals. Furthermore, the new schedule must be based on a criterion that numerically determines which predic A numerical criterion is required to define the usefulness of forecasts, in order to design a method which decides the better calculations are most useful. Satisfying the need for a to execute. As a result of that need, the main motivation of this work is to make a systematic method to optimize schedules is the main motivation of this work.
IPreviously, it was agenerally assumed that as we approach to the forecast moment in time, the forecast becomes more accurate. However, this hypothesis does not always hold true. There is an obvious trend in which accuracy increases as more recent inform, since the information available (weather and load) has more correlation (weather anwith the forecasted load) becomes available. Nevertheless, sometimes, there seem to be some periods in which new forecasts are actually less accurate. If . However, this hypothesis does not always hold true. Sometimes, predictions calculated in the past are more accurate than recent ones. If the accuracy loss periods can be known in advance, then the unproductive forecasts made at these times can be sparcanceled, saving computational effort while achieving a more accurate forecastand gaining accuracy. This work aims to determine the optima
Al schedule of f forecasts so that the system only computes new forecasts when an accuracy improvement is expected.
The foreca calculations must needs to be computed within a time limit; therefore, each computer has a limit of N hourly values, beyond which the forecast would arrive lanumber N of maximum predictions to compute. This limit depends on thavailable time allowed and the calculnd computation speed, which again depends on the computer itself and on the forecasting algorithm used. In order to select the best N forecasts that can be calculated at each moment, a method to prioritize them needs to be developed. In addition, even if all predictions can be calculated, they may be counterproductive, since some of them have a greater predictive error than some of the previous oneslarger error than previous ones. The paper related to this article describes an algorithm that makes the optimal schedule of forecasts, so that the system only computes new forecasts when an accuracy improvement is expected.
The STLF field is extensive, since innumerable works have been published for decades; consequently, reviews of the state of the art have been published, such as those made by Mamum et al. [12], Hippert et al. [13], or Hong et al. [14].
The previous work on the STLF system used in this paper [15] compared the autoregressive and neural models used. The research defined which one performs better in different contexts, being determined by the model configuration, availability of data and the use of exogenous variables. On the other hand, the proposed rer research took into account the performance of the sameers model[15] for different time lapses.
Other researchers [16] [17] [18] [19] [20] [21] [22] built and compared different STLF mathematical models employing error measures as performance indicators. After that, they did not consider how to apply those models in an optimized schedule, to avoid producing larger errors than past predictions that had already been calculated. J. Mohammed et al. [2322] did something similar, which also included reliability indicators to assess the model’s performance.
Another example is the work carried out by G. Veljanovski et al. [2423], in which they proposed a forecasting system based on a neural network. They did not consider the best time at which to obtain data and execute the computation.
Weyermüller et al. [2524] built a minimalistic adaptive neuro-fuzzy inference model. It forecasts the load of one hour 24 h before, so this research could be applied to organize the calculation schedule if more forecast hours are added to the model.
The present work could complement automatic forecasting systems, since it offers an automated extra step at the end of the modeling process, in order to obtain an optimized execution schedule. An example of automatically modeled systems is the work conducted by L. Shufen et al. [2625], in which they proposed an algorithm to automate time series forecasting for nonexperts.
The analysis proposed in this work could be applied to future works of theoretical research. For example, the research by T. Panapongpakorn et al. [2726] or the work by D. Shuai [2827].
Jiang et al. [2928] examined their model for different anticipation times; they also compared different STLF models, taking into account error and computation times. However, anticipation times varied just from 5 min to 16 h ahead and they were used to assess models, in the same way that calculation times were employed to compare entire models.
There is research which focuses on reducing computational burden, such as that by A. McIlvenna et al. [3029]. This research aims to optimize the use of a previously built forecasting system regardless of which one it is.
With a different approach, M. Weimar et al. [3130] evaluated the improvement of a STLF system according to the economic savings with an econometric model. This is an example of how improving accuracy offers benefits that overcome developing costs.
The STLF system used in this research, duringas testing benchmark, is thate developed by the UMHniversity Miguel Hernández (UMH) [8], which was implemented in REE. The system has been operating for more than 4 years, and during this time REE and UMH have continued to collaborate in continuous improvement efforts [1531] [32].
The time a computer needs to calculate a set of predictions depends on three factors: the time that it takes to load new input data (I), the number of forecasts (n), and the time that it takes to make each prediction (P). So, the run time (t) can be modeled with the linear Eequation (1).
(1) |
Variables I and P depend on the computer and code employed. At the beginning of each hour, the forecasting system loads new input data (temperatures and previous measured load).
As mentioned before, due to the limitation of accuracy or computational burden, for each execution, there is a maximum number, N, of predictions that can be performed without exceeding the response time limit. Therefore, for each execution period, up to N predictions with greater value can be selected.
The result obtained by applying the proposed algorithm is a schedule, which defines the pforedictions to be ecasts to executed at each hour of the day. The algorithm does not depend on the mathematical model used, but on the errors that it makes regarding historical records. Therefore, the major benefit of applying the algorithm is the error reduction sparing us from calculations with worse errors than previous ones already executed. In this way, it can be applied to any system that is organized by time intervals and it is possible to choose which intervals to predict in each hour of the days.
The Figure 1 shows how to use and place the algorithm in a forecasting system. First, the maximum number of predictions that can be calculated per interval is obtained, which depends on available time and computing power, as discussed previously at equation (1). At the same time, the previous year can be predicted to calculate historical error records. Finally, the algorithm is applied to obtain a schedule that will serve to forecast load during the next year.
Figure 1. Process summary.
Other scheduling alternatives have been tested to compare the performance of the proposed algorithm. They are explsummainrized in Table 1.
Name | Explanation |
Proposed algorithm |
Employ the proposed algorithm to obtain a sechedule and then use it to forecast during the entire year. |
Complete schedule | Predict every future interval up to 9 days in advance, at every hour of the day (computationally unfeasible). |
Current planning | The current schedule from REE. |
Optimized algorithm | Employ the proposed algorithm, the with optimal number of calculations to minimize error. |
Random selection | Forecast a number, n, of random future intervals at each hour of the day. |
Last-day selection | This algorithm, at each moment, predicts the current day. It also forecasts the future day that has gone the longest time without updating, prioritizing those days which haver never been forecasted. |
Table 1. Scheduling options.
To measure the value of a prediction, a numerical indicator called accuracy improvement expectation (AIE) is used. As the name suggests, it reflpresecnts the expectation of improvement in accuracy of a predicted demand if it is recalculated. To calculate this parameter, historical records of predictions calculated under the same conditions are used; that is, forecasts calculated at the same time of day with the same advance period.
The implemented algorithm prioritizes the hourly forecasts according to larger AIE over the results of a full year, so only the firbest N valueforecasts will be predicexecuted in order to adhere to the time allowed.
Before executing the algorithm, it is necessary to determine the number of maximum forecasts, N, that the computer employed can execute. This is determined empirically by its computational speed and the calculation time limit, as explained at equation (1).
InThe paper related order to test tto this article explains the tests to validate the proposed algorithm,. The main test forecasts the year 2019 has been predicted. Twith every scheduling option, the accuracy average has been calculated for all the advances of all the hours of the year. Random selection performs worse for almost all cases; it is also inconsistent, so it is ruled out as a candidate. On the contrary, the proposed algorithm offers significantly better precision than the rest of the methods.
The year 2019 has been predicted with every method. In most advances, the optimized algorithm performs better than other methods. As a fiInal result, the year 2019 has been predicted with the current REE schedule, the algorithm, and by calculating all future hours. Except for the ninth day in advance addition, the algorithm always offers better accuracy. The Optimized Algorithm has a global improvement compared to the current schedule of the spanish TSO.
Last-day selection and optimized algorithm manage to compute predictions under 7 min and the first one requires less time. However, the optimized algorithm offers better accuracy in most cases.
The Spanish electricity system operator requires future load of 19 electrical regions. Nowadays, the entire forecasting horizon spans up to 240 h, thus the total of future loads that can be predicted extends to 4560, which require 10.16 min. However, if the quarter-hour system is employed, the number of future intervals to forecast will multiply by four. This new system wioulld entail 18,240 numbers to be calculated in 40.62 min, which is unfeasible since REE requires results before 7 min have passed.
According to Equation (1) and quarter-hour intervals, the maximum number of forecasts to compute in 7 min is 3140. So, there is time to forecast 165 intervals in every electrical region. Therefore 165 is the maximum value that can be used on the algorithm as number N of forecasts.
Thise developed research has developedoffers an algorithm that organizes the calculation schedule of a STLF system throughout thfor the entire day. The schedule obtained is adapted to the computational capacity of the computer while actually increasing the system accuracy. The methodology can be applied to any forecasting technique, even if computational burden is not an issue because it has been proven that limiting the number of forecasts can be beneficial for accuracy, as it has been demonstrated for the case of REE.
On the other hand, according to results, the main contribution of the work is to reduce the computational load of a predictive system without sacrificing accuracy. This will allow a transition to the quarter-hour system with an optimal execution schedule.
Thie s paper tudy offers a first approach to improvinge forecasting systems through calculation planning. Applying a similar study to other time series prediction systems could improve them in a similar way. As future work, it is proposed to use the algorithm to plan the new quarter-hour system of the Spanish TSO.