1. Background
1.1. Building Energy Consumption
The 2018 Global Status Report from the International Energy Agency (IEA) found that building operation accounts for 36% of global energy use
[1]. Furthermore, Yang et al. found that heating, ventilation, and air conditioning (HVAC) systems account for 40% of all building energy consumption
[2]. Together, these figures imply that HVAC systems account for roughly 14% of global energy use (0.36 × 0.40 ≈ 0.144). Qin et al. found that in Hong Kong, 20.9% of commercial buildings operate continuously with faults, including sensor errors and actuator failures, which degrade their performance
[3]. A building system has many potential failure points because it may contain thousands of sensors, dampers, and other controllable devices. Katipamula et al. found that operational faults cause 15–30% of energy to be wasted in commercial buildings
[4].
Building management systems are now installed in many buildings. These systems provide an interface for continuous measurement and monitoring of HVAC system components, which creates new possibilities for advanced fault detection and diagnostics.
Building management systems provide information about system components which can be analyzed using software tools. This information includes all sensor measurements in the building, such as actuator positioning or motor control. These measurements can be monitored for stability or anomalies, though interdependencies between components and self-correcting building programming can obscure the fault from detection.
Fault detection in building systems is an important energy conservation measure. HVAC system and lighting faults can increase energy consumption by up to 18%
[5] and degrade occupant comfort and productivity when thermal setpoints are not met
[6]. A sensor failure can increase energy consumption by misrepresenting the temperature of air moving through the system, an error that downstream components must then correct. Failures such as these are represented in the data collected by building systems and may be detected using advanced analytics. Machine learning algorithms for fault detection and diagnostics can detect failures quickly and with limited additional manual analysis, which reduces the time a building spends in faulty operation and minimizes wasted energy and lost occupant productivity.
1.2. History of FDD in Building Systems
Katipamula et al.
[4] conducted a review of established methods for fault detection and diagnostics (FDD) in building systems. The authors divide diagnostic methods into three groups: quantitative model based, qualitative model based, and process history based.
As of the time of Katipamula’s publication (2005), the majority of completed research belonged to the qualitative model-based methods, which include the most popular current form of fault detection in industry: the rule-based method. The strengths of these models lie in their simplicity and interpretability. It is easy to explain these rules to engineers of all knowledge levels. Rule-based models also perform strongly in projects with large amounts of data compared to those with less data. Breuker and Braun conducted an experiment on a building with two sets of data: one with six measurements and another with ten measurements. They were able to conclude that performance of a rule-based FDD algorithm increased by a factor of two when doubling the measurements by system sensors and using higher-order models
[5].
The first iteration on advanced building analytics came in the form of model-based methods. Gertler et al. define model-based methods as relying on analytical redundancy rather than physical redundancy; the former compares sensor values with values calculated from first-principles equations, whereas the latter compares sensor values with other sensor values
[6]. Bendapudi and Braun developed a chiller model from first-principles equations to be used in FDD calculations
[7]. However, these first-principles models are seldom used for FDD because of their required input parameters, which are extensive and may be missing from the building’s measurements. Lebrun and Bourdouxhe conducted a review of dynamic HVAC models, which covered over 500 references for all areas of the building system
[8]. Though these models have become more commonplace as computation speeds have improved, comparatively few of them, fewer than 25%, have been applied to FDD in commercial buildings
[9][10][11][12][13][14][15].
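The distinction between physical and analytical redundancy can be sketched in a few lines of Python. This is an illustration only: the mixed-air energy balance, the function names, the example temperatures, and the 1.5 °C tolerance are assumptions chosen for the example, not values or models taken from the cited works.

```python
def physical_residual(sensor_a: float, sensor_b: float) -> float:
    """Physical redundancy: compare one sensor against a duplicate sensor."""
    return sensor_a - sensor_b


def mixed_air_temp_model(outdoor_temp: float, return_temp: float,
                         outdoor_air_fraction: float) -> float:
    """First-principles estimate of mixed-air temperature (simple energy
    balance, assuming equal specific heat for both air streams)."""
    return (outdoor_air_fraction * outdoor_temp
            + (1.0 - outdoor_air_fraction) * return_temp)


def analytical_residual(measured_mixed_temp: float, outdoor_temp: float,
                        return_temp: float, outdoor_air_fraction: float) -> float:
    """Analytical redundancy: compare a sensor against a calculated value."""
    expected = mixed_air_temp_model(outdoor_temp, return_temp, outdoor_air_fraction)
    return measured_mixed_temp - expected


TOLERANCE_C = 1.5  # hypothetical tolerance, in degrees Celsius

# Hypothetical readings: measured mixed air 19.0, outdoor 5.0, return 22.0, 30% outdoor air.
residual = analytical_residual(19.0, 5.0, 22.0, 0.30)
fault_suspected = abs(residual) > TOLERANCE_C
```

In this sketch the model input `outdoor_air_fraction` must be known or estimated, which illustrates why extensive input requirements can limit first-principles approaches in practice.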
Rule-based systems are based on a series of if-then statements to produce rules that govern fault detection. These rules can be as simple as a single threshold limit or more complicated to include several statements chained together using and/or operators. PECI and Battelle conducted a review of commonly applied rules to validate their usefulness in real scenarios
[16]. Engineers have also applied rules derived from first-principles equations
[17][18]. House et al. developed a ruleset named “Air-Handling Unit Performance Assessment Rules” (APAR) to detect faults in air handling units. The ruleset includes 28 rules that represent faults in common operating states of the air handling unit, along with possible causes for violations of these rules. The rules cover faults impacting occupant comfort, indoor air quality, energy, and equipment life. The authors note some drawbacks of rule-based systems, namely that the concept of fault severity is missing when using singular rules; the output of a rule evaluation is binary and gives little indication of how far the rule was violated. Fault severity metrics may be introduced by defining multiple rules, at the cost of added complexity in the model.
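A minimal Python sketch of a rule-based detector follows. The rules, point names (`supply_air_temp`, `heating_valve`, and so on), and thresholds are hypothetical and are not taken from APAR; the sketch only shows how if-then rules chain conditions with and/or operators, why each rule output is binary, and how a second, stricter rule adds a coarse notion of severity.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Reading = Dict[str, float]


@dataclass
class Rule:
    name: str
    condition: Callable[[Reading], bool]  # returns True when the rule is violated


def sat_deviation(r: Reading) -> float:
    """Absolute deviation of supply air temperature from its setpoint."""
    return abs(r["supply_air_temp"] - r["supply_air_setpoint"])


# Hypothetical ruleset: two tiers of the same threshold rule give a coarse
# severity level, and the third rule chains two conditions with "and".
rules: List[Rule] = [
    Rule("SAT deviates from setpoint (minor)", lambda r: sat_deviation(r) > 2.0),
    Rule("SAT deviates from setpoint (severe)", lambda r: sat_deviation(r) > 5.0),
    Rule("Heating and cooling valves open together",
         lambda r: r["heating_valve"] > 0.05 and r["cooling_valve"] > 0.05),
]


def violated_rules(reading: Reading, ruleset: List[Rule]) -> List[str]:
    """Evaluate every rule; each evaluation is a binary pass or fail."""
    return [rule.name for rule in ruleset if rule.condition(reading)]


reading = {
    "supply_air_temp": 16.5,
    "supply_air_setpoint": 13.0,
    "heating_valve": 0.0,
    "cooling_valve": 0.8,
}
faults = violated_rules(reading, rules)
```

For this reading only the minor deviation rule fires. Expressing severity required a second rule with its own threshold, which is the added complexity the authors describe.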
Zhao et al. also conducted a review of recent innovations in the use of artificial intelligence and machine learning methods for Automated Fault Detection and Diagnostics (AFDD) tasks
[19]. Their research divides artificial intelligence (AI) methods into two broad categories: knowledge-driven and data-driven methods. They determined that while there has been increased interest in all artificial intelligence algorithms over the past two decades, a large percentage of these projects have focused on data-driven methods. The authors found that of the articles reviewed, 79% were based on data-driven methods and 21% were based on knowledge-driven methods.
Chen et al. conducted a review of AFDD methods and found that of the major HVAC system categories (building, variable refrigerant flow, heat pump, air handling unit (AHU), variable air volume (VAV) terminal, chiller, and sensor), 33% of reviewed research covered AHUs and 25% covered chillers
[20]. Their references include knowledge-driven and data-driven methods and provide a comprehensive picture of research in the field. Their survey is divided into three major sections: knowledge-driven, data-driven, and hybrid approaches. Similar to Katipamula and Zhao, the authors found that knowledge-driven AI approaches still represent less than 25% of published research, while over 75% of completed AFDD research uses data-driven approaches.
The historical references above summarize how FDD in building systems has evolved. This research discusses key historical developments, expands into machine learning (ML) research developments in AFDD, and includes currently published applications of ML.
This research has been divided into the major parts shown in
Figure 1, all of which are classified as data-driven methods by Katipamula and other reviews mentioned previously
[4]. Within data-driven methods, there has been significant focus on supervised and unsupervised learning methods as well as accompanying methods such as Principal Component Analysis (PCA), which is often used as a preprocessing step before the former two methods
[20]. Historically, less complex methods such as clustering and rule-based methods have been used
[4]. Neural networks and Support Vector Machines (SVMs) have seen an exponential increase in attention in recent years as computer processing power has grown. With higher power, computers are able to optimize complicated networks in a fraction of the time and produce more accurate results.
Figure 1. Tree of research.
2. Modern Machine Learning for Fault Detection in Building Systems
Computation power has increased dramatically in recent years along with substantial increases in data collection, creating new possibilities for fault detection methods using machine learning. Zhao et al. conducted a review in 2019 of the trends in industry using artificial intelligence methods for FDD
[19]. The research divided the methods into two broad categories: data-driven methods and knowledge-driven methods. Subcategories of machine learning algorithms which further divide these broad categories include supervised learning and unsupervised learning.
Common faults in building systems have been found to be
[21]:
Supervised learning methods utilize the input features of a dataset and the known output classes to train a model. These methods require more processed data than other types of algorithms, since the input datasets require additional information to represent their output classes. These output classes may be provided from the sensors in the building, or they may be assigned manually.
Unsupervised learning problems differ from supervised learning in that the training datasets contain only input values and lack output values. The unsupervised algorithm determines a probable output based on a set of inputs measured from the system. An unsupervised learning problem is often solved using clustering, where a dataset is grouped into several clusters forming density clouds. The supervised learning counterpart to this problem is classification, where new data are assigned to a class based on the known characteristics of previously labeled input data. These are basic examples of algorithms used for supervised and unsupervised learning; many other algorithms exist to solve each problem.
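The contrast between the two problem types can be sketched on a toy one-dimensional dataset. In this Python illustration the residual values, the labels, and the choice of a nearest-centroid classifier and a simple k-means routine are all assumptions made for the example; they stand in for the many algorithms available for each problem.

```python
from statistics import mean
from typing import Dict, List


def fit_centroids(values: List[float], labels: List[str]) -> Dict[str, float]:
    """Supervised: use the known output classes to compute one centroid per class."""
    return {
        cls: mean([v for v, lab in zip(values, labels) if lab == cls])
        for cls in sorted(set(labels))
    }


def classify(value: float, centroids: Dict[str, float]) -> str:
    """Assign a new sample to the class with the nearest centroid."""
    return min(centroids, key=lambda cls: abs(value - centroids[cls]))


def kmeans_1d(values: List[float], k: int = 2, iterations: int = 20) -> List[float]:
    """Unsupervised: group unlabeled values into k clusters (requires k >= 2)."""
    lo, hi = min(values), max(values)
    centers = [lo + (hi - lo) * i / (k - 1) for i in range(k)]  # evenly spaced start
    for _ in range(iterations):
        groups: List[List[float]] = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Keep the old center if a cluster received no points.
        centers = [mean(g) if g else centers[i] for i, g in enumerate(groups)]
    return centers


# Hypothetical temperature residuals; labels exist only for the supervised case.
residuals = [0.1, 0.3, 0.2, 0.4, 3.1, 2.9, 3.4, 0.2]
labels = ["normal", "normal", "normal", "normal", "fault", "fault", "fault", "normal"]

centroids = fit_centroids(residuals, labels)   # needs labels
predicted = classify(2.5, centroids)           # returns a named class
cluster_centers = kmeans_1d(residuals, k=2)    # needs no labels
```

The classifier returns a named class because labels were supplied during training, whereas the clustering routine returns only anonymous cluster centers that still have to be interpreted as normal or faulty operation.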
Tidriri et al. considered a hybrid approach that combines data-driven and model-driven methods
[22]. The authors found that performance of data-driven methods is highly dependent on the training data, while performance of model-driven methods is highly dependent on the mathematical model used in the analysis. The authors propose that a hybrid approach using data-driven methods for fault detection and model-driven methods for fault diagnostics could prove more successful than using either data-driven or model-driven methods alone. The authors also found that many researchers face problems bridging the two methods without an established framework, which makes the combination difficult to implement. While there are several examples of hybrid model applications
[23][24][25][26], the number of publications in this area is only beginning to increase.
2.1. Feature Selection
Feature selection is applicable to both supervised and unsupervised learning algorithms and is used to trim redundant or unnecessary data from the input dataset, which can improve training speed and accuracy, depending on the approach. Researchers have investigated optimal approaches to feature selection.
Chandrashekar et al. conducted a review of the impact of feature selection in an analysis
[27]. Chandrashekar found that for one of their datasets, reducing from 34 features to just 9 features improved its modeled performance from 90% to 95%. Models containing excessive features may overfit the dataset and perform worse in testing. Though this suggests that pruning some features from the data is beneficial, another dataset including all features produced an accuracy of almost 80%, while reducing that dataset by 1 feature, to a total of 7, reduced its accuracy to just 71%. This second example shows that pruning features from a dataset may also degrade performance. The authors’ conclusion is that feature selection should be cross-validated for each application.
Yan et al. evaluated the benefits of a feature selection algorithm for data preprocessing
[26]. Yan found that feature selection can help identify the information that most improves model performance. This may be used when deciding which sensors to purchase because each sensor is ranked according to its importance to the model. Cost-limited analysis, which simulates projects with a limited financial budget, evaluated this tradeoff and found that using just 16 sensors in their chiller system can provide enough information for machine learning analysis with greater than 95% accuracy.
Yan et al. successfully used the ReliefF algorithm, which calculates a score for each feature to determine its importance, alongside SVM analysis for feature selection
[28]. The authors used just 6 variables in the final model instead of the original 65 variables and produced models with greater than 90% accuracy.
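The idea of scoring each feature can be sketched in Python. Note that this is the simpler, original two-class Relief procedure rather than ReliefF itself (ReliefF extends it to multiple classes and averages over several nearest neighbors), and it is not the authors' implementation. The function name `relief_scores` and the toy data are assumptions made for illustration.

```python
import numpy as np


def relief_scores(X, y) -> np.ndarray:
    """Basic two-class Relief: reward features that differ at the nearest
    sample of the other class (miss) and penalize features that differ at
    the nearest sample of the same class (hit).
    Assumes at least two samples per class."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    span = X.max(axis=0) - X.min(axis=0)
    span[span == 0] = 1.0                      # avoid division by zero
    Xn = (X - X.min(axis=0)) / span            # scale each feature to [0, 1]
    n_samples, n_features = Xn.shape
    weights = np.zeros(n_features)
    for i in range(n_samples):
        dist = np.abs(Xn - Xn[i]).sum(axis=1)  # Manhattan distance to every sample
        dist[i] = np.inf                       # exclude the sample itself
        same_class = y == y[i]
        hit = np.argmin(np.where(same_class, dist, np.inf))
        miss = np.argmin(np.where(~same_class, dist, np.inf))
        weights += (np.abs(Xn[i] - Xn[miss]) - np.abs(Xn[i] - Xn[hit])) / n_samples
    return weights


# Hypothetical data: feature 0 separates the classes, feature 1 does not.
X = [[0.0, 0.5], [0.1, 0.9], [0.2, 0.1],
     [0.9, 0.4], [1.0, 0.8], [0.8, 0.2]]
y = [0, 0, 0, 1, 1, 1]

scores = relief_scores(X, y)
ranking = np.argsort(scores)[::-1]  # feature indices, most important first
```

A ranking like this could then be truncated to the top few features before training a classifier, with the cut-off chosen by cross-validation.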
2.2. Data-Driven Methods
Principal Component Analysis
Principal Component Analysis methods have been used extensively in sensor fault detection. PCA projects high-dimensional data onto a lower-dimensional space that retains most of the dataset’s variance; the first principal component of the dataset is the direction that captures the most variance. This process is illustrated in
Figure 2. Wang and Xiao applied PCA to AHU sensors in their studies
[29]. Li and Wen combined PCA methods and wavelet transforms to detect faults in AHUs
[30]. Du et al. combined PCA and Fisher’s Discriminant Analysis for FDD in sensors in VAV systems
[31].
Figure 2. Visual representation of principal component analysis.
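A minimal sketch of PCA-based sensor fault detection follows, written in Python with NumPy. It is an illustration under stated assumptions rather than the method of any cited study: the three correlated temperature sensors are synthetic, the squared prediction error (SPE, also called the Q-statistic) is one commonly used residual measure, and the 99th-percentile threshold is an arbitrary choice for the example.

```python
import numpy as np


def fit_pca(X: np.ndarray, n_components: int):
    """Fit PCA on fault-free data; returns mean, std, loadings, variance ratios."""
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0
    Z = (X - mean) / std                       # standardize each sensor
    _, s, vt = np.linalg.svd(Z, full_matrices=False)
    explained = s ** 2 / np.sum(s ** 2)        # variance ratio per component
    return mean, std, vt[:n_components].T, explained


def spe(x: np.ndarray, mean, std, loadings) -> np.ndarray:
    """Squared prediction error: the part of x the retained components cannot explain."""
    z = (x - mean) / std
    residual = z - (z @ loadings) @ loadings.T
    return np.sum(residual ** 2, axis=-1)


# Synthetic fault-free data: three sensors measuring the same air stream.
rng = np.random.default_rng(0)
true_temp = rng.normal(20.0, 2.0, size=500)
X_normal = np.column_stack(
    [true_temp + rng.normal(0.0, 0.1, size=500) for _ in range(3)]
)

mean, std, loadings, explained = fit_pca(X_normal, n_components=1)
threshold = np.percentile(spe(X_normal, mean, std, loadings), 99)

healthy = np.array([21.0, 21.0, 21.0])
biased = np.array([21.0, 21.0, 24.0])          # third sensor reads 3 degrees high

healthy_flagged = spe(healthy, mean, std, loadings) > threshold
biased_flagged = spe(biased, mean, std, loadings) > threshold
```

Because the three sensors move together, one principal component captures nearly all of the variance, and a sample that breaks that correlation produces a large SPE even though each individual reading is within a plausible range.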
Hu et al. studied the sensitivity of fault detection in different fault severity levels in chiller sensors
[32]. However, there are limitations in using PCA methods for FDD. Zhao et al. found that PCA relies on linear assumptions, which are detrimental to performance because chiller data are often nonlinear
[33].
Beghi et al. used PCA to distinguish anomalies from normal operation and reconstructed the contributions of each variable to order variables according to their improvement to the model’s accuracy
[34]. Mahadevan et al. used PCA and dynamic PCA to detect faults using established procedures and combined those results with a one-class support vector machine to perform fault diagnostics
[35]. Zhang et al. combined PCA and clustering to detect and diagnose faults in building sensors
[36].
Xiao et al. developed PCA models which monitor heat balance and pressure-flow balance in an AHU
[37]. Wang and Xiao expanded on this research by employing expert rules to assist in diagnosing faults and developing separate models for heat and pressure-flow balance
[38]. The authors’ research concludes with a third iteration that evaluates these models on simulated results
[39]. Many researchers have studied the abilities of PCA to detect and diagnose sensor faults throughout an HVAC system
[40][41][42].