Machine Learning for RC monitoring: Comparison
The reciprocating compressor (RC) is a key piece of equipment in the petroleum and chemical industries. If an RC does not operate at its rated efficiency, the company can suffer great economic loss. RCs are sometimes used to compress flammable and explosive gases, such as hydrogen, ethylene, and natural gas, under high pressures and temperatures, so a malfunction can threaten human life. Furthermore, because of the intricate structure of the compressor, its large number of wearing parts, and the complicated interactions between its moving parts, it is essential to monitor the operating condition of the compressor and to detect failures accurately and in a timely manner. Operating condition detection and fault diagnosis are therefore very important for the reliable operation of reciprocating compressors, and machine learning is one of the most powerful tools in this field.

  • reciprocating compressor
  • condition monitoring
  • fault diagnosis
  • machine learning

1. Reciprocating Compressors

Figure 1. Principle of a one-stage reciprocating compressor. 1: suction valve; 2: discharge valve; 3: piston; 4: cylinder; 5: piston rod; 6: crosshead; 7: connecting rod; 8: crankshaft.

2. Four Major Machine Learning Methods

2.1. Artificial Neural Network (ANN)

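The output of the $i$th layer of an ANN can be written as

$$a_i = f_i(W_i a_{i-1} + b_i) \qquad (1)$$

where $a_{i-1}$ is the output vector of the $(i-1)$th hidden layer (note that when $i=1$, $a_{i-1}$ is the input vector of the input layer of the whole network), $W_i$ and $b_i$ are the weights and biases of the $i$th layer, and $f_i$ is its transfer function.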
Figure 2. Graphical model of an ANN.

Typically, in an ANN model, the transfer functions are selected by the designer, and the weights and biases are adjustable parameters that are tuned by a learning method such as the error backpropagation algorithm, so that the input-output relationship of the network meets a specific goal [12,13]. The ANN model can thus be used to infer a function from observations, which is helpful in solving complex problems. Hence, it can be broadly applied in fault diagnosis, which is essentially a classification problem.
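As an illustration of how the transfer functions, weights, and biases combine, the following is a minimal sketch of a forward pass through a small network. The layer sizes, the random weights, and the choice of sigmoid and softmax transfer functions are assumptions made for the example; in practice the weights would be fitted by backpropagation rather than drawn at random.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def softmax(z):
    e = np.exp(z - np.max(z))  # shift for numerical stability
    return e / e.sum()

def forward(x, layers):
    """Propagate x through a list of (W, b, f) layers."""
    a = x
    for W, b, f in layers:
        a = f(W @ a + b)  # transfer function applied to weighted sum plus bias
    return a

rng = np.random.default_rng(0)
# Hypothetical network: 8 input features -> 5 hidden units -> 3 fault classes
layers = [
    (rng.normal(size=(5, 8)), np.zeros(5), sigmoid),
    (rng.normal(size=(3, 5)), np.zeros(3), softmax),
]
x = rng.normal(size=8)            # stand-in for a feature vector
probs = forward(x, layers)        # class scores that sum to 1
predicted_class = int(np.argmax(probs))
```

The softmax output can be read as one score per class, so the index of the largest score serves as the predicted fault class.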

2.2. Bayesian Network

The Bayesian network (also called belief network) [14] is a directed acyclic graph (as shown in Figure 3) whose nodes represent propositional variables. An arrow between two nodes means that the two nodes are directly related, and the strength of the relation is quantified by a conditional probability. The two essential properties of these networks are consistency and completeness, and the chain-rule representation of the joint distribution is used to guarantee both [15,16]:

$$P(x_1, x_2, \ldots, x_n) = P(x_n \mid x_{n-1}, \ldots, x_1) \cdots P(x_2 \mid x_1)\, P(x_1) \qquad (2)$$

Figure 3. A typical Bayesian network.

It can be seen that on the right-hand side of the chained formula, each variable appears exactly once on the left side of the conditioning bar, which facilitates quantifying the dependencies of the network. For instance, the chain rule representation for the network shown in Figure 3 is:
The Bayesian network is a methodology that integrates probability theory and graph theory. Not only can it visually exhibit the structure of real tasks as a graph, but it can also exploit that structure according to the principles of probability theory, which reduces the complexity of reasoning. Therefore, the Bayesian network is applied in many domains. The Bayesian network also provides a framework for new models; among them, the naive Bayes model is commonly selected for classification and prediction of multi-dimensional discrete time series [17,18].
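To make the naive Bayes model mentioned above concrete, here is a minimal sketch for two classes and two binary features. The class names, feature meanings, and all probability values are made up for illustration; the only structural assumption is the naive Bayes one, namely that the features are conditionally independent given the class.

```python
import numpy as np

# Hypothetical classes: 0 = "normal", 1 = "valve fault"
prior = np.array([0.8, 0.2])  # P(class), invented values

# P(feature_j = 1 | class), invented values for two binary features:
# feature 0 = "discharge temperature high", feature 1 = "pressure ratio low"
p_feature_given_class = np.array([
    [0.1, 0.2],  # class 0
    [0.7, 0.6],  # class 1
])

def posterior(x):
    """Return P(class | x) for a binary feature vector x."""
    x = np.asarray(x)
    # Likelihood of each observed feature value under each class
    lik = np.where(x == 1, p_feature_given_class, 1.0 - p_feature_given_class)
    joint = prior * lik.prod(axis=1)  # P(class) * prod_j P(x_j | class)
    return joint / joint.sum()        # normalize over classes

post = posterior([1, 1])
```

With both symptoms present, the joint terms are 0.8 × 0.1 × 0.2 = 0.016 and 0.2 × 0.7 × 0.6 = 0.084, so the posterior is 0.16 for "normal" and 0.84 for "valve fault".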

2.3. Support Vector Machine (SVM)

The support vector machine (SVM) is built around a separating hyperplane (see Figure 4). It can separate n-dimensional inputs into two parts associated with the two distinct classes. The hyperplane can be described as [19]:

$$w^{T} x + b = 0 \qquad (4)$$

where $w$ is the normal vector of the hyperplane and $b$ is the bias. To ensure the generalization ability of the SVM, the simplest bound, the maximal margin bound, is adopted, which implies:

$$\min_{w,\,b} \ \frac{1}{2}\lVert w \rVert^{2} \quad \text{subject to} \quad y_i\,(w^{T} x_i + b) \ge 1 \ \text{for every training sample } i \qquad (5)$$

where $(x_i, y_i)$ is the $i$th sample of the training set, and $y_i \in \{-1, +1\}$. Formula (5) is a convex quadratic programming problem, so every local minimum is also a global minimum [20,21]. By using the Kuhn–Tucker conditions to convert the problem into the equivalent Lagrangian dual quadratic optimization problem, the parameters of the SVM, namely $w$ and $b$, can be obtained [19,22]. Besides the maximal margin bound, other generalization bounds are available, such as margin percentile bounds and soft margin bounds.

Figure 4. A maximal margin hyperplane with its support vectors circled.

The SVM was originally designed for binary classification, so additional strategies are needed to accomplish multiclass classification. The three major multiclass SVMs, which differ in structure, are the one-against-one SVM, the one-against-all SVM, and the directed acyclic graph (DAG) SVM [23,24].
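The one-against-all strategy can be sketched as follows: one binary SVM is trained per class (that class against the rest), and a new sample is assigned to the class whose classifier gives the largest score. This sketch makes two simplifying assumptions that do not come from the text: the binary SVM is a linear soft-margin model, and it is fitted by sub-gradient descent on the hinge loss rather than by solving the dual quadratic program. The toy data, the function names, and the hyperparameters are hypothetical.

```python
import numpy as np

def train_linear_svm(X, y, lam=0.01, lr=0.05, epochs=500):
    """Linear soft-margin SVM via sub-gradient descent (y in {-1, +1})."""
    n, d = X.shape
    w, b = np.zeros(d), 0.0
    for _ in range(epochs):
        margins = y * (X @ w + b)
        active = margins < 1  # samples that violate the margin
        grad_w = lam * w - (y[active][:, None] * X[active]).sum(axis=0) / n
        grad_b = -y[active].sum() / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

def train_one_vs_all(X, labels, n_classes):
    # One binary problem per class: class k is +1, every other class is -1
    return [train_linear_svm(X, np.where(labels == k, 1.0, -1.0))
            for k in range(n_classes)]

def predict_one_vs_all(X, models):
    scores = np.column_stack([X @ w + b for w, b in models])
    return np.argmax(scores, axis=1)  # class with the largest decision value

# Toy data: three well-separated clusters standing in for three conditions
rng = np.random.default_rng(0)
centers = np.array([[0.0, 0.0], [5.0, 0.0], [0.0, 5.0]])
labels = np.repeat(np.arange(3), 20)
X = centers[labels] + 0.3 * rng.normal(size=(60, 2))

models = train_one_vs_all(X, labels, n_classes=3)
accuracy = np.mean(predict_one_vs_all(X, models) == labels)
```

For k classes, one-against-all needs k binary classifiers, whereas one-against-one needs k(k-1)/2, one for each pair of classes.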

2.4. Deep Learning (DL)

The deep learning model is composed of multiple processing modules, and each module transforms the representation from the previous layer into a higher and more abstract level in the current layer. With enough suitable modules combined, extremely intricate relationships can be learned. The internal parameters of the deep learning model are obtained by applying a backpropagation algorithm to a large set of data. The convolutional neural network (CNN), deep belief network (DBN), and auto-encoder are the three main deep learning methods. The CNN is designed to process data in the form of multiple arrays, such as time series and image data [26]. The DBN is built by stacking several restricted Boltzmann machines (RBMs), each of which is an undirected bipartite graphical model. A Boltzmann machine (BM) is an energy-based model, and its modeling capacity can be improved by increasing the number of hidden variables [27].
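To illustrate how a CNN processes a time series, the sketch below applies a single one-dimensional convolution, a ReLU nonlinearity, and max pooling to a short signal. The signal and the kernel are hand-picked for the example; in a real CNN the kernel weights would be learned by backpropagation, and many kernels and layers would be stacked.

```python
import numpy as np

def conv1d_valid(signal, kernel):
    """Slide the kernel over the signal without padding (cross-correlation)."""
    k = len(kernel)
    n = len(signal) - k + 1
    return np.array([np.dot(signal[i:i + k], kernel) for i in range(n)])

def relu(x):
    return np.maximum(x, 0.0)

def max_pool1d(x, size):
    """Non-overlapping max pooling; trailing samples that do not fit are dropped."""
    n = len(x) // size
    return x[:n * size].reshape(n, size).max(axis=1)

# Hypothetical signal with a small and a large impulse
signal = np.array([0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 4.0, 0.0, 0.0])
kernel = np.array([-1.0, 2.0, -1.0])  # hand-picked spike detector, not learned

feature_map = relu(conv1d_valid(signal, kernel))  # [0, 2, 0, 0, 0, 8, 0]
pooled = max_pool1d(feature_map, 2)               # [2, 0, 8]
```

The feature map responds only where the signal contains an impulse, and pooling keeps that response while shortening the sequence.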

An auto-encoder is a purely unsupervised representation learning algorithm that consists of an encoder and a decoder. The encoder transforms the input into a new representation, and the decoder converts that representation back into the original form. The auto-encoder can be used to reduce the dimensionality of a dataset and to learn more abstract features [27,28].
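The encoder and decoder roles can be sketched with the simplest possible case, a linear auto-encoder. As a deliberate simplification, the encoder and decoder here are obtained in closed form from a singular value decomposition (equivalent to PCA) instead of being trained as a neural network; the synthetic data and the code size of 2 are assumptions for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: 200 samples with 6 features that lie close to a 2-D subspace
latent = rng.normal(size=(200, 2))
mixing = rng.normal(size=(2, 6))
X = latent @ mixing + 0.01 * rng.normal(size=(200, 6))

mean = X.mean(axis=0)
_, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
components = Vt[:2]  # top-2 principal directions, shape (2, 6)

def encode(x):
    """Map 6-D inputs to 2-D codes."""
    return (x - mean) @ components.T

def decode(z):
    """Map 2-D codes back to the 6-D input space."""
    return z @ components + mean

codes = encode(X)
X_hat = decode(codes)
mse = np.mean((X - X_hat) ** 2)  # reconstruction error
```

A small reconstruction error indicates that the two-dimensional code retains most of the information in the six original features, which is the sense in which an auto-encoder reduces dimensionality.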

3. Applications of Machine Learning in Fault Diagnosis of the Reciprocating Compressor

The performance of most machine learning methods depends mainly on the feature extractor used before classification, and the choice of feature extractor depends in turn on the characteristics of the raw signals. Therefore, this section is divided into four parts according to the nature of the signals, and within each part the different machine learning methods are discussed in separate paragraphs.

3.1. Fault Diagnosis Based on Process Parameters

The parametric method is a diagnostic method for the reciprocating compressors based on process parameters including the compressor pressure, temperature, flow rate, etc.

3.1.1. Fault Diagnosis Based on p–V Diagram

The compressor pressure, one of the most typical process parameters, can be monitored via the p–V diagram. The p–V diagram is a two-dimensional cycle diagram that shows how the dynamic pressure in the compressor chamber varies with the working volume over one working cycle. Faults in the compressor valves, piston rings, support rings, and other components such as the shaft, lubrication oil, and bearings can change the pressure in the cylinder and hence the shape of the p–V diagram. Hence, the p–V diagram (cylinder pressure) is a very useful parameter for fault diagnosis in reciprocating compressors.
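One simple way to turn the shape of a p–V diagram into a number is to compute the area enclosed by the loop, which corresponds to the work done on the gas per cycle. The sketch below does this with the shoelace formula. Using the enclosed area as a diagnostic feature is an assumption made for illustration, and the rectangular loop is an idealized stand-in for measured data.

```python
import numpy as np

def loop_area(volume, pressure):
    """Area enclosed by a closed p-V loop (shoelace formula), in joules
    when volume is in cubic meters and pressure is in pascals."""
    v = np.asarray(volume, dtype=float)
    p = np.asarray(pressure, dtype=float)
    return 0.5 * abs(np.dot(v, np.roll(p, -1)) - np.dot(p, np.roll(v, -1)))

# Idealized rectangular cycle (hypothetical values): the four corners of a
# loop between 1 and 5 liters and between 1 and 4 bar
volume = [1e-3, 5e-3, 5e-3, 1e-3]     # m^3
pressure = [1e5, 1e5, 4e5, 4e5]       # Pa

work = loop_area(volume, pressure)    # 4e-3 m^3 * 3e5 Pa = 1200 J
```

A fault that distorts the loop, for example a leaking valve that rounds off a corner, would change this area relative to the healthy cycle.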

Guerra [38] extracted data from the dynamic pressure signal processed with a binned fast Fourier transform (FFT) and PCA for the detection of valve faults through Bayesian classification at 50% and 100% load.
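The idea of a binned FFT feature can be sketched as follows: the power spectrum of the signal is computed and then summed over a small number of frequency bands, which gives a compact feature vector that could be passed on to PCA and a classifier. The number of bins, the equal-width bin layout, and the synthetic two-tone signal are assumptions for the example and are not taken from [38].

```python
import numpy as np

def binned_fft(signal, n_bins):
    """Sum the power spectrum of a real signal into n_bins equal-width bands."""
    power = np.abs(np.fft.rfft(signal)) ** 2
    power = power[1:]  # drop the DC component
    edges = np.linspace(0, len(power), n_bins + 1).astype(int)
    return np.array([power[a:b].sum() for a, b in zip(edges[:-1], edges[1:])])

fs = 1000                      # sampling rate in Hz (assumed)
t = np.arange(fs) / fs         # one second of samples
# Synthetic stand-in for a dynamic pressure signal: 50 Hz plus a weaker 300 Hz tone
signal = np.sin(2 * np.pi * 50 * t) + 0.5 * np.sin(2 * np.pi * 300 * t)

features = binned_fft(signal, n_bins=5)  # bands of 100 Hz each up to 500 Hz
```

Here the 50 Hz tone falls into the first band and the 300 Hz tone into the third, so those two entries dominate the feature vector.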

The applications of the three main traditional ML methods and deep learning in RC fault diagnosis based on the p–V diagram were reviewed in this section; SVM and ANN are the most widely used methods in this field.

3.1.2. Fault Diagnosis Based on Pressures Measured in Other Volumes

Besides p–V diagrams, pressures measured in other volumes can also be used to recognize faults.

The applications of ML methods in RC fault diagnosis based on pressures measured in volumes other than the cylinder were reviewed in this section; studies on RC fault detection based on such pressures are few, and those that exist employed ANN and Bayes classifiers.

3.2. Fault Diagnosis Based on Vibration Signals

Vibration analysis is a typical monitoring method for RCs. Many faults in RCs lead to abnormal vibration, which can be diagnosed from the vibration signals because they carry a great deal of information about the condition of the machine.

The applications of ML methods in RC fault diagnosis based on vibration signals were reviewed in this section. A considerable number of studies focus on fault detection techniques based on vibration signals and, as with the p–V diagram, many different SVM and ANN models were employed as classifiers in these cases, whereas Bayes classifiers and deep learning were rarely used.

3.3. Fault Diagnosis Based on Acoustic Emission (AE)

The applications of ML methods in RC fault diagnosis based on AE signals were reviewed in this section. The number of studies in this field is smaller than for the p–V diagram and vibration signals, and the classifiers used were mainly ANN and SVM.

3.4. Fault Diagnosis Based on Multi-Source Signals

The faults in RCs are intricate, and it is difficult to recognize all of them from a single signal or parameter. Therefore, it is important to conduct studies on fault detection based on multi-source signals.

