基于支持向量機的旋轉(zhuǎn)機械故障診斷與預(yù)測方法研究
發(fā)布時間:2018-09-12 14:09
【摘要】:隨著現(xiàn)代科學(xué)技術(shù)的迅猛發(fā)展,旋轉(zhuǎn)機械不斷朝著大型化、復(fù)雜化、高速化、連續(xù)化和自動化的方向發(fā)展。這些發(fā)展在帶來較高生產(chǎn)效率的同時,對設(shè)備的安全運轉(zhuǎn)也提出了更高的要求,一旦發(fā)生故障,將造成巨大的經(jīng)濟損失,甚至?xí)䦟?dǎo)致災(zāi)難性的人員傷亡事故和嚴重的社會影響。對設(shè)備的故障進行診斷,以及根據(jù)歷史數(shù)據(jù)對設(shè)備運行狀態(tài)進行預(yù)測,是保證設(shè)備安全可靠運行的重要措施,只有如此才能及時有效的處理存在的問題,將故障消除在萌芽狀態(tài)。本文研究基于支持向量機理論對設(shè)備運行狀態(tài)進行故障診斷并根據(jù)歷史運行數(shù)據(jù)對設(shè)備未來狀態(tài)進行預(yù)測的方法,建立了設(shè)備運行狀態(tài)預(yù)報和故障診斷模型,并對一些關(guān)鍵問題的解決進行了深入研究。本文主要創(chuàng)新工作如下: 1、提出了基于遺傳算法的支持向量機樣本平衡方法 對于故障診斷來說,故障樣本一般少于正常樣本,所以普遍存在不平衡樣本問題。支持向量機在遇到樣本不平衡問題時,往往造成“少樣本類”的誤診。針對該問題,本文提出了基于遺傳算法的樣本平衡方法,利用遺傳算法中的交叉、變異方式生成子代樣本,對“少樣本類”進行繁殖擴充,得到更多的該類樣本進而達到兩類樣本的平衡。為使擴充的樣本更具有針對性,更有利于形成正確的“分類超平面”,給出了父代樣本的選擇方法以及子代樣本的評價方法。 2、分析了EMD方法產(chǎn)生虛假分量的原因,并提出了一種識別虛假分量的方法 作為特征提取的方法,EMD能夠較好的處理非平穩(wěn)、非線性問題,但利用EMD方法對信號進行處理時常常會引入虛假分量,影響分析的準確性,是嚴重制約EMD方法發(fā)展的瓶頸問題。為了消除虛假分量的影響,更好地發(fā)揮EMD方法在特征提取中的作用,本文提出基于K-L散度的虛假分量識別方法,該方法利用K-L散度來評價EMD分解得到的各個分量與原信號的關(guān)系程度,分量與原信號之間的K-L散度越小,關(guān)系程度越大,分量的真實性就越高,反之虛假性就越高,虛假分量通過設(shè)定閾值進行判別。同時,文中研究并給出了閡值設(shè)定方法。 3、提出了基于EMD特征提取的支持向量機算法 在根據(jù)歷史數(shù)據(jù)對運行狀態(tài)進行預(yù)測時,特征參數(shù)與預(yù)測點的關(guān)聯(lián)程度在很大程度上決定了預(yù)測值的準確性。目前,特征參數(shù)的選取方法主要有兩種:一種是基于實測數(shù)據(jù)的特征,即采集與預(yù)測值相關(guān)聯(lián)的影響因素作為特征,如對風電功率預(yù)測時選取風速、氣壓等因素作為特征,但對于振動等一些預(yù)測量來說,其影響因素往往十分復(fù)雜,不易明確,利用這種方式也就無法建立高精度預(yù)測模型;另一種是通過對時間序列的計算得到其特征參數(shù),這類方法中最具代表性且最常用的方法是基于相空間重構(gòu)的方法,方法利用混沌理論計算嵌入維數(shù)和時間延遲重構(gòu)相空間,得到時間序列的特征,然而嵌入維數(shù)以及時間延遲的確定只是從時間序列的動力學(xué)特性角度來考慮的,用其構(gòu)造的特征對于預(yù)測模型來說并不一定合適,所以往往會由于二者選取的不恰當而無法得到合適的特征,進而造成預(yù)測的精度大大降低。故本文針對預(yù)測模型特征選取的問題,提出了基于EMD特征提取的支持向量機算法(EMD-SVM),利用EMD分解后各時間點的分量值作為特征,并與該時間點對應(yīng)的時間序列值(目標值)共同構(gòu)成樣本,建立預(yù)測模型,并通過實驗證明其較高的準確性和穩(wěn)定性。 4、針對支持向量機存在大規(guī)模樣本問題提出了基于信息熵的樣本長度選擇方法 大規(guī)模訓(xùn)練樣本問題一直是困擾SVM計算速度提升的瓶頸,過多的訓(xùn)練樣本會大大的增加計算成本,而且不一定會帶來更準確的預(yù)測結(jié)果,甚至?xí)䦟?dǎo)致更嚴重的偏差。所以,訓(xùn)練樣本的長度必須控制在合適的范圍內(nèi)。針對該問題,本文提出了基于信息熵的樣本長度選擇方法。該方法的基本思想是選取與預(yù)測點最相關(guān)的歷史數(shù)據(jù)作為訓(xùn)練樣本,保證數(shù)據(jù)信息的完備性和不冗余?拷A(yù)測點的歷史數(shù)據(jù)通常與預(yù)測點之間關(guān)系越強,這些數(shù)據(jù)作為訓(xùn)練樣本對于預(yù)測點來說意義較大;隨著距離的增加(時間不斷向前推移)關(guān)聯(lián)性越弱,對于預(yù)測點意義較小,將這些數(shù)據(jù)加入到訓(xùn)練樣本中會表現(xiàn)出數(shù)據(jù)波動性增大,平穩(wěn)性降低,需要對這些點進行刪減。本文方法基于這一思想,在歷史數(shù)據(jù)中從前向后依次選取不同的位置作為起始點截取時間序列,計算不同起始點截得時間序列的信息熵值。將對應(yīng)信息熵最小的起始點作為新時間坐標軸的“0點坐標”,坐標軸負半軸的數(shù)據(jù)時間久遠且與當前狀態(tài)相關(guān)程度低,需剔除,正半軸數(shù)據(jù)與當前相關(guān)程度高,故用來作為訓(xùn)練樣本,如此可以保證信息的完備性,同時避免了建立模型時的尋優(yōu)過程對相關(guān)程度較低的訓(xùn)練樣本的“照料”。文中通過理論分析和實驗的方式從計算時間和預(yù)測精度的角度考察了方法的有效性。
[Abstract]:With the rapid development of modern science and technology, rotating machinery is developing toward the direction of large-scale, complex, high-speed, continuous and automation. These developments bring about higher production efficiency, at the same time, higher requirements for the safe operation of equipment, once a fault occurs, it will cause enormous economic losses, or even lead to it. Disastrous casualties and serious social impacts. Diagnosis of equipment failures and prediction of equipment operation status based on historical data are important measures to ensure the safe and reliable operation of equipment. Only in this way can problems be dealt with in a timely and effective manner, and the failure can be eliminated in the embryonic state. Support Vector Machine (SVM) theory is used to diagnose the running state of the equipment and predict the future state of the equipment according to the historical running data.
1, a sample balance method based on genetic algorithm for support vector machines is proposed.
For fault diagnosis, the number of fault samples is usually less than that of normal samples, so unbalanced samples are common. Support vector machines often cause misdiagnosis of "few sample classes" when they encounter unbalanced samples. In order to make the expanded sample more pertinent and more conducive to forming a correct "classification hyperplane", the selection method of parent sample and the evaluation method of offspring sample are given.
2, the causes of false components produced by the EMD method are analyzed, and a method to identify false components is proposed.
As a method of feature extraction, EMD can deal with non-stationary and non-linear problems well, but when EMD is used to process signals, false components are often introduced. The accuracy of impact analysis is a bottleneck problem which seriously restricts the development of EMD. In order to eliminate the influence of false components, EMD method can be better used in feature extraction. In this paper, a method of false component identification based on K-L divergence is proposed. This method uses K-L divergence to evaluate the relationship between each component obtained by EMD decomposition and the original signal. The smaller the K-L divergence between the component and the original signal, the greater the relationship, the higher the authenticity of the component, otherwise the higher the falseness, and the false component by setting a threshold. At the same time, the method of setting the threshold value is studied and given.
3, a support vector machine algorithm based on EMD feature extraction is proposed.
At present, there are two main methods to select feature parameters: one is based on the characteristics of measured data, that is, the factors associated with the predicted values are collected as features, such as wind power. Wind speed, air pressure and other factors are selected as the characteristics of power forecasting, but for some forecasting variables such as vibration, the influencing factors are often very complex and difficult to define, so it is impossible to establish a high-precision forecasting model by using this method; the other is to get the characteristic parameters by calculating the time series, which is the most representative method. And the most commonly used method is based on the phase space reconstruction method. The method uses chaos theory to calculate the embedding dimension and time delay to reconstruct the phase space and get the characteristics of time series. Because of the improper selection of the two features, the prediction accuracy will be greatly reduced. Therefore, this paper proposes a support vector machine algorithm based on EMD feature extraction (EMD-SVM), which uses the component values of each time point after EMD decomposition as the prediction model feature selection problem. It is characterized by the time series value (target value) corresponding to the time point to form a sample, and the prediction model is established, and its high accuracy and stability are proved by experiments.
4. Aiming at the problem of large-scale sample in support vector machine, a sample length selection method based on information entropy is proposed.
The problem of large-scale training samples has always been a bottleneck to speed up the calculation of SVM. Too many training samples will greatly increase the calculation cost, and will not necessarily lead to more accurate prediction results, or even lead to more serious deviations. Therefore, the length of training samples must be controlled within a suitable range. The basic idea of this method is to select the most relevant historical data as training samples to ensure the completeness and non-redundancy of data information. With the increase of distance (time goes forward) the correlation is weaker and the significance for predicting points is smaller. Adding these data into training samples will increase the data volatility and reduce the stationarity, which needs to be deleted. Based on this idea, this method selects the historical data from front to back in turn. The information entropy value of the time series is calculated by using the same position as the starting point and the minimum starting point of the corresponding information entropy is regarded as the "0-point coordinate" of the new time coordinate axis. It is used as training sample to ensure the completeness of the information and avoid the "care" of the training sample with low correlation in the process of establishing the model.
【學(xué)位授予單位】:華北電力大學(xué)
【學(xué)位級別】:博士
【學(xué)位授予年份】:2013
【分類號】:TH165.3
[Abstract]:With the rapid development of modern science and technology, rotating machinery is developing toward the direction of large-scale, complex, high-speed, continuous and automation. These developments bring about higher production efficiency, at the same time, higher requirements for the safe operation of equipment, once a fault occurs, it will cause enormous economic losses, or even lead to it. Disastrous casualties and serious social impacts. Diagnosis of equipment failures and prediction of equipment operation status based on historical data are important measures to ensure the safe and reliable operation of equipment. Only in this way can problems be dealt with in a timely and effective manner, and the failure can be eliminated in the embryonic state. Support Vector Machine (SVM) theory is used to diagnose the running state of the equipment and predict the future state of the equipment according to the historical running data.
1, a sample balance method based on genetic algorithm for support vector machines is proposed.
For fault diagnosis, the number of fault samples is usually less than that of normal samples, so unbalanced samples are common. Support vector machines often cause misdiagnosis of "few sample classes" when they encounter unbalanced samples. In order to make the expanded sample more pertinent and more conducive to forming a correct "classification hyperplane", the selection method of parent sample and the evaluation method of offspring sample are given.
2, the causes of false components produced by the EMD method are analyzed, and a method to identify false components is proposed.
As a method of feature extraction, EMD can deal with non-stationary and non-linear problems well, but when EMD is used to process signals, false components are often introduced. The accuracy of impact analysis is a bottleneck problem which seriously restricts the development of EMD. In order to eliminate the influence of false components, EMD method can be better used in feature extraction. In this paper, a method of false component identification based on K-L divergence is proposed. This method uses K-L divergence to evaluate the relationship between each component obtained by EMD decomposition and the original signal. The smaller the K-L divergence between the component and the original signal, the greater the relationship, the higher the authenticity of the component, otherwise the higher the falseness, and the false component by setting a threshold. At the same time, the method of setting the threshold value is studied and given.
3, a support vector machine algorithm based on EMD feature extraction is proposed.
At present, there are two main methods to select feature parameters: one is based on the characteristics of measured data, that is, the factors associated with the predicted values are collected as features, such as wind power. Wind speed, air pressure and other factors are selected as the characteristics of power forecasting, but for some forecasting variables such as vibration, the influencing factors are often very complex and difficult to define, so it is impossible to establish a high-precision forecasting model by using this method; the other is to get the characteristic parameters by calculating the time series, which is the most representative method. And the most commonly used method is based on the phase space reconstruction method. The method uses chaos theory to calculate the embedding dimension and time delay to reconstruct the phase space and get the characteristics of time series. Because of the improper selection of the two features, the prediction accuracy will be greatly reduced. Therefore, this paper proposes a support vector machine algorithm based on EMD feature extraction (EMD-SVM), which uses the component values of each time point after EMD decomposition as the prediction model feature selection problem. It is characterized by the time series value (target value) corresponding to the time point to form a sample, and the prediction model is established, and its high accuracy and stability are proved by experiments.
4. Aiming at the problem of large-scale sample in support vector machine, a sample length selection method based on information entropy is proposed.
The problem of large-scale training samples has always been a bottleneck to speed up the calculation of SVM. Too many training samples will greatly increase the calculation cost, and will not necessarily lead to more accurate prediction results, or even lead to more serious deviations. Therefore, the length of training samples must be controlled within a suitable range. The basic idea of this method is to select the most relevant historical data as training samples to ensure the completeness and non-redundancy of data information. With the increase of distance (time goes forward) the correlation is weaker and the significance for predicting points is smaller. Adding these data into training samples will increase the data volatility and reduce the stationarity, which needs to be deleted. Based on this idea, this method selects the historical data from front to back in turn. The information entropy value of the time series is calculated by using the same position as the starting point and the minimum starting point of the corresponding information entropy is regarded as the "0-point coordinate" of the new time coordinate axis. It is used as training sample to ensure the completeness of the information and avoid the "care" of the training sample with low correlation in the process of establishing the model.
【學(xué)位授予單位】:華北電力大學(xué)
【學(xué)位級別】:博士
【學(xué)位授予年份】:2013
【分類號】:TH165.3
【參考文獻】
相關(guān)期刊論文 前10條
1 王娟娟;任秋實;;基于SMO的不同懲罰系數(shù)的SVM算法[J];信息技術(shù);2006年10期
2 李雪耀;鄒曉杰;張汝波;錢真;;譜熵和主成分分析用于EMD分解研究[J];哈爾濱工程大學(xué)學(xué)報;2009年07期
3 程軍圣;于德介;楊宇;;基于EMD和SVM的滾動軸承故障診斷方法[J];航空動力學(xué)報;2006年03期
4 翟永杰;劉林;王朋;;基于ADE-SVM和模糊理論的電力系統(tǒng)中期負荷預(yù)測[J];電力系統(tǒng)保護與控制;2012年08期
5 劉向東 ,駱斌 ,陳兆乾;支持向量機最優(yōu)模型選擇的研究[J];計算機研究與發(fā)展;2005年04期
6 李林峰;孫長銀;;基于FCM聚類與SVM的電力系統(tǒng)短期負荷預(yù)測[J];江蘇電機工程;2007年03期
7 范勁松,方廷健;特征選擇和提取要素的分析及其評價[J];計算機工程與應(yīng)用;2001年13期
8 劉慧婷,張e,
本文編號:2239252
本文鏈接:http://sikaile.net/kejilunwen/jixiegongcheng/2239252.html
最近更新
教材專著