面向抗噪語音識別的SVM關(guān)鍵問題研究

發(fā)布時間：2018-07-18 10:04

【摘要】：語音識別是人機交互、模式識別的一項重要技術(shù),有著廣闊的應(yīng)用前景,對語音識別進行研究具有重要的理論意義和實用價值。目前多數(shù)語音識別系統(tǒng)只適合于識別“純凈”的語音,當存在噪聲或訓練和測試環(huán)境不同時,識別系統(tǒng)會出現(xiàn)性能急劇下降的現(xiàn)象,系統(tǒng)的性能還有待改進,但常用的語音識別方法難以達到很好的效果。支持向量機作為一種新型的模式識別方法,其理論依據(jù)是結(jié)構(gòu)風險最小化原則和VC維理論,該方法利用最大間隔原則、核函數(shù)等方法技巧,能較好地解決小樣本、高維數(shù)、非線性和局部最優(yōu)解等分類問題,比較適合語音信號特點,已經(jīng)初步應(yīng)用于語音識別領(lǐng)域。本文緊密圍繞如何提高基于支持向量機的語音識別系統(tǒng)綜合性能這個核心內(nèi)容展開研究,分別從抗噪語音識別系統(tǒng)的多類分類方法選擇、支持向量機核函數(shù)的選擇與構(gòu)造、加快語音識別系統(tǒng)訓練速度的局部支持向量機算法等角度出發(fā),對支持向量機在語音識別系統(tǒng)中的應(yīng)用進行了若干分析和研究。主要研究成果可以歸納為以下幾個方面:(1)詳細研究了支持向量機的理論基礎(chǔ)和基本原理,從理論上深入分析了支持向量機算法的魯棒性,從而選擇支持向量機作為本文的識別方法,構(gòu)建了基于支持向量機的語音識別系統(tǒng);詳細分析了語音識別的基礎(chǔ)原理、總體流程、模型訓練和模式匹配等內(nèi)容,研究了語音數(shù)據(jù)庫的設(shè)計和錄制流程,建立了中文500詞語音數(shù)據(jù)庫。(2)為了提高語音識別系統(tǒng)的抗噪性,本文深入研究了支持向量機解決多類分類問題的策略,首次將通信系統(tǒng)中的M-ary和糾錯輸出編碼原理引入到支持向量機的語音識別中,仿真實驗證明:在純凈和帶噪的語音環(huán)境下,糾錯輸出編碼方法具有很好的魯棒性、抗噪性和泛化識別能力。(3)核函數(shù)對于支持向量機來說很重要,其直接決定著支持向量機的最終性能,因此在支持向量機的理論研究中,核函數(shù)的選擇和構(gòu)造占據(jù)很重要的位置。本文提出了兩種新的核函數(shù):Logistic和ORF核函數(shù),分別證明了其為Mercer核函數(shù),通過核函數(shù)映射趨勢、雙螺旋測試問題、Vowel和TIDigits兩種孤立詞語音庫的實驗,驗證了新的核函數(shù)是有效的,應(yīng)用于語音識別具有很好的泛化性能和抗噪能力。(4)為了提高語音識別系統(tǒng)的實時性,加快標準支持向量機的訓練速度,考慮到語音樣本在特征上的局部相似性以及非鄰近樣本之間的弱相關(guān)性等特點,本文提出了一種改進的局部支持向量機算法模型,給出了改進算法的描述、局部核函數(shù)證明和具體的流程,通過Vowel、CASIA漢語數(shù)字串、ISOLET、中文500詞四種語音庫的實驗,驗證了改進的局部支持向量機算法在語音識別方面可以有效的縮短語音識別系統(tǒng)的訓練時間。
[Abstract]:Speech recognition is an important technology of human-computer interaction and pattern recognition, which has a broad application prospect. The research on speech recognition has important theoretical significance and practical value. At present, most speech recognition systems are only suitable for the recognition of "pure" speech. When there is noise or different training and testing environments, the performance of the recognition system will decline sharply, and the performance of the system needs to be improved. However, common speech recognition methods are difficult to achieve good results. As a new pattern recognition method, support vector machine (SVM) is based on structural risk minimization principle and VC dimension theory. The classification problems such as nonlinear and local optimal solutions are suitable for speech signal characteristics and have been applied to speech recognition. This paper focuses on how to improve the comprehensive performance of speech recognition system based on support vector machine. It selects the multi-class classification method of anti-noise speech recognition system, and selects and constructs the kernel function of support vector machine. From the point of view of accelerating the training speed of speech recognition system, the application of support vector machine in speech recognition system is analyzed and studied. The main research results can be summarized as follows: (1) the theoretical basis and basic principle of support vector machine are studied in detail, and the robustness of support vector machine algorithm is analyzed theoretically. Therefore, the support vector machine is selected as the recognition method in this paper, and the speech recognition system based on support vector machine is constructed, and the basic principle, general process, model training and pattern matching of speech recognition are analyzed in detail. The design and recording process of speech database are studied, and the Chinese 500 word speech database is established. (2) in order to improve the noise resistance of speech recognition system, this paper deeply studies the strategy of support vector machine (SVM) to solve the multi-class classification problem. The principle of M-ary and error-correcting output coding in communication system is introduced into the speech recognition of support vector machine for the first time. The simulation results show that in pure and noisy speech environment, the error correction output coding method has good robustness. (3) Kernel function is very important for support vector machine, which directly determines the final performance of support vector machine. Therefore, the selection and construction of kernel function occupy an important position in the theoretical research of support vector machine. In this paper, two new kernel functions, namely: Logistic and ORF kernel functions, are proposed. It is proved that they are Mercer kernel functions respectively. The new kernel functions are proved to be effective by the experiments of the double helix test problem, Vowel and TiDigits, of isolated word phonetic corpus, and the results show that the new kernel function is a Mercer kernel function. Application in speech recognition has good generalization performance and anti-noise ability. (4) in order to improve the real-time performance of speech recognition system, the training speed of standard support vector machine is accelerated. Considering the local similarity of speech samples and the weak correlation between non-adjacent samples, an improved local support vector machine (LSVM) algorithm model is proposed, and the description of the improved algorithm is given. Local kernel function proof and concrete flow, through the experiments of Vowelen CASIA Chinese digital string ISOLET and Chinese 500-word phonetic corpus, it is verified that the improved local support vector machine algorithm can effectively shorten the training time of speech recognition system in the aspect of speech recognition.
【學位授予單位】：太原理工大學
【學位級別】：博士
【學位授予年份】：2014
【分類號】：TN912.34;TP181

【參考文獻】

相關(guān)期刊論文前10條

1 肖志博;王煥鋼;肖英超;徐文立;;面向大規(guī)模數(shù)據(jù)集的單類支持向量機[J];東南大學學報(自然科學版);2013年S1期

2 努爾麥麥提·尤魯瓦斯;吾守爾·斯拉木;;面向大詞匯量的維吾爾語連續(xù)語音識別研究[J];計算機工程與應(yīng)用;2013年09期

3 朱瑩瑩;尹傳環(huán);牟少敏;;一種改進的局部支持向量機算法[J];計算機工程與科學;2013年02期

4 王磊;屈衛(wèi)東;;應(yīng)用PCA-SVM對伺服閥進行故障診斷[J];自動化儀表;2013年01期

5 秦春香;黃浩;;發(fā)音特征在維漢語音識別中的應(yīng)用[J];計算機工程;2012年23期

6 武曉敏;達瓦·伊德木草;吾守爾·斯拉木;;自然語料缺乏的民族語言連續(xù)語音識別[J];計算機工程;2012年12期

7 李冠宇;孟猛;;藏語拉薩話大詞表連續(xù)語音識別聲學模型研究[J];計算機工程;2012年05期

8 徐子豪;張騰飛;;基于語音識別和無線傳感網(wǎng)絡(luò)的智能家居系統(tǒng)設(shè)計[J];計算機測量與控制;2012年01期

9 尹傳環(huán);牟少敏;田盛豐;黃厚寬;朱瑩瑩;;局部支持向量機的研究進展[J];計算機科學;2012年01期

10 徐星;李元香;吳昱;金彤;;基于粒子群優(yōu)化算法的Logistic模型參數(shù)估計[J];電子學報;2010年S1期

相關(guān)會議論文前1條

1 蘇毅;吳文虎;鄭方;方棣棠;;基于支持向量機的語音識別研究[A];第六屆全國人機語音通訊學術(shù)會議（NCMMSC6）論文集[C];2001年

相關(guān)博士學位論文前10條

1 呂釗;噪聲環(huán)境下的語音識別算法研究[D];安徽大學;2011年

2 梁錦錦;支持向量機和支持向量域描述的若干問題研究[D];西安電子科技大學;2009年

3 杜俊;自動語音識別中的噪聲魯棒性方法[D];中國科學技術(shù)大學;2009年

4 吳斌;語音識別中的后處理技術(shù)研究[D];北京郵電大學;2008年

5 彭新俊;支持向量機若干問題及應(yīng)用研究[D];上海大學;2008年

6 龍潛;噪聲環(huán)境下的語音識別技術(shù)研究[D];中國科學技術(shù)大學;2007年

7 董婧;魯棒語音識別技術(shù)的研究[D];吉林大學;2007年

8 寧更新;抗噪聲語音識別新技術(shù)的研究[D];華南理工大學;2006年

9 吳濤;核函數(shù)的性質(zhì)、方法及其在障礙檢測中的應(yīng)用[D];中國人民解放軍國防科學技術(shù)大學;2003年

10 夏建濤;基于機器學習的高維多光譜數(shù)據(jù)分類[D];西北工業(yè)大學;2002年

相關(guān)碩士學位論文前9條

1 賀元元;支持向量機樣本預(yù)選取技術(shù)在語音識別中的應(yīng)用研究[D];太原理工大學;2012年

2 段繼康;多類分類支持向量機在語音識別中的應(yīng)用研究[D];太原理工大學;2010年

3 魏春明;語音變化分析及其在孤立詞識別中的應(yīng)用[D];浙江大學;2010年

4 童紅;孤立詞語音識別系統(tǒng)的技術(shù)研究[D];江蘇大學;2009年

5 郭月玲;支持向量機在語音識別中的應(yīng)用研究[D];太原理工大學;2009年

6 陳程;機載環(huán)境下的語音識別技術(shù)及實現(xiàn)[D];電子科技大學;2008年

7 崔毓菁;語音識別系統(tǒng)速度優(yōu)化算法研究[D];北京郵電大學;2008年

8 王一平;用遺傳算法改進HMM的語音識別算法研究[D];太原理工大學;2007年

9 羅迪君;基于核方法的糾錯輸出編碼多類分類算法改進[D];浙江大學;2006年

，

本文編號：2131564

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://sikaile.net/kejilunwen/wltx/2131564.html

上一篇：壓縮感知雷達超分辨率成像
下一篇：基于ARM9的智能家居系統(tǒng)設(shè)計

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

面向抗噪語音識別的SVM關(guān)鍵問題研究