基于文本無關(guān)的聲紋識(shí)別算法的研究及實(shí)現(xiàn)

發(fā)布時(shí)間：2018-12-20 06:32

【摘要】：隨著互聯(lián)網(wǎng)技術(shù)的迅猛發(fā)展,網(wǎng)絡(luò)逐漸覆蓋到了社會(huì)生活的各個(gè)角落。在互聯(lián)網(wǎng)環(huán)境中,傳統(tǒng)的身份認(rèn)證方法面臨巨大的挑戰(zhàn),越來越無法適應(yīng)實(shí)際應(yīng)用環(huán)境的需求。在所有的身份認(rèn)證方法中,生物特征身份識(shí)別技術(shù)是一種基于人類特有的生理和后天特性進(jìn)行的身份識(shí)別技術(shù),因其獨(dú)特的優(yōu)勢而在實(shí)際中得到了廣泛的應(yīng)用。在所有生物特征身份識(shí)別技術(shù)中,與文本無關(guān)的聲紋確認(rèn)技術(shù)被認(rèn)為是最具實(shí)用性的生物特征身份識(shí)別技術(shù)之一,該技術(shù)通過目標(biāo)說話人的語音對(duì)說話人的身份進(jìn)行確認(rèn),是語音識(shí)別研究的一個(gè)重要分支。在實(shí)際應(yīng)用環(huán)境中,受到采集設(shè)備、傳輸線路等多種因素的影響,最終得到的有效語音數(shù)據(jù)非常有限,進(jìn)而使得系統(tǒng)的識(shí)別性能和執(zhí)行效率很難達(dá)到理想的識(shí)別效果。因此,本文主要基于文本無關(guān)的短語音聲紋確認(rèn)方法進(jìn)行研究。在聲紋確認(rèn)系統(tǒng)中,系統(tǒng)的識(shí)別率和計(jì)算復(fù)雜度是衡量系統(tǒng)性能的重要指標(biāo)。傳統(tǒng)的UBM-MAP-GMM模型架構(gòu)在一定程度上解決了測試語音與訓(xùn)練語音失配的情況,系統(tǒng)識(shí)別性能也比較理想,然而在實(shí)際應(yīng)用中,面對(duì)短語音問題,該模型的運(yùn)算量需求較大,系統(tǒng)魯棒性較差。因此,本文從減少系統(tǒng)計(jì)算量、提高識(shí)別率等多個(gè)角度出發(fā)對(duì)聲紋識(shí)別算法進(jìn)行了研究,具體有以下幾個(gè)方面:1.分析了模型訓(xùn)練中模型初始值對(duì)EM算法的影響,針對(duì)傳統(tǒng)K-means算法隨機(jī)選擇初始聚類中心可能導(dǎo)致算法局部收斂的缺陷,提出了基于密度和距離的初始聚類中心選擇算法,對(duì)K-means算法進(jìn)行了改進(jìn),并且通過實(shí)驗(yàn)證明了算法。2.探討分析了UBM-MAP-GMM模型架構(gòu),針對(duì)其計(jì)算量大、個(gè)人聲紋模型GMM服從同一模型結(jié)構(gòu)及部分高斯分量對(duì)識(shí)別結(jié)果的影響,提出了基于UBM-CM-MAP-GMM模型架構(gòu)的聲紋確認(rèn)方法。實(shí)驗(yàn)證明,該方法使得算法在識(shí)別時(shí)間、等錯(cuò)誤率方面都有一定的改善。3.在UBM-CM-MAP-GMM模型架構(gòu)中,對(duì)聲紋模型GMM的混合度的取值進(jìn)行研究,實(shí)驗(yàn)數(shù)據(jù)顯示當(dāng)GMM混合度為UBM的一半時(shí)效果最好。4.在UBM-CM-MAP-GMM模型架構(gòu)上實(shí)現(xiàn)了短語音聲紋確認(rèn)軟件,并對(duì)軟件的識(shí)別效率進(jìn)行了實(shí)驗(yàn)分析與驗(yàn)證,相比于傳統(tǒng)的UBM-MAP-GMM模型架構(gòu),改進(jìn)算法使得計(jì)算量和等錯(cuò)誤率都一定程度的降低。
[Abstract]:With the rapid development of Internet technology, the network gradually covers every corner of social life. In the Internet environment, the traditional identity authentication method is facing a huge challenge, which is more and more unable to meet the needs of the practical application environment. Among all the authentication methods, biometric identification technology is a kind of identity recognition technology based on human physiological and acquired characteristics, which has been widely used in practice because of its unique advantages. Among all biometric identification techniques, text-independent voiceprint recognition is considered to be one of the most practical biometric identification techniques. It is an important branch of speech recognition. In the practical application environment, due to the influence of many factors, such as acquisition equipment, transmission line, and so on, the final effective speech data is very limited, which makes the recognition performance and execution efficiency of the system difficult to achieve the ideal recognition effect. Therefore, this paper is mainly based on the text-independent phonetics validation method. The recognition rate and computational complexity of the system are important indexes to evaluate the system performance in the voiceprint verification system. The traditional UBM-MAP-GMM model structure solves the mismatch between the test speech and the trained speech to a certain extent, and the recognition performance of the system is also ideal. However, in the practical application, in the face of the short speech problem, the model requires a lot of computation. System robustness is poor. Therefore, this paper studies the voiceprint recognition algorithm from several angles, such as reducing the system computation and improving the recognition rate. The main contents are as follows: 1. This paper analyzes the influence of the initial value of the model on the EM algorithm in model training, aiming at the defect that the traditional K-means algorithm randomly selects the initial clustering center, which may lead to the local convergence of the algorithm, an initial clustering center selection algorithm based on density and distance is proposed. The K-means algorithm is improved, and the algorithm is proved by experiment. 2. 2. The structure of UBM-MAP-GMM model is discussed and analyzed. According to the large amount of calculation, the influence of individual voice-pattern model GMM service from the same model structure and part of Gao Si component on the recognition result is discussed. A voiceprint validation method based on UBM-CM-MAP-GMM model architecture is proposed. Experiments show that the algorithm can improve the recognition time and error rate of the algorithm. In the framework of UBM-CM-MAP-GMM model, the mixing degree of the voiceprint model GMM is studied. The experimental data show that the best result is when the mixing degree of GMM is half that of UBM. 4. In this paper, the phonetics validation software is implemented on the UBM-CM-MAP-GMM model architecture, and the recognition efficiency of the software is analyzed and verified experimentally. Compared with the traditional UBM-MAP-GMM model architecture, the recognition efficiency of the software is compared with that of the traditional UBM-MAP-GMM model. The improved algorithm reduces the amount of computation and the rate of equal error to a certain extent.
【學(xué)位授予單位】：電子科技大學(xué)
【學(xué)位級(jí)別】：碩士
【學(xué)位授予年份】：2017
【分類號(hào)】：TN912.3

【相似文獻(xiàn)】

相關(guān)期刊論文前10條

1 蔡耿平,黃順珍,徐志鴻,藍(lán)波,范國華,梁凡;聲紋識(shí)別系統(tǒng)[J];深圳大學(xué)學(xué)報(bào);2002年02期

2 于哲舟,楊佳東,蒲東兵,周春光,王綱巧;多門限聲紋識(shí)別方法[J];吉林大學(xué)學(xué)報(bào)(信息科學(xué)版);2005年02期

3 朱浩冰;郭東輝;;聲紋識(shí)別系統(tǒng)原理及其關(guān)鍵技術(shù)[J];計(jì)算機(jī)安全;2007年09期

4 靳玉紅;;聲紋識(shí)別中的語言屬性映射[J];重慶郵電大學(xué)學(xué)報(bào)(自然科學(xué)版);2012年04期

5 葉田田;;聲紋識(shí)別系統(tǒng)設(shè)計(jì)[J];工業(yè)控制計(jì)算機(jī);2012年06期

6 霍春寶;張彩娟;趙紅敏;;與文本無關(guān)的聲紋識(shí)別系統(tǒng)的研究[J];遼寧工業(yè)大學(xué)學(xué)報(bào)(自然科學(xué)版);2013年01期

7 楊凌;蔡濤;李瀚;;一種改進(jìn)型回聲狀態(tài)網(wǎng)絡(luò)及其在聲紋識(shí)別上的應(yīng)用[J];中國科技信息;2014年08期

8 陳幼松;從“芝麻開門”到聲紋識(shí)別[J];百科知識(shí);2003年01期

9 任培花;孫宏志;;基于言語過濾、情感補(bǔ)償?shù)幕铙w聲紋識(shí)別系統(tǒng)的設(shè)計(jì)[J];重慶科技學(xué)院學(xué)報(bào)(自然科學(xué)版);2007年01期

10 王會(huì)清;張濤;周帆;;聲紋識(shí)別在虛擬儀器平臺(tái)的實(shí)現(xiàn)[J];武漢工程大學(xué)學(xué)報(bào);2012年12期

相關(guān)會(huì)議論文前2條

1 楊瑩春;雷震春;吳朝暉;;基于情感補(bǔ)償?shù)幕铙w聲紋識(shí)別框架研究[A];第一屆中國情感計(jì)算及智能交互學(xué)術(shù)會(huì)議論文集[C];2003年

2 黃曉丹;洪青陽;李琳;李稀敏;梁大偉;陳萬里;呂偉辰;丘敬云;王薇;;聲紋識(shí)別語音數(shù)據(jù)庫建設(shè)的探討[A];第十一屆全國人機(jī)語音通訊學(xué)術(shù)會(huì)議論文集（一）[C];2011年

相關(guān)重要報(bào)紙文章前5條

1 閆潔;聲紋識(shí)別高精尖聽音辨人不遙遠(yuǎn)[N];新華每日電訊;2014年

2 吳璽宏;聲紋識(shí)別應(yīng)用前景[N];計(jì)算機(jī)世界;2001年

3 邢方亮;以聲辨人[N];計(jì)算機(jī)世界;2003年

4 北京大學(xué)信息科學(xué)中心視覺與聽覺信息處理國家重點(diǎn)實(shí)驗(yàn)室吳璽宏;聲紋識(shí)別聽聲辨人[N];計(jì)算機(jī)世界;2001年

5 本報(bào)記者霍娜;云上積累云中綻放[N];中國計(jì)算機(jī)報(bào);2014年

相關(guān)博士學(xué)位論文前1條

1 張晶;聲紋識(shí)別魯棒性技術(shù)及應(yīng)用研究[D];廣東工業(yè)大學(xué);2015年

相關(guān)碩士學(xué)位論文前10條

1 楊瑞瑞;基于文本無關(guān)的聲紋識(shí)別算法的研究及實(shí)現(xiàn)[D];電子科技大學(xué);2017年

2 于嫻;聲紋識(shí)別在微信中的模式匹配研究[D];貴州大學(xué);2015年

3 劉磊;聲紋識(shí)別算法在軍事通話中的研究與實(shí)現(xiàn)[D];東北大學(xué);2014年

4 陳俊彬;融合聲紋識(shí)別的護(hù)理床語音控制系統(tǒng)研發(fā)[D];廣東工業(yè)大學(xué);2016年

5 周雷;基于聲紋識(shí)別的說話人身份確認(rèn)方法的研究[D];上海師范大學(xué);2016年

6 胡青;卷積神經(jīng)網(wǎng)絡(luò)在聲紋識(shí)別中的應(yīng)用研究[D];貴州大學(xué);2016年

7 陳霄鵬;聲紋識(shí)別中的時(shí)變魯棒性問題研究[D];貴州大學(xué);2016年

8 張芝旖;聲紋識(shí)別相關(guān)技術(shù)研究及應(yīng)用[D];南京航空航天大學(xué);2016年

9 李韻;聲紋識(shí)別系統(tǒng)中特征參數(shù)提取方法的對(duì)比分析研究[D];成都理工大學(xué);2016年

10 王可;基于移動(dòng)終端的聲紋識(shí)別系統(tǒng)關(guān)鍵算法研究[D];上海師范大學(xué);2017年

，

本文編號(hào)：2387580

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會(huì)員下載

Download by Member

本文鏈接：http://sikaile.net/kejilunwen/xinxigongchenglunwen/2387580.html

上一篇：低復(fù)雜度的壓縮感知信道估計(jì)方法
下一篇：基于MIMO雷達(dá)信號(hào)模型的天線方向圖研究

論文發(fā)表

·知網(wǎng)|萬方|維普|龍?jiān)磡省級(jí)|國家級(jí)|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于文本無關(guān)的聲紋識(shí)別算法的研究及實(shí)現(xiàn)