基于移動(dòng)終端的聲紋識(shí)別系統(tǒng)關(guān)鍵算法研究

發(fā)布時(shí)間：2018-08-09 08:14

【摘要】：聲紋識(shí)別技術(shù)是一種生物認(rèn)證方法,它從說話人的語音中提取出能反映說話人生理和行為個(gè)性的特征,再結(jié)合模式識(shí)別的理論,來判斷說話人身份。本文主要針對(duì)基于移動(dòng)終端的聲紋識(shí)別系統(tǒng)的相關(guān)技術(shù)進(jìn)行了研究。在語音端點(diǎn)檢測方面,本文提出了改進(jìn)的能量-過零率兩級(jí)融合端點(diǎn)檢測法,該方法與傳統(tǒng)的能量-過零率端點(diǎn)檢測法不同,它可以將能量檢測和過零檢測分開操作,使這兩種檢測的結(jié)果同時(shí)進(jìn)行又互不影響,從而實(shí)現(xiàn)多線程并行計(jì)算。此外,改進(jìn)的能量-過零率端點(diǎn)檢測法在檢測中運(yùn)用的是單門限,相對(duì)于傳統(tǒng)算法,改進(jìn)算法可將閾值參數(shù)減少一半,使算法過程更加簡單。針對(duì)空間資源有限的移動(dòng)終端,本文將改進(jìn)算法與常用的單閾值能量檢測法進(jìn)行對(duì)比,發(fā)現(xiàn)運(yùn)用改進(jìn)算法的聲紋識(shí)別系統(tǒng)的識(shí)別率更高。因此,改進(jìn)的能量-過零率兩級(jí)融合端點(diǎn)檢測法在移動(dòng)終端上具有很高的應(yīng)用價(jià)值。針對(duì)傳統(tǒng)語音幀投票法無法突出每一幀語音判決結(jié)果的差異性的問題,本文提出了基于似然概率的的加權(quán)投票法。此方法根據(jù)不同語音幀與概率模型之間的似然概率取值,對(duì)每一幀語音進(jìn)行加權(quán),使得似然概率大的語音幀權(quán)重更大,置信度更高,從而增強(qiáng)每幀語音判決結(jié)果之間的差異,使語音幀融合結(jié)果更準(zhǔn)確。同時(shí),通過多次的加權(quán)檢測,本文驗(yàn)證了基于加權(quán)投票法的聲紋識(shí)別系統(tǒng)比基于傳統(tǒng)投票法的識(shí)別系統(tǒng)識(shí)別性能更優(yōu)。最后,本文設(shè)計(jì)了多種特征提取技術(shù)以及概率模型的組合方案,通過實(shí)際識(shí)別效果和算法復(fù)雜度的角度來分析它們?cè)谝苿?dòng)終端上的可行性,選出最可行的方案。并且根據(jù)最優(yōu)的聲紋識(shí)別系統(tǒng)方案,設(shè)計(jì)了一種基于移動(dòng)終端的聲紋識(shí)別系統(tǒng),并在MATLAB平臺(tái)上實(shí)現(xiàn)了該系統(tǒng),該系統(tǒng)可實(shí)現(xiàn)聲紋采集、模型訓(xùn)練、聲紋識(shí)別、聲紋注冊(cè)、聲紋確認(rèn)等功能。目前,該系統(tǒng)已經(jīng)成功移植于Android系統(tǒng)當(dāng)中。
[Abstract]:Voiceprint recognition is a biometric authentication method, which extracts the characteristics that reflect the speaker's physiological and behavioral personality from the speaker's speech, and then combines the theory of pattern recognition to judge the speaker's identity. This paper mainly focuses on the related technology of voiceprint recognition system based on mobile terminal. In the aspect of speech endpoint detection, this paper presents an improved two-stage fusion endpoint detection method with energy-zero crossing rate. This method is different from the traditional energy-zero-crossing rate endpoint detection method, and it can separate energy detection from zero-crossing detection. The results of these two kinds of detection are carried out simultaneously without affecting each other, so that multithreaded parallel computing is realized. In addition, the improved energy-zero crossing rate endpoint detection method uses a single threshold, compared with the traditional algorithm, the improved algorithm can reduce the threshold parameter by half, and make the algorithm more simple. For mobile terminals with limited space resources, the improved algorithm is compared with the conventional single threshold energy detection method. It is found that the recognition rate of the voiceprint recognition system using the improved algorithm is higher than that of the conventional single threshold energy detection method. Therefore, the improved energy-zero-crossing two-stage fusion endpoint detection method has high application value in mobile terminal. Aiming at the problem that the traditional voice frame voting method can not highlight the difference of the result of each frame, a weighted voting method based on likelihood probability is proposed in this paper. According to the likelihood probability of different speech frames and probabilistic models, each frame is weighted by this method, which makes the speech frames with large likelihood probability have greater weight and higher confidence, thus enhancing the difference between the results of speech judgment in each frame. The result of speech frame fusion is more accurate. At the same time, through multiple weighted detection, this paper verifies that the voice-pattern recognition system based on weighted voting method is better than that based on traditional voting method. Finally, this paper designs a variety of feature extraction techniques and probability model combination scheme, through the actual recognition effect and algorithm complexity to analyze their feasibility on the mobile terminal, select the most feasible scheme. According to the optimal scheme of voiceprint recognition system, a voiceprint recognition system based on mobile terminal is designed, and the system is implemented on MATLAB platform. The system can realize voice pattern acquisition, model training, voiceprint recognition and registration. Voiceprint confirmation and other functions. At present, the system has been successfully transplanted to the Android system.
【學(xué)位授予單位】：上海師范大學(xué)
【學(xué)位級(jí)別】：碩士
【學(xué)位授予年份】：2017
【分類號(hào)】：TN912.34

【參考文獻(xiàn)】

相關(guān)期刊論文前10條

1 屈丹,王炳錫,魏鑫;基于GMM-UBM模型的語言辨識(shí)研究[J];信號(hào)處理;2003年01期

2 甄斌,吳璽宏,劉志敏,遲惠生;語音識(shí)別和說話人識(shí)別中各倒譜分量的相對(duì)重要性[J];北京大學(xué)學(xué)報(bào)(自然科學(xué)版);2001年03期

3 胡光銳,韋曉東;基于倒譜特征的帶噪語音端點(diǎn)檢測[J];電子學(xué)報(bào);2000年10期

4 趙雪芬 ,江肇蓮;頻譜分析儀的諧波測量技術(shù)[J];國外電子測量技術(shù);2001年02期

5 陳芬菲;;基于GMM的說話人識(shí)別系統(tǒng)[J];微處理機(jī);2006年04期

6 燕繼坤,鄭輝,王艷,曾立君;基于可信度的投票法[J];計(jì)算機(jī)學(xué)報(bào);2005年08期

7 王娜;鄭德忠;張淑清;;基于混沌振子的低信噪比語音端點(diǎn)檢測新方法[J];儀器儀表學(xué)報(bào);2009年07期

8 韓志艷;王旭;王健;;基于短時(shí)能零積和鑒別信息的語音端點(diǎn)檢測[J];東北大學(xué)學(xué)報(bào)(自然科學(xué)版);2009年12期

9 陳業(yè)仙;張歆奕;毛杰;;基于GMM-UBM的語言辨識(shí)算法研究[J];五邑大學(xué)學(xué)報(bào)(自然科學(xué)版);2010年03期

10 蔣曄;唐振民;;GMM文本無關(guān)的說話人識(shí)別系統(tǒng)研究[J];計(jì)算機(jī)工程與應(yīng)用;2010年11期

相關(guān)博士學(xué)位論文前1條

1 張晶;聲紋識(shí)別魯棒性技術(shù)及應(yīng)用研究[D];廣東工業(yè)大學(xué);2015年

相關(guān)碩士學(xué)位論文前6條

1 李煒鋒;基于Android的有身份識(shí)別功能的流媒體播放器的設(shè)計(jì)與實(shí)現(xiàn)[D];電子科技大學(xué);2014年

2 路娜;孤立詞語音識(shí)別系統(tǒng)的研究與設(shè)計(jì)[D];曲阜師范大學(xué);2014年

3 陳衛(wèi)強(qiáng);基于DSP的孤立詞語音識(shí)別系統(tǒng)的研究與實(shí)現(xiàn)[D];南昌航空大學(xué);2013年

4 張慧珊;基于聲紋識(shí)別和動(dòng)態(tài)密碼的雙因素身份認(rèn)證系統(tǒng)的研究與實(shí)現(xiàn)[D];武漢理工大學(xué);2013年

5 胡政權(quán);說話人識(shí)別中語音參數(shù)提取方法的研究[D];南京師范大學(xué);2013年

6 郝艷莉;基于DM6446的音頻信號(hào)識(shí)別系統(tǒng)的研究[D];哈爾濱理工大學(xué);2012年

，

本文編號(hào)：2173475

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會(huì)員下載

Download by Member

本文鏈接：http://sikaile.net/kejilunwen/xinxigongchenglunwen/2173475.html

上一篇：3D大規(guī)模MIMO通信系統(tǒng)傳輸方案研究
下一篇：移動(dòng)支付相關(guān)技術(shù)與專利分析

論文發(fā)表

·知網(wǎng)|萬方|維普|龍?jiān)磡省級(jí)|國家級(jí)|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于移動(dòng)終端的聲紋識(shí)別系統(tǒng)關(guān)鍵算法研究