基于CNN的連續(xù)語(yǔ)音說(shuō)話人聲紋識(shí)別

發(fā)布時(shí)間：2019-02-13 07:28

【摘要】：近年來(lái),隨著社會(huì)生活水平的不斷提高,人們對(duì)機(jī)器智能人聲識(shí)別的要求越來(lái)越高。高斯混合—隱馬爾可夫模型(Gaussian of mixture-hidden Markov model,GMM-HMM)是說(shuō)話人識(shí)別研究領(lǐng)域中最重要的模型。由于該模型對(duì)大語(yǔ)音數(shù)據(jù)的建模能力不是很好,對(duì)噪聲的頑健性也比較差,模型的發(fā)展遇到了瓶頸。為了解決該問(wèn)題,研究者開(kāi)始關(guān)注深度學(xué)習(xí)技術(shù)。引入了CNN深度學(xué)習(xí)模型研究連續(xù)語(yǔ)音說(shuō)話人識(shí)別問(wèn)題,并提出了CNN連續(xù)說(shuō)話人識(shí)別(continuous speaker recognition of convolutional neural network,CSR-CNN)算法。模型提取固定長(zhǎng)度、符合語(yǔ)序的語(yǔ)音片段,形成時(shí)間線上的有序語(yǔ)譜圖,通過(guò)CNN提取特征序列,經(jīng)過(guò)獎(jiǎng)懲函數(shù)對(duì)特征序列組合進(jìn)行連續(xù)測(cè)量。實(shí)驗(yàn)結(jié)果表明,CSR-CNN算法在連續(xù)—片段說(shuō)話人識(shí)別領(lǐng)域取得了比GMM-HMM更好的識(shí)別效果。
[Abstract]:In recent years, with the continuous improvement of social living standards, the demand of machine intelligent voice recognition is becoming higher and higher. Gao Si Hybrid-Hidden Markov Model (Gaussian of mixture-hidden Markov model,GMM-HMM) is the most important model in the field of speaker recognition. Because the modeling ability of the model for large speech data is not very good, and the robustness to noise is also relatively poor, the development of the model has encountered a bottleneck. In order to solve this problem, researchers begin to pay attention to the technology of deep learning. In this paper, CNN depth learning model is introduced to study the continuous speech speaker recognition problem, and a CNN continuous speaker recognition (continuous speaker recognition of convolutional neural network,CSR-CNN) algorithm is proposed. The model extracts the speech fragments of fixed length and accords with the word order, and forms the ordered linguistic spectrum on the time line. The feature sequences are extracted by CNN, and the combination of feature sequences is continuously measured by the reward and punishment function. Experimental results show that the CSR-CNN algorithm achieves better recognition performance than GMM-HMM in the field of continuous-segment speaker recognition.
【作者單位】：杭州電子科技大學(xué);
【分類號(hào)】：TP393

【相似文獻(xiàn)】

相關(guān)會(huì)議論文前8條

1 曹陽(yáng);黃泰翼;;基于統(tǒng)計(jì)方法的漢語(yǔ)連續(xù)語(yǔ)音中聲調(diào)模式的研究[A];第九屆全國(guó)信號(hào)處理學(xué)術(shù)年會(huì)（CCSP-99）論文集[C];1999年

2 程蘭穎;俞鐵城;李忠香;;基于音節(jié)分割的連續(xù)語(yǔ)音多模板隱馬爾可夫模型的研究[A];第三屆全國(guó)人機(jī)語(yǔ)音通訊學(xué)術(shù)會(huì)議論文集[C];1994年

3 孫海;范京;劉惠華;;漢語(yǔ)連續(xù)語(yǔ)音中的單字起止點(diǎn)綜合判別的新方法[A];第十屆全國(guó)信號(hào)處理學(xué)術(shù)年會(huì)（CCSP-2001）論文集[C];2001年

4 吳及;許海天;王作英;;連續(xù)數(shù)字串識(shí)別中語(yǔ)速的在線自適應(yīng)方法[A];第六屆全國(guó)人機(jī)語(yǔ)音通訊學(xué)術(shù)會(huì)議論文集[C];2001年

5 沈彩鳳;俞一彪;;采用三音節(jié)FO插值的連續(xù)語(yǔ)音聲調(diào)評(píng)測(cè)算法[A];2011'中國(guó)西部聲學(xué)學(xué)術(shù)交流會(huì)論文集[C];2011年

6 肖熙;王作英;;漢語(yǔ)連續(xù)語(yǔ)音聲調(diào)識(shí)別的HMM方法[A];第五屆全國(guó)人機(jī)語(yǔ)音通訊學(xué)術(shù)會(huì)議論文集[C];1998年

7 曹陽(yáng);黃泰翼;;基于小波變換的基頻提取和連續(xù)語(yǔ)音中基頻變化模式的分析[A];第四屆全國(guó)人機(jī)語(yǔ)音通訊學(xué)術(shù)會(huì)議論文集[C];1996年

8 朱思俞;石鋒;;不定人連續(xù)漢語(yǔ)音的四聲識(shí)別[A];第二屆全國(guó)人機(jī)語(yǔ)音通訊學(xué)術(shù)會(huì)議論文集[C];1992年

相關(guān)博士學(xué)位論文前1條

1 鐘金宏;基于音節(jié)的漢語(yǔ)連續(xù)語(yǔ)音聲調(diào)識(shí)別方法研究[D];合肥工業(yè)大學(xué);2001年

相關(guān)碩士學(xué)位論文前8條

1 范佳露;3-5歲聽(tīng)障兒童連續(xù)語(yǔ)音重復(fù)能力的特征及干預(yù)研究[D];華東師范大學(xué);2010年

2 張芳;聽(tīng)障與健聽(tīng)兒童連續(xù)語(yǔ)音切換能力的比較及應(yīng)用研究[D];華東師范大學(xué);2009年

3 韓虎;漢語(yǔ)連續(xù)語(yǔ)音的音節(jié)自動(dòng)標(biāo)注算法研究及實(shí)現(xiàn)[D];哈爾濱工業(yè)大學(xué);2008年

4 袁浩;連續(xù)語(yǔ)音中關(guān)鍵詞快速檢出的研究[D];哈爾濱工業(yè)大學(xué);2011年

5 何義華;基于飛行器的連續(xù)語(yǔ)音指令識(shí)別技術(shù)研究[D];南京航空航天大學(xué);2008年

6 陳斌;漢語(yǔ)連續(xù)語(yǔ)音聲韻母類別屬性檢測(cè)技術(shù)研究[D];解放軍信息工程大學(xué);2011年

7 嚴(yán)歡;漢語(yǔ)連續(xù)語(yǔ)音聲調(diào)及數(shù)字串識(shí)別系統(tǒng)的研究[D];哈爾濱理工大學(xué);2011年

8 施凝;中等詞匯量的漢語(yǔ)連續(xù)語(yǔ)音關(guān)鍵詞識(shí)別系統(tǒng)[D];同濟(jì)大學(xué);2006年

，

本文編號(hào)：2421318

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會(huì)員下載

Download by Member

本文鏈接：http://sikaile.net/kejilunwen/xinxigongchenglunwen/2421318.html

上一篇：一種基于數(shù)據(jù)預(yù)處理和卡爾曼濾波的溫室監(jiān)測(cè)數(shù)據(jù)融合算法
下一篇：狼群優(yōu)化的神經(jīng)網(wǎng)絡(luò)頻譜感知算法

論文發(fā)表

·知網(wǎng)|萬(wàn)方|維普|龍?jiān)磡省級(jí)|國(guó)家級(jí)|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于CNN的連續(xù)語(yǔ)音說(shuō)話人聲紋識(shí)別