復(fù)雜背景下聲紋特征提取與識(shí)別
[Abstract]:With the rapid development of the Internet and information technology, voiceprint identification technology in finance, securities, social security, e-commerce, banking and other remote customer service identification and public security, The automatic detection and authentication of the specific identity of the military security department has a wide range of application value and foreground requirements. It is an important exploration direction in the field of sound signal processing and biometric information detection and recognition in the world today. In recent decades, great progress has been made in the research in this field. However, because the speaker's personality is easily influenced by the external factors and the complex variability of the actual environment, the bottleneck effect is becoming more and more prominent. It is very important to study the effective speech information detection method and the more robust feature extraction algorithm in complex background for improving the recognition rate of the system. The voiceprint recognition technology in complex background is based on the detection of sound and further feature extraction. After analyzing and processing, the recognition model is established. Finally, the recognition model is used to recognize the speaker. This paper mainly studies the speech endpoint detection method and feature extraction method to improve the recognition efficiency, the main work is as follows. Firstly, in the stage of sound preprocessing, two speech signal endpoint detection methods in noisy environment are proposed. According to the signal-to-noise ratio of different background complexity, the two threshold endpoint detection algorithms based on spectral entropy and short-time energy and zero-crossing rate are used, respectively. The experimental results show that, The dual-threshold endpoint detection algorithm based on short-time energy and zero-crossing rate is better in the case of high signal-to-noise ratio (SNR), and the algorithm based on spectral entropy is better when the background is low SNR. Secondly, in feature extraction stage, pitch period parameters are calculated by cepstrum method, and then the power spectrum of speech signal is converted to Mel cepstrum coefficient (MFCC), by Mel filter bank. Then, the improved feature extraction algorithm is used to make two parameters into one voiceprint feature parameter, and at the same time, the experimental simulation of them is carried out. Finally, in the stage of voiceprint recognition, a noisy feature recognition algorithm (SEMG) is proposed, that is, the speech signal is detected by spectral entropy based endpoint detection algorithm under complex background, and then the improved feature extraction algorithm is used to extract features. Finally, a Gao Si hybrid model, (GMM), is established for each speaker, and the effectiveness of the SEMG algorithm is verified by experiments, and the ideal results are obtained.
【學(xué)位授予單位】:中南林業(yè)科技大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2014
【分類號(hào)】:TN912.34
【參考文獻(xiàn)】
相關(guān)期刊論文 前9條
1 鄧浩江,王守覺,邢藏菊,李倩;基于聚類統(tǒng)計(jì)與文本無關(guān)的說話人識(shí)別研究[J];電路與系統(tǒng)學(xué)報(bào);2001年03期
2 賀永強(qiáng);石艷榮;;用CA認(rèn)證技術(shù)解決遠(yuǎn)程教育中的身份認(rèn)證[J];計(jì)算機(jī)安全;2009年05期
3 李虎生,劉加,劉潤生;語音識(shí)別說話人自適應(yīng)研究現(xiàn)狀及發(fā)展趨勢(shì)[J];電子學(xué)報(bào);2003年01期
4 張文耀,許剛,王裕國;循環(huán)AMDF及其語音基音周期估計(jì)算法[J];電子學(xué)報(bào);2003年06期
5 白瑩;趙振東;戚銀城;王斌;郭建勇;;基于小波神經(jīng)網(wǎng)絡(luò)的與文本無關(guān)說話人識(shí)別方法研究[J];電子與信息學(xué)報(bào);2006年06期
6 孫妍;楊曉非;;基于MFCC和LSP混合的語音特征參數(shù)的技術(shù)研究[J];計(jì)算機(jī)與信息技術(shù);2007年Z1期
7 盧官明;李海波;劉莉;;生物特征識(shí)別綜述[J];南京郵電大學(xué)學(xué)報(bào)(自然科學(xué)版);2007年01期
8 趙振東;張靜;李圓;胡喜梅;;基于GMM說話人分類的說話人識(shí)別方法研究[J];通信技術(shù);2009年10期
9 王偉;鄧輝文;;基于MFCC參數(shù)和VQ的說話人識(shí)別系統(tǒng)[J];儀器儀表學(xué)報(bào);2006年S3期
,本文編號(hào):2305639
本文鏈接:http://sikaile.net/kejilunwen/wltx/2305639.html