基于DNN的語言識(shí)別系統(tǒng)的研究與實(shí)現(xiàn)
[Abstract]:Language is one of the most common ways for people to communicate on a daily basis, and it is an indispensable skill. In the process of globalization, the obstacles to language communication are becoming more and more prominent. In this context, it is urgent to realize language recognition. Therefore, language recognition has become an important research topic in speech research in recent years. There are still many problems in the existing language recognition systems, such as extracting pure speech information from complex speech background, stripping the information with language attributes from confusing languages, and so on. Language recognition still needs to be further studied and explored. Language recognition (Language Identification,LID) is a biometric recognition technology which automatically distinguishes the language to which the speaker belongs according to speech, so as to identify the speaker's language. Phoneme based features and underlying acoustic features have been proved to be very effective in representing language category information. Although the performance of language recognition can be effectively improved by machine learning, the recognition rate still does not meet the requirements, especially for short-term speech segments, the recognition performance still needs to be improved. In recent years, language recognition based on DNN (Deep Neural Network,DNN has become a research focus in academia and industry because of the rise, wide application and good results of DNN. In this paper, DNN-based language recognition is the focus of research, and a perfect and good language recognition system is devoted to the completion of a perfect and good performance language recognition system. The main work has been done as follows: 1. Implement a language recognition system based on DNN. 2. A phoneme feature vector (DBF (Deep Bottleneck Features,DBF) feature based on the underlying acoustic feature is used, which is more able to express the language feature than the underlying acoustic feature and phoneme feature. Using a method of using DBF training DNN statistics to extract I-Vector, DBF is used instead of UBM (Universal Background Model,UBM in GMM (Gaussian Mixture Model,GMM) model to obtain more accurate statistics, and then improve the recognition efficiency. 4. The whole system is tested and analyzed. Firstly, the performance of DBF features is compared with that of SDC features. The results show that DBF features have stronger expression ability to language, and the performance of DBF features is significantly improved in short-term speech tasks, long-term speech tasks and obfuscating and dialect recognition tasks. Then, the performance of DBF-GMM-TV-based method and DNN-TV-based method is compared and analyzed. It is shown that the model domain can be used to estimate the model more effectively. Finally, the system performance is tested from two aspects: local test and network online test.
【學(xué)位授予單位】:電子科技大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2017
【分類號(hào)】:TN912.34
【參考文獻(xiàn)】
相關(guān)期刊論文 前9條
1 張翼飛;騰瀟琦;;一種改進(jìn)的有監(jiān)督訓(xùn)練的TV語種識(shí)別方法[J];計(jì)算機(jī)應(yīng)用與軟件;2016年09期
2 馬志強(qiáng);張澤廣;閆瑞;劉利民;馮永祥;蘇依拉;;基于N-Gram模型的蒙古語文本語種識(shí)別算法的研究[J];中文信息學(xué)報(bào);2016年01期
3 崔瑞蓮;宋彥;蔣兵;戴禮榮;;基于深度神經(jīng)網(wǎng)絡(luò)的語種識(shí)別[J];模式識(shí)別與人工智能;2015年12期
4 洪新海;宋彥;蔣兵;戴禮榮;;采用DBN的TV改進(jìn)方法在語種識(shí)別中的應(yīng)用[J];信號(hào)處理;2015年09期
5 楊緒魁;屈丹;張文林;;基于正則化i-Vector算法的語種識(shí)別[J];信息工程大學(xué)學(xué)報(bào);2015年02期
6 陳瑤玲;楊鑒;陳江;徐永華;;基于支持向量機(jī)的語種識(shí)別[J];信息技術(shù);2010年06期
7 劉明輝;戴蓓劏;解焱陸;;基于GMM多維概率輸出的SVM話者確認(rèn)[J];模式識(shí)別與人工智能;2008年01期
8 屈丹,王炳錫,魏鑫;基于GMM-UBM模型的語言辨識(shí)研究[J];信號(hào)處理;2003年01期
9 劉穎超,張紀(jì)元;梯度下降法[J];南京理工大學(xué)學(xué)報(bào)(自然科學(xué)版);1993年02期
相關(guān)會(huì)議論文 前1條
1 田W,
本文編號(hào):2480760
本文鏈接:http://sikaile.net/kejilunwen/xinxigongchenglunwen/2480760.html