基于自適應(yīng)神經(jīng)模糊推理與隱馬爾可夫的語音分割研究
[Abstract]:Modern speech technology and research need high accuracy and high reliability of speech segmentation. Manual segmentation is always considered to be the most reliable and accurate method. However, the manual segmentation method is not only time-consuming and laborious, but also must be implemented by speech experts. This was a fatal flaw in big data's time, especially for large-scale speech banks. Therefore, it is necessary to develop automatic speech segmentation technology with high accuracy. The most important automatic speech segmentation technique is called forced calibration. In this method, the hidden Markov model (HMM) is used to construct different phoneme models. The speech signal is extracted into a set of feature vectors. The model can get the approximate phonemes boundary, but the results are not accurate. The traditional forced calibration system based on hidden Markov model is calculated with 20 millisecond tolerance in the TIMIT speech corpus, and the accuracy is between 80% and 89%. Up to now, many methods have been proposed to improve the automatic speech segmentation based on Hidden Markov. Some researchers have realized that the difference between automatic speech segmentation based on hidden Markov and artificial speech segmentation is that speech experts have knowledge of speech segmentation. Fuzzy logic can directly transform this knowledge into fuzzy rules that can be used in computers. However, fuzzy rules need to be carefully designed by experts, and the completeness of the rules cannot be guaranteed. To solve these problems, a more suitable method is proposed, which is the purpose of this study. Adaptive neural fuzzy inference system (ANFIS) is a machine learning method combining neural network and fuzzy inference system. Compared with other machine learning methods, it has the advantages of neural network and fuzzy inference system, and has better performance. Its advantages: simple, nonlinear, fuzzy reasoning rules, very suitable to solve the problems we mentioned earlier. In this paper, the adaptive neural fuzzy inference system is used to learn how to correct the location of segmentation points to compensate for the difference between manual segmentation and machine segmentation and the system segmentation error caused by Hidden Markov Model itself. The whole experiment is divided into two steps: first, context-free HMM is used to obtain the initial speech boundary. In the second step, the trained adaptive neural fuzzy inference system is used to modify the segmentation boundary obtained from the first step. The experiment uses TIMIT database. The experimental results show that the adaptive neural fuzzy inference system can significantly improve the accuracy of automatic speech segmentation based on Hidden Markov. In the TIMIT corpus, the adaptive neurofuzzy inference system can improve the accuracy from 86.25% to 92.08 by using 20 millisecond tolerance as the evaluation criterion. It also proves the effectiveness of adaptive neural fuzzy inference system in speech segmentation. In addition, our approach is easier to build and apply. In the future, we will continue to improve the accuracy of the system and apply it to other databases.
【學(xué)位授予單位】:天津大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2014
【分類號】:TN912.3
【相似文獻】
相關(guān)期刊論文 前10條
1 劉震;王厚軍;龍兵;張治國;;一種基于加權(quán)隱馬爾可夫的自回歸狀態(tài)預(yù)測模型[J];電子學(xué)報;2009年10期
2 李成,宋執(zhí)環(huán),李平;基于小波域隱馬爾可夫樹模型的過程趨勢分析[J];信息與控制;2005年03期
3 楊兵,謝維信;基于基因算法的隱馬爾可夫模型參數(shù)估計[J];系統(tǒng)工程與電子技術(shù);2002年07期
4 孫俊喜,趙永明,陳亞珠;基于小波域隱馬爾可夫樹模型的超聲圖象貝葉斯去噪[J];中國圖象圖形學(xué)報;2003年06期
5 周越,許晴;基于隱馬爾可夫復(fù)合樹模型的圖像紋理分析[J];數(shù)據(jù)采集與處理;2004年04期
6 王華華;周越;楊杰;戈新良;;基于正交余弦變換域概率主成分分析的嵌入隱馬爾可夫人臉識別模型[J];上海交通大學(xué)學(xué)報;2007年06期
7 景明利;周雪芹;;基于小波域的隱馬爾可夫樹模型的圖像去噪[J];昆明理工大學(xué)學(xué)報(理工版);2008年05期
8 彭玲,趙忠明,馬江林;基于樹狀分解隱馬爾可夫樹的紋理分類模型研究[J];武漢科技大學(xué)學(xué)報(自然科學(xué)版);2004年04期
9 江艷霞;周宏仁;敬忠良;;基于拉普拉斯臉和隱馬爾可夫的視頻人臉識別[J];計算機工程;2007年01期
10 蘇濤,張登福,畢篤彥;基于小波域分類隱馬爾可夫樹模型的圖像去噪[J];紅外與激光工程;2005年02期
相關(guān)碩士學(xué)位論文 前7條
1 苗聰聰;基于隱馬爾可夫樹模型與旋轉(zhuǎn)不變性的遙感圖像紋理檢索方法研究[D];中國礦業(yè)大學(xué);2015年
2 李遠林;基于連續(xù)隱馬爾可夫的蘭州PM_(10)污染提前24小時預(yù)測研究[D];蘭州大學(xué);2016年
3 董良;基于自適應(yīng)神經(jīng)模糊推理與隱馬爾可夫的語音分割研究[D];天津大學(xué);2014年
4 鐘微;基于隱馬爾可夫協(xié)議分析的無線網(wǎng)絡(luò)入侵檢測技術(shù)研究[D];電子科技大學(xué);2013年
5 馬晶晶;基于隱馬爾可夫理論的駕駛意圖辨識研究[D];長沙理工大學(xué);2012年
6 韓景靈;基于協(xié)議的隱馬爾可夫網(wǎng)絡(luò)入侵檢測系統(tǒng)研究[D];山西大學(xué);2007年
7 葛馨遠;小波域HMT模型的應(yīng)用研究[D];華北電力大學(xué)(北京);2009年
,本文編號:2392821
本文鏈接:http://sikaile.net/kejilunwen/wltx/2392821.html