基于DTW的語音關鍵詞檢出
發(fā)布時間:2018-07-24 19:03
【摘要】:針對少資源語言的語音關鍵詞檢出技術受到了廣泛關注。該文在基于動態(tài)時間規(guī)整(dynamic time warping,DTW)的關鍵詞檢出框架下,提出了基于音素邊界的局部匹配策略,用以解決基于樣例的語音關鍵詞檢出任務中的近似查詢問題。在QUESST 2014評測數(shù)據(jù)上采用多種特征進行了實驗驗證。實驗結果顯示:基于音素邊界的局部匹配策略不僅在近似查詢T2和T3任務上的檢出效果明顯提升,在精確查詢T1任務上也獲得了有效提升。隨后的系統(tǒng)融合實驗表明,該策略能夠大幅提升融合系統(tǒng)的性能。
[Abstract]:Speech keyword detection technology for less resource languages has been paid more and more attention. In this paper, a local matching strategy based on phoneme boundary is proposed under the framework of keyword detection based on dynamic time warping (dynamic time (dynamic time warping (dynamic time), which is used to solve the approximate query problem in speech keyword detection task based on sample. A variety of features are used to verify the QUESST 2014 data. The experimental results show that the local matching strategy based on phoneme boundary not only improves the detection effect on approximate T _ 2 and T _ 3 tasks, but also improves the precision query T _ 1 tasks. Subsequent system fusion experiments show that the strategy can greatly improve the performance of the fusion system.
【作者單位】: 西北工業(yè)大學計算機學院 陜西省語音與圖像信息處理重點實驗室;南洋理工大學Temasek實驗室;新加坡科技局資訊通信研究院 人類語言技術部;南洋理工大學計算機工程學院;
【基金】:國家自然科學基金面上項目(61571363)
【分類號】:TP391.3
本文編號:2142341
[Abstract]:Speech keyword detection technology for less resource languages has been paid more and more attention. In this paper, a local matching strategy based on phoneme boundary is proposed under the framework of keyword detection based on dynamic time warping (dynamic time (dynamic time warping (dynamic time), which is used to solve the approximate query problem in speech keyword detection task based on sample. A variety of features are used to verify the QUESST 2014 data. The experimental results show that the local matching strategy based on phoneme boundary not only improves the detection effect on approximate T _ 2 and T _ 3 tasks, but also improves the precision query T _ 1 tasks. Subsequent system fusion experiments show that the strategy can greatly improve the performance of the fusion system.
【作者單位】: 西北工業(yè)大學計算機學院 陜西省語音與圖像信息處理重點實驗室;南洋理工大學Temasek實驗室;新加坡科技局資訊通信研究院 人類語言技術部;南洋理工大學計算機工程學院;
【基金】:國家自然科學基金面上項目(61571363)
【分類號】:TP391.3
【相似文獻】
相關重要報紙文章 前2條
1 葉向榮 鄭民軍;大破語音關[N];中國教育報;2001年
2 山東省陽信縣翟王鎮(zhèn)韓箔中心小學 張玉蘭;語文課堂 讓學生快樂表達[N];學知報;2010年
相關碩士學位論文 前1條
1 朱林;語音關鍵詞檢測若干問題研究[D];北京郵電大學;2015年
,本文編號:2142341
本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2142341.html
最近更新
教材專著