基于哼唱的MIDI音頻檢索算法研究

發(fā)布時間：2018-06-29 14:47

本文選題：哼唱檢索 + MIDI　；參考：《山東科技大學(xué)》2017年碩士論文

【摘要】：隨著音樂數(shù)據(jù)庫爆炸式的增長,傳統(tǒng)的基于文本的音頻檢索給用戶帶來極大的不便。基于哼唱的MIDI音樂檢索是基于內(nèi)容的音樂檢索方式,它允許用戶不需要歌詞而只需哼唱旋律就可以檢索到自己需要的歌曲。本文的目標(biāo)是構(gòu)建完整的基于哼唱的MIDI音頻檢索算法并檢驗其可行性。本文的主要研究內(nèi)容如下:1.音頻特征提取。分析了音頻信號的時域、頻域和倒譜特征,并介紹了幾種基本的旋律輪廓的表達(dá),闡述了音頻信號的特征提取方法。2.基于HMM的哼唱檢索算法。建立了以音符為基礎(chǔ)的HMM模型,避免了音符切分。對音調(diào)進(jìn)行轉(zhuǎn)換,將音調(diào)轉(zhuǎn)換后的音高序列作為旋律的音高特征,從而克服了因哼唱者哼唱習(xí)慣和音域差別導(dǎo)致的差異。用500個哼唱片段的測試集測試算法的性能,達(dá)到了 TOP3為78%的識別率。3.基于深度學(xué)習(xí)的哼唱檢索算法。采用3層DBN網(wǎng)絡(luò)結(jié)構(gòu)得到每首歌曲的關(guān)鍵特征,保證旋律數(shù)據(jù)能精確描述歌曲旋律,解決了旋律特征不穩(wěn)定的情況。并采用了基于聚類的方法實現(xiàn)旋律特征的近鄰檢索。構(gòu)建了 200首MIDI格式的音樂庫,用42首wav格式的哼唱查詢文件驗證算法的性能,達(dá)到了 TOP3為81.0%的識別率。同時引入基于DBN的哼唱檢索算法與基于LSH的哼唱檢索算法的對比實驗,證明了基于DBN的檢索算法的優(yōu)良性能。上述兩個算法的核心部分都包括旋律特征提取和旋律特征匹配,這也是各個檢索算法著重研究的部分。MIDI音樂數(shù)據(jù)庫的旋律特征提取和哼唱旋律特征提取相關(guān)技術(shù)在各個算法中都有著重研究。
[Abstract]:With the explosive growth of music database, traditional text-based audio retrieval brings great inconvenience to users. Midi music retrieval based on humming is a content-based music retrieval method, which allows users to retrieve the songs they need without the lyrics but only by humming the melody. The goal of this paper is to construct a complete midi audio retrieval algorithm based on humming and to test its feasibility. The main contents of this paper are as follows: 1. Audio feature extraction. In this paper, the time domain, frequency domain and cepstrum characteristics of audio signal are analyzed, and the expression of several basic melodic contours is introduced, and the feature extraction method of audio signal. Hem retrieval algorithm based on hmm. The hmm model based on notes is established to avoid the segmentation of notes. In order to overcome the differences caused by humming habits and range differences, the pitch sequence after tone conversion is regarded as the pitch feature of the melody. The performance of the algorithm is tested with 500 humming test sets, and the recognition rate of TOP3 is 78%. 3. Hem retrieval algorithm based on deep learning. The key features of each song are obtained by using a three-layer DBN network structure, which ensures that the melody data can accurately describe the melody of the song, and solves the unstable situation of the melody characteristic. The nearest neighbor retrieval of melody feature is realized by clustering method. 200 music libraries in midi format are constructed and 42 wav format humming query files are used to verify the performance of the algorithm. The recognition rate of Top3 is 81.0%. At the same time, the comparison experiment between the humming retrieval algorithm based on DBN and the Hem retrieval algorithm based on LSH proves the excellent performance of the retrieval algorithm based on DBN. The core parts of the above two algorithms include melody feature extraction and melody feature matching. This is also the part of each retrieval algorithm. The melody feature extraction and humming melody feature extraction of midi music database are studied in each algorithm.
【學(xué)位授予單位】：山東科技大學(xué)
【學(xué)位級別】：碩士
【學(xué)位授予年份】：2017
【分類號】：TN912.3;TP391.3

【參考文獻(xiàn)】

相關(guān)期刊論文前10條

1 肖艷;王虎;;一種基于哼唱的小規(guī)模MIDI音樂檢索系統(tǒng)及實現(xiàn)[J];中國新通信;2017年03期

2 余凱;賈磊;陳雨強(qiáng);徐偉;;深度學(xué)習(xí)的昨天、今天和明天[J];計算機(jī)研究與發(fā)展;2013年09期

3 郭敏;張衛(wèi)強(qiáng);劉加;;一種基于幀-音符方式的哼唱檢索算法[J];清華大學(xué)學(xué)報(自然科學(xué)版);2011年04期

4 林小蘭;王曉光;王暉;;基于內(nèi)容的音樂檢索關(guān)鍵技術(shù)研究[J];中國傳媒大學(xué)學(xué)報(自然科學(xué)版);2010年04期

5 袁里馳;;基于改進(jìn)的隱馬爾科夫模型的語音識別方法[J];中南大學(xué)學(xué)報(自然科學(xué)版);2008年06期

6 羅凱;魏維;謝青松;;哼唱檢索中改進(jìn)的動態(tài)時間規(guī)整算法[J];計算機(jī)工程;2008年20期

7 趙芳;吳亞棟;宿繼奎;;基于音軌特征量的多音軌MIDI主旋律抽取方法[J];計算機(jī)工程;2007年02期

8 徐開闊;唐常杰;段磊;魏大剛;鐘義嘯;喬少杰;;正態(tài)分布下基于隱Markov模型的多聲道MIDI音樂檢索[J];四川大學(xué)學(xué)報(自然科學(xué)版);2006年03期

9 續(xù)鴻飛;肖明;;音頻檢索綜述[J];晉圖學(xué)刊;2005年06期

10 李雪瑩,劉寶旭,許榕生;字符串匹配技術(shù)研究[J];計算機(jī)工程;2004年22期

相關(guān)碩士學(xué)位論文前7條

1 孫潔;基于哼唱的MIDI音樂檢索系統(tǒng)的研究[D];西安建筑科技大學(xué);2013年

2 曹建紅;基于哼唱的音樂檢索技術(shù)研究[D];南京理工大學(xué);2009年

3 沙曉艷;HMM模型在哼唱檢索中的應(yīng)用[D];西北大學(xué);2008年

4 宋星華;基于哼唱的音樂檢索[D];南京理工大學(xué);2008年

5 陳家紅;哼唱檢索中哼唱信息處理方法的研究[D];南京理工大學(xué);2008年

6 陳旭;基于內(nèi)容的音頻哼唱識別及檢索系統(tǒng)[D];上海交通大學(xué);2008年

7 王薇;基于內(nèi)容的音頻檢索特征提取技術(shù)研究[D];上海交通大學(xué);2008年

，

本文編號：2082456

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://sikaile.net/kejilunwen/xinxigongchenglunwen/2082456.html

上一篇：大規(guī)模MIMO系統(tǒng)中基于用戶分類的動態(tài)導(dǎo)頻分配
下一篇：試驗靶場無線通信系統(tǒng)綜合效能評估方法

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于哼唱的MIDI音頻檢索算法研究