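Research on Multi-Source Sound Direction Finding Based on Microphone Arrays
Published: 2018-04-10 00:35
Topic: microphone array; Focus: multi-source direction finding; Source: Nanjing University of Science and Technology, master's thesis, 2014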
[Abstract]: Multi-source sound direction finding based on a microphone array processes the mixture of source signals received by the array in order to determine the direction of each source. The technique has broad application prospects and practical significance in many fields. In civilian applications such as video/audio conferencing and speech recognition and enhancement, sound source direction finding is often used to estimate the speaker's position accurately and steer a camera, so that the speech signal from that position is enhanced automatically. In military applications it is widely used to detect aircraft, artillery, and snipers. The technique has therefore become one of the research hotspots in speech signal processing.
This thesis studies multi-source direction finding with microphone arrays, and summarizes and compares several traditional classes of sound source direction finding methods. Taking a typical two-element microphone array as the object of study and assuming a far-field multi-source model, it applies the Degenerate Unmixing Estimation Technique (DUET), which relies on the time-frequency orthogonality of speech signals, to sound source direction finding. The algorithm exploits the time-frequency sparsity and short-time orthogonality characteristic of speech, known as W-Disjoint Orthogonality (W-DO). A time-delay estimator built on this property is computationally cheap and simple to implement, and it can find the directions of multiple sources with only two microphones. However, when a source contains high-frequency components whose wavelength is less than twice the element spacing, this class of methods suffers from phase-wrapping ambiguity, and the element spacing cannot be reduced indefinitely because of physical size constraints. This limits the practical use of such methods. To address the problem, the thesis proposes a multi-source direction finding method for widely spaced microphone arrays based on iterative time-frequency masking; its iterative cancellation process significantly suppresses the effect of phase wrapping. In addition, by incorporating energy-based speech endpoint detection, the thesis gives the steps of a real-time version of the method. Simulation experiments and field experiments were carried out, and the results show that, for multi-source direction finding with widely spaced microphone arrays, the proposed method clearly outperforms conventional DUET-type methods and has practical application value.
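As an illustration of the conventional DUET-style delay estimation that the abstract describes (not the thesis's iterative time-frequency masking method), the following is a minimal Python sketch for two microphones. It assumes a far-field model, a speed of sound of 343 m/s, and an element spacing small enough that no phase wrapping occurs below the Nyquist frequency. All function names, parameter values, and the sign convention (a positive delay means the wavefront reaches microphone 2 later) are hypothetical choices made for this example. Two pure tones stand in for W-disjoint orthogonal speech sources.

```python
import numpy as np

def max_unambiguous_freq(spacing_m, c=343.0):
    """Highest frequency with no phase wrapping: wavelength >= 2 * spacing."""
    return c / (2.0 * spacing_m)

def stft(x, n_fft=512, hop=256):
    """Plain Hann-windowed STFT, shape (frames, n_fft // 2 + 1)."""
    win = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def delay_histogram(x1, x2, fs, spacing_m, c=343.0, n_fft=512, hop=256, n_bins=81):
    """Power-weighted histogram of per-bin inter-microphone delays (seconds)."""
    X1, X2 = stft(x1, n_fft, hop), stft(x2, n_fft, hop)
    freqs = np.fft.rfftfreq(n_fft, d=1.0 / fs)
    # Drop DC and any bin above the wrapping limit set by the spacing.
    keep = (freqs > 0) & (freqs < max_unambiguous_freq(spacing_m, c))
    omega = 2.0 * np.pi * freqs[keep]
    X1k, X2k = X1[:, keep], X2[:, keep]
    # Under W-DO each bin holds one source, so the cross-spectrum phase
    # equals -omega * delay for that source.
    delay = -np.angle(X2k * np.conj(X1k)) / omega
    weight = np.abs(X1k * X2k)
    max_delay = spacing_m / c
    hist, edges = np.histogram(delay.ravel(), bins=n_bins,
                               range=(-max_delay, max_delay),
                               weights=weight.ravel())
    centers = 0.5 * (edges[:-1] + edges[1:])
    return hist, centers

def pick_peaks(hist, centers, n_sources):
    """Return the delays of the n_sources strongest local maxima, sorted."""
    padded = np.concatenate(([0.0], hist, [0.0]))
    is_peak = (padded[1:-1] > padded[:-2]) & (padded[1:-1] >= padded[2:])
    idx = np.flatnonzero(is_peak)
    idx = idx[np.argsort(hist[idx])[::-1][:n_sources]]
    return np.sort(centers[idx])

def delay_to_angle_deg(delay_s, spacing_m, c=343.0):
    """Far-field angle from broadside: theta = arcsin(c * delay / spacing)."""
    return np.degrees(np.arcsin(np.clip(c * delay_s / spacing_m, -1.0, 1.0)))

# Demo: two tones with different delays, 2 cm spacing, 16 kHz sampling.
fs, spacing = 16000, 0.02
t = np.arange(fs) / fs
tau1, tau2 = 0.5 / fs, -0.4 / fs          # true delays in seconds
x1 = np.sin(2 * np.pi * 1000 * t) + np.sin(2 * np.pi * 2000 * t)
x2 = np.sin(2 * np.pi * 1000 * (t - tau1)) + np.sin(2 * np.pi * 2000 * (t - tau2))
hist, centers = delay_histogram(x1, x2, fs, spacing)
est_delays = pick_peaks(hist, centers, n_sources=2)
est_angles = delay_to_angle_deg(est_delays, spacing)
```

With a 2 cm spacing the wrapping limit returned by `max_unambiguous_freq` is about 8.6 kHz, above the 8 kHz Nyquist frequency, so every bin is usable. With a wider spacing the limit drops and the `keep` mask discards the upper bins, which is the restriction the thesis's method is designed to relax.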
[Degree-granting institution]: Nanjing University of Science and Technology
[Degree level]: Master's
[Year degree conferred]: 2014
[Classification number]: TN912.3
Article ID: 1728903
Article link: http://sikaile.net/kejilunwen/wltx/1728903.html