Design and Implementation of a Life Detection Device Based on Speech Recognition Technology
Topics: audio life detector + speech endpoint detection; Source: Chengdu University of Technology, master's thesis, 2014
[Abstract]: This thesis arises from the Sichuan Province Earthquake Prevention and Disaster Reduction Social Service Project, part of the post-disaster recovery and reconstruction program following the Lushan earthquake in Sichuan Province. Building on a review of existing audio life detectors and the trend toward more intelligent and efficient devices, it examines the application of speech recognition to earthquake rescue and presents a corresponding design scheme.

When disaster strikes, lives are lost, buildings collapse, and transport and communications are cut off. Natural disasters, earthquakes foremost among them, are highly destructive, occur instantaneously, affect wide areas, are difficult to monitor and forecast, and have far-reaching social impact. The May 12 Wenchuan earthquake and the April 20 Lushan earthquake of recent years caused heavy losses to China's economic development and to people's lives and property. China's earthquake emergency rescue technology still lags behind that of developed countries, and research on related rescue technology is limited.

Earthquakes have remained frequent in recent years, and earthquake rescue continues to receive close attention from the CPC Central Committee and the State Council. China's disaster reduction and relief sector still needs a large amount of rescue equipment, yet high-technology rescue equipment with independent intellectual property rights is comparatively scarce. Earthquake prevention and disaster reduction is an important part of public safety: a public-welfare undertaking that bears on people's livelihood, on social stability, and on the safety of lives and property and sustainable socioeconomic development. Public safety is listed as a key field in the Outline of the National Medium- and Long-Term Program for Science and Technology Development (2006-2020), and its priority theme on monitoring and defense against major natural disasters explicitly calls for research and development of key technologies for monitoring, early warning, and emergency response to earthquakes, typhoons, rainstorms, floods, and geological hazards [9].

As China's economy and technology have advanced, related research topics have been proposed in programs ranging from the National Key Science and Technology Projects of the Tenth Five-Year Plan to the pre-research projects of the Eleventh Five-Year Plan, and a growing number of experts and scholars have joined the research effort and produced many results. However, much domestically developed rescue equipment performs poorly in actual rescue work because of poor material selection and inadequately considered structure. Developing rescue equipment of practical value that suits real rescue environments has therefore become the main direction.

Speech is an effective vital-sign signal and can serve as an effective rescue signal. Audio life detectors are widely used in earthquake rescue today, but most require a human operator to interpret the received information. During a rescue, the probe must be inserted into gaps found in the rubble, or sensors must be placed on relatively flat ground, and the readings are easily affected by on-site noise, which hampers identification.

The FPGA (field-programmable gate array) is a high-performance programmable logic device that evolved from the CPLD [10]. Users can configure the bus interfaces they need. Thanks to good stability and low power consumption, FPGAs are widely used in industrial control and high-speed signal processing, and IP (intellectual property) based SoC design can greatly speed up development.

Research route: study and design an FPGA-based device that integrates speech acquisition and speech recognition. The device captures audio signals from the surrounding environment, applies an adaptive filtering algorithm for noise reduction and speech enhancement, extracts Mel-frequency cepstral coefficients (MFCCs) from the enhanced speech as feature vectors, uses the feature vectors to train a BP (back-propagation) neural network, and finally uses the trained network to perform speech recognition.
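The noise-reduction stage of the research route can be illustrated with a small sketch. The abstract says only that an adaptive filtering algorithm is used; the LMS (least mean squares) noise canceller below is one common choice, assumed here for illustration and not confirmed as the thesis's algorithm. The two-input arrangement (a primary signal plus a noise-only reference), the names `lms_noise_canceller`, `make_demo_signals`, and `mean_square`, and all parameter values are hypothetical. It is plain Python for readability, not an FPGA implementation.

```python
import math
import random


def lms_noise_canceller(primary, reference, num_taps=8, step_size=0.005):
    """LMS adaptive noise cancellation (assumed algorithm, for illustration).

    primary:   samples containing speech plus noise
    reference: samples correlated with the noise only
    Returns the error signal, which is the enhanced-speech estimate.
    """
    weights = [0.0] * num_taps
    history = [0.0] * num_taps  # recent reference samples, newest first
    enhanced = []
    for desired, ref_sample in zip(primary, reference):
        history = [ref_sample] + history[:-1]
        # Estimate the noise component of the primary signal.
        noise_estimate = sum(w * h for w, h in zip(weights, history))
        error = desired - noise_estimate
        # LMS weight update: w <- w + 2 * mu * e * x
        weights = [w + 2.0 * step_size * error * h
                   for w, h in zip(weights, history)]
        enhanced.append(error)
    return enhanced


def make_demo_signals(num_samples, seed=0):
    """Synthetic test data: a sine tone stands in for speech (hypothetical)."""
    rng = random.Random(seed)
    reference = [rng.uniform(-1.0, 1.0) for _ in range(num_samples)]
    speech = [0.5 * math.sin(2.0 * math.pi * 0.01 * n)
              for n in range(num_samples)]
    primary = []
    for n in range(num_samples):
        previous = reference[n - 1] if n > 0 else 0.0
        # Noise reaches the primary input through a short 2-tap path.
        primary.append(speech[n] + 0.8 * reference[n] + 0.4 * previous)
    return speech, primary, reference


def mean_square(signal):
    """Average power of a sequence of samples."""
    return sum(s * s for s in signal) / len(signal)


if __name__ == "__main__":
    speech, primary, reference = make_demo_signals(4000)
    enhanced = lms_noise_canceller(primary, reference)
    half = len(speech) // 2  # skip the initial convergence period
    before = mean_square([p - s for p, s in zip(primary[half:], speech[half:])])
    after = mean_square([e - s for e, s in zip(enhanced[half:], speech[half:])])
    print("residual noise power before:", before)
    print("residual noise power after: ", after)
```

The filter output is the error signal because the adaptive weights can only model the part of the primary input that correlates with the reference, so what remains after subtraction approximates the speech. In a pipeline like the one described, this enhanced signal would then feed the MFCC feature-extraction step.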
[Degree-granting institution]: Chengdu University of Technology
[Degree level]: Master's
[Year degree granted]: 2014
[Classification number]: TN912.34
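[References]
Related journal articles (first 10)
1 Lü Weiqiang; Huang Li. A real-time speech endpoint detection method based on short-time energy plus zero-crossing rate [J]. Ordnance Industry Automation, 2009(09).
2 Sun Hao; Liu Zuojun; Yang Peng; Dong Ailong. An auditory navigation method for rescue robots based on speech recognition [J]. Journal of University of Science and Technology Beijing, 2012(01).
3 Yang Zhanjun; Yang Yingjie; Wang Qiang. Design and implementation of a DSP-based speech recognition system [J]. Journal of Northeast Dianli University, 2006(02).
4 Hong Jiaping. Application of an embedded speech recognition system based on the LD3320 [J]. Microcontrollers & Embedded Systems, 2012(02).
5 Cheng Shilei; Jiang Jiaojiao; Liu Cheng. Speech feature vector extraction based on FIR filters and its FPGA implementation [J]. Audio Engineering, 2013(10).
6 Zhang Jun. Practical experience with FPGA and CPLD devices [J]. Measurement & Control Technology, 2001(12).
7 Zhu Hengjun; Yu Hongbo; Wang Fazhi. A speech endpoint detection algorithm combining wavelet analysis and support vector machines [J]. Computer Science, 2012(06).
8 Zhao Ming; Wu Gang; Wang Liping. Life detectors and their technical principles [J]. Journal of Liaoning Teachers College (Natural Science Edition), 2012(02).
9 Song Fentao; Wang Mengying; Fu Zhiyuan. An overview of FPGA development [J]. Science & Technology Information, 2012(23).
10 Shi Shuiping; Li Shizuo. Linear predictive coding (LPC) and its application to audio files [J]. Modern Electronics Technique, 2004(04).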
Article ID: 1752217
Article link: http://sikaile.net/falvlunwen/zhishichanquanfa/1752217.html