
Research and Implementation of Speech Recognition in Video Conferencing

Published: 2018-03-01 18:09

  Keywords: video conferencing; speech recognition; Android platform. Source: South China University of Technology, master's thesis, 2014. Document type: degree thesis


[Abstract]: Video conferencing, a means of remote, real-time information exchange and interaction, is widely used in healthcare, education, finance, government, and other fields. Traditional video conferencing systems are operated mainly by manual control. As technology advances and expectations for user experience rise, applying speech recognition to video conferencing systems has practical value. Speech recognition is the process by which a computer converts human speech signals into the corresponding text or commands through recognition and understanding. It is gradually becoming a key human-computer interface technology, and its applications have grown into a competitive, emerging high-tech industry.

Against this background, this thesis applies speech recognition to a video conferencing system. Preset voice commands are recognized and used to operate and control the conference, so that voice control replaces manual control through a mouse, keyboard, or mobile smart device, making the system more user-friendly and intelligent.

The work is based on the CoolView video conferencing system and builds on its remote control for the Android platform. It designs the overall architecture of a speech recognition system on the remote-control platform and divides it into functional modules. According to the usage scenarios of the remote control, two recognizers are implemented: an online speech recognition system based on Google speech recognition technology, and a local speech recognition system based on the CMU PocketSphinx engine. The online system is used to select a conference. The local system, a small-vocabulary recognizer, is used by the remote control to control its controlled terminal. In addition, to reduce the influence of ambient noise and improve the quality of the speech signal, the system includes an audio processing module that performs noise suppression, lossless audio compression, and related processing.

Finally, testing shows that the implemented system meets the basic operating requirements of the video conferencing system, which verifies the feasibility of applying speech recognition to video conferencing. The local small-vocabulary system also achieves a relatively high recognition rate and a relatively short recognition time, greatly improving the user experience.
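The local small-vocabulary path described in the abstract can be illustrated with a minimal sketch: a recognizer returns a text hypothesis, and a fixed lookup table turns it into a control action. This is only an illustration under stated assumptions, not the thesis's implementation. The abstract does not list the actual commands, so the `Action` names, the phrases in `COMMANDS`, and the `normalize` and `dispatch` helpers are all hypothetical, and no PocketSphinx or Google API is called here.

```python
from enum import Enum
from typing import Optional


class Action(Enum):
    """Hypothetical control actions; the abstract does not name its commands."""
    MUTE = "mute"
    UNMUTE = "unmute"
    VOLUME_UP = "volume_up"
    VOLUME_DOWN = "volume_down"
    LEAVE_MEETING = "leave_meeting"


# Small, fixed vocabulary (assumed phrases): each one maps to exactly one action.
COMMANDS = {
    "mute": Action.MUTE,
    "unmute": Action.UNMUTE,
    "volume up": Action.VOLUME_UP,
    "volume down": Action.VOLUME_DOWN,
    "leave meeting": Action.LEAVE_MEETING,
}


def normalize(hypothesis: str) -> str:
    """Lower-case the recognizer output and collapse runs of whitespace."""
    return " ".join(hypothesis.lower().split())


def dispatch(hypothesis: str) -> Optional[Action]:
    """Return the action for a recognized phrase, or None if out of vocabulary."""
    return COMMANDS.get(normalize(hypothesis))
```

Rejecting anything outside the table, rather than guessing the nearest command, keeps a misrecognition from triggering an unintended action on the controlled terminal.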
[Degree-granting institution]: South China University of Technology
[Degree level]: Master's
[Year degree awarded]: 2014
[Classification number]: TN912.34




Article ID: 1552999


Article link: http://sikaile.net/kejilunwen/wltx/1552999.html

