天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

能量受限條件下的手語(yǔ)視頻編碼方法研究

發(fā)布時(shí)間:2018-10-15 17:57
【摘要】:手語(yǔ)是由手形、手臂運(yùn)動(dòng)并輔之以表情、唇動(dòng)以及其他體勢(shì)表達(dá)思想的視覺(jué)語(yǔ)言,是聾啞人進(jìn)行交流的最自然方式。與頭肩視頻不同,手語(yǔ)視頻由于增加了手形、手臂運(yùn)動(dòng),并且存在手臉遮擋現(xiàn)象,所以更為復(fù)雜,對(duì)其進(jìn)行研究難度更大。和手語(yǔ)視頻識(shí)別與合成研究相比,目前針對(duì)手語(yǔ)視頻的編碼研究還較少,且大多數(shù)都是基于率失真(Rate-Distortion, R-D)理論,以給定編碼碼率為約束,研究編碼碼率和失真之間的關(guān)系,使重建手語(yǔ)視頻的失真最小。但是,隨著無(wú)線(xiàn)網(wǎng)絡(luò)帶寬的快速增加和新一代視頻編碼標(biāo)準(zhǔn)H.264的廣泛應(yīng)用,編碼碼率的約束性已經(jīng)越來(lái)越弱,而無(wú)線(xiàn)視頻終端在功耗上所受的制約卻越來(lái)越強(qiáng)。因此,如何在無(wú)線(xiàn)視頻終端能量有限的約束條件下,使手語(yǔ)視頻經(jīng)編碼后的失真最小,減小能耗、延長(zhǎng)電池的更新周期已成為一個(gè)迫切需要解決的問(wèn)題。 本論文對(duì)能量受限條件下的手語(yǔ)視頻編碼進(jìn)行了深入的研究,目的是利用聾啞人視覺(jué)選擇注意機(jī)制、功率率失真理論和感興趣區(qū)能量分配視頻編碼方法實(shí)現(xiàn)手語(yǔ)視頻編碼功耗、編碼碼率和編碼失真之間的動(dòng)態(tài)平衡優(yōu)化,在確保手語(yǔ)視頻主客觀編碼質(zhì)量的同時(shí),盡可能降低無(wú)線(xiàn)視頻終端總體功耗,延長(zhǎng)電池更新周期,為解決能量受限條件下聾啞人手語(yǔ)視頻編碼的最優(yōu)化參數(shù)配置和資源分配提供新理論和新方法。本論文的研究工作主要包括: (1)理論分析和實(shí)驗(yàn)統(tǒng)計(jì)了影響H.264手語(yǔ)視頻編碼復(fù)雜度的因素,將H.264手語(yǔ)視頻編碼器參數(shù)按照復(fù)雜度分為四種不同的級(jí)別,每種級(jí)別具有不同的編碼復(fù)雜度和編碼質(zhì)量,然后依據(jù)無(wú)線(xiàn)視頻終端電池能量和視頻運(yùn)動(dòng)復(fù)雜性自適應(yīng)地選擇編碼級(jí)別。實(shí)驗(yàn)結(jié)果表明該方法在保證手語(yǔ)視頻編碼質(zhì)量基本不變的同時(shí),能夠減少編碼器計(jì)算復(fù)雜度,節(jié)省無(wú)線(xiàn)視頻終端系統(tǒng)的計(jì)算資源。 (2)綜合考慮無(wú)線(xiàn)視頻終端電池能量的時(shí)變性和聾啞人視覺(jué)注意機(jī)制的不平衡性,建立了感興趣區(qū)能量感知手語(yǔ)視頻編碼方法,該方法在幀層依據(jù)無(wú)線(xiàn)視頻終端當(dāng)前可使用電池能量和視頻幀復(fù)雜度確定參考幀數(shù)和搜素范圍,在宏塊層依據(jù)手語(yǔ)視頻不同宏塊區(qū)域的視覺(jué)重要性確定宏塊預(yù)測(cè)模式和量化系數(shù),最后根據(jù)幀層和宏塊層共同確定的參數(shù)進(jìn)行編碼。實(shí)驗(yàn)結(jié)果表明該方法在保證手語(yǔ)視頻感興趣區(qū)編碼質(zhì)量的同時(shí),能夠進(jìn)一步減少編碼器計(jì)算復(fù)雜度,節(jié)省無(wú)線(xiàn)視頻終端系統(tǒng)的計(jì)算資源。 (3)詳細(xì)分析了H.264幀內(nèi)、幀間和跳幀三種編碼模式的功率率失真(Power-Rate-Distortion,P-R-D)特性,在此基礎(chǔ)上,分別建立了編碼一幀手語(yǔ)視頻的能耗模型和P-R-D模型,并提出了優(yōu)化一幀視頻中采用幀內(nèi)、幀間和跳幀編碼模式宏塊個(gè)數(shù)的算法,實(shí)驗(yàn)表明所提出的P-R-D模型和實(shí)測(cè)P-R-D性能相吻合。 (4)針對(duì)手臉遮擋條件下的手語(yǔ)視頻手勢(shì)檢測(cè)問(wèn)題,提出一種基于力場(chǎng)(Force Field)轉(zhuǎn)換的手勢(shì)檢測(cè)方法。該方法首先分別計(jì)算手臉遮擋幀和純臉部幀的力場(chǎng)圖像,然后將力場(chǎng)圖像分塊并統(tǒng)計(jì)各分塊直方圖特征,再將相同空間位置的分塊直方圖對(duì)應(yīng)相減,得到各分塊直方圖灰度分量差,最后將各分塊直方圖灰度分量差與灰度閾值進(jìn)行比較獲得手部位置。實(shí)驗(yàn)證明該方法能夠?qū)崟r(shí)進(jìn)行手臉遮擋條件下的手勢(shì)檢測(cè)。
[Abstract]:Sign language is the most natural way for the deaf and mute to communicate with the visual language of expression, lip movement and other body potential expression. Different from head-shoulder video, sign language video is more complicated and more difficult to study because of the increase of hand shape and arm movement. Compared with the research of sign language video recognition and synthesis, the current coding research for sign language video is less, and most of them are rate-distortion (R-D) theory, and the relationship between coding rate and distortion is studied based on rate-distortion (R-D) theory, so that the distortion of reconstructed sign language video is minimized. However, with the rapid increase of wireless network bandwidth and the wide application of new generation video coding standard H.264, the restriction of coding rate has become weaker and stronger, while the limitation of wireless video terminal in power consumption is becoming stronger and stronger. Therefore, how to minimize the distortion of sign language video, reduce energy consumption and prolong battery renewal cycle has become an urgent problem under the condition of limited energy of wireless video terminal. This paper makes an in-depth study of sign language video coding under energy-limited conditions with the aim of realizing sign language video coding by using the visual selection attention mechanism of the deaf-mute, the power rate distortion theory and the energy distribution video coding method of the region of interest. the dynamic balance optimization between power consumption, coding code rate and coding distortion can reduce the overall power consumption of the wireless video terminal as much as possible while ensuring the subjective and objective coding quality of the sign language video, New theory and new method for optimizing parameter configuration and resource allocation for deaf-mute sign language video coding under energy-limited condition Methods: The research work of this thesis mainly comprises the following steps: (1) theoretical analysis and experiment statistics influence factors influencing the video coding complexity of H.264 sign language, and divides the parameters of the H.264 sign language video coder into four different levels according to the complexity, and then adaptively selects according to the energy of the battery and the complexity of the video motion of the wireless video terminal. The experiment results show that the method can reduce the computational complexity of the encoder and save the wireless video terminal system while ensuring the quality of the sign language video coding is basically unchanged. (2) the energy perception of the region of interest is established by comprehensively considering the imbalance of the energy of the wireless video terminal battery and the visual attention mechanism of the deaf-mute; the method comprises the following steps of: determining the reference frame number and the search element range according to the current available battery energy and the video frame complexity of the wireless video terminal according to the current available battery energy and the video frame complexity of the wireless video terminal; determining the macro block according to the visual importance of different macro block areas of the sign language video at the macro block layer; the measurement mode and the quantization coefficient are finally determined according to the frame layer and the macro block layer; The experimental results show that the method can reduce the computational complexity of the encoder and save the wireless video at the same time of guaranteeing the coding quality of the sign language video ROI. Power-Rate-Distance (P-R-D) characteristics of three coding modes of H. 264 frame, inter-frame and inter-frame coding modes are analyzed in detail. On this basis, the energy consumption model and P-R-D model of coded frame sign language video are respectively set up. An algorithm is used to optimize the number of macro blocks in frame, inter-frame and skip coding mode in one frame of video. The experiment results show that the proposed P-R-D model and reality The performance of P-R-D is matched. (4) The force field (Force F) is proposed for sign language video gesture detection under the shielding condition of hand face. The method comprises the following steps of: respectively calculating a force field image of a hand face shielding frame and a pure face frame, in that method, the gray component difference of each block histogram is obtain, and finally, the gray component difference of each block histogram is equal to that of each block histogram, The gray threshold is compared to obtain the hand position. The experiment proves that the method can be used in real time
【學(xué)位授予單位】:蘭州理工大學(xué)
【學(xué)位級(jí)別】:博士
【學(xué)位授予年份】:2014
【分類(lèi)號(hào)】:TN919.81

【參考文獻(xiàn)】

相關(guān)期刊論文 前10條

1 劉鵬宇;何絮;賈克斌;;對(duì)特定模式進(jìn)行預(yù)判的H.264幀間快速編碼算法[J];兵工學(xué)報(bào);2011年04期

2 崔玉斌;蔡安妮;;一種新穎的H.264幀內(nèi)預(yù)測(cè)快速算法[J];北京郵電大學(xué)學(xué)報(bào);2008年02期

3 韋耿;王亮;朱斌;;無(wú)線(xiàn)移動(dòng)環(huán)境視頻編碼動(dòng)態(tài)功耗模型研究[J];傳感技術(shù)學(xué)報(bào);2009年03期

4 張淑芳;李華;;基于H.264的多參考幀快速選擇算法[J];電子學(xué)報(bào);2009年01期

5 吳曉軍;白世軍;盧文濤;;基于H.264視頻編碼的運(yùn)動(dòng)估計(jì)算法優(yōu)化[J];電子學(xué)報(bào);2009年11期

6 周宇;陳熙霖;趙德斌;姚鴻勛;高文;;基于數(shù)據(jù)生成的手語(yǔ)識(shí)別自適應(yīng)方法[J];高技術(shù)通訊;2009年12期

7 何書(shū)前;倪江群;石春;;一種分層判決結(jié)構(gòu)的H.264/AVC快速幀間模式選擇方法[J];電子學(xué)報(bào);2013年11期

8 曹昕燕;趙繼印;李敏;;基于膚色和運(yùn)動(dòng)檢測(cè)技術(shù)的單目視覺(jué)手勢(shì)分割[J];湖南大學(xué)學(xué)報(bào)(自然科學(xué)版);2011年01期

9 楊春玲;王華興;;基于結(jié)構(gòu)相似度的H.264快速運(yùn)動(dòng)估計(jì)算法[J];華南理工大學(xué)學(xué)報(bào)(自然科學(xué)版);2008年08期

10 張良國(guó);高文;陳熙霖;陳益強(qiáng);王春立;;面向中等詞匯量的中國(guó)手語(yǔ)視覺(jué)識(shí)別系統(tǒng)[J];計(jì)算機(jī)研究與發(fā)展;2006年03期

相關(guān)博士學(xué)位論文 前2條

1 韋耿;視頻編碼功率率失真模型及低復(fù)雜度算法研究[D];華中科技大學(xué);2007年

2 李斌;面向高性能視頻編碼標(biāo)準(zhǔn)的率失真優(yōu)化技術(shù)研究[D];中國(guó)科學(xué)技術(shù)大學(xué);2013年

,

本文編號(hào):2273345

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/wltx/2273345.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶(hù)e7686***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com