能量受限條件下的手語(yǔ)視頻編碼方法研究
[Abstract]:Sign language is the most natural way for the deaf and mute to communicate with the visual language of expression, lip movement and other body potential expression. Different from head-shoulder video, sign language video is more complicated and more difficult to study because of the increase of hand shape and arm movement. Compared with the research of sign language video recognition and synthesis, the current coding research for sign language video is less, and most of them are rate-distortion (R-D) theory, and the relationship between coding rate and distortion is studied based on rate-distortion (R-D) theory, so that the distortion of reconstructed sign language video is minimized. However, with the rapid increase of wireless network bandwidth and the wide application of new generation video coding standard H.264, the restriction of coding rate has become weaker and stronger, while the limitation of wireless video terminal in power consumption is becoming stronger and stronger. Therefore, how to minimize the distortion of sign language video, reduce energy consumption and prolong battery renewal cycle has become an urgent problem under the condition of limited energy of wireless video terminal. This paper makes an in-depth study of sign language video coding under energy-limited conditions with the aim of realizing sign language video coding by using the visual selection attention mechanism of the deaf-mute, the power rate distortion theory and the energy distribution video coding method of the region of interest. the dynamic balance optimization between power consumption, coding code rate and coding distortion can reduce the overall power consumption of the wireless video terminal as much as possible while ensuring the subjective and objective coding quality of the sign language video, New theory and new method for optimizing parameter configuration and resource allocation for deaf-mute sign language video coding under energy-limited condition Methods: The research work of this thesis mainly comprises the following steps: (1) theoretical analysis and experiment statistics influence factors influencing the video coding complexity of H.264 sign language, and divides the parameters of the H.264 sign language video coder into four different levels according to the complexity, and then adaptively selects according to the energy of the battery and the complexity of the video motion of the wireless video terminal. The experiment results show that the method can reduce the computational complexity of the encoder and save the wireless video terminal system while ensuring the quality of the sign language video coding is basically unchanged. (2) the energy perception of the region of interest is established by comprehensively considering the imbalance of the energy of the wireless video terminal battery and the visual attention mechanism of the deaf-mute; the method comprises the following steps of: determining the reference frame number and the search element range according to the current available battery energy and the video frame complexity of the wireless video terminal according to the current available battery energy and the video frame complexity of the wireless video terminal; determining the macro block according to the visual importance of different macro block areas of the sign language video at the macro block layer; the measurement mode and the quantization coefficient are finally determined according to the frame layer and the macro block layer; The experimental results show that the method can reduce the computational complexity of the encoder and save the wireless video at the same time of guaranteeing the coding quality of the sign language video ROI. Power-Rate-Distance (P-R-D) characteristics of three coding modes of H. 264 frame, inter-frame and inter-frame coding modes are analyzed in detail. On this basis, the energy consumption model and P-R-D model of coded frame sign language video are respectively set up. An algorithm is used to optimize the number of macro blocks in frame, inter-frame and skip coding mode in one frame of video. The experiment results show that the proposed P-R-D model and reality The performance of P-R-D is matched. (4) The force field (Force F) is proposed for sign language video gesture detection under the shielding condition of hand face. The method comprises the following steps of: respectively calculating a force field image of a hand face shielding frame and a pure face frame, in that method, the gray component difference of each block histogram is obtain, and finally, the gray component difference of each block histogram is equal to that of each block histogram, The gray threshold is compared to obtain the hand position. The experiment proves that the method can be used in real time
【學(xué)位授予單位】:蘭州理工大學(xué)
【學(xué)位級(jí)別】:博士
【學(xué)位授予年份】:2014
【分類(lèi)號(hào)】:TN919.81
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 劉鵬宇;何絮;賈克斌;;對(duì)特定模式進(jìn)行預(yù)判的H.264幀間快速編碼算法[J];兵工學(xué)報(bào);2011年04期
2 崔玉斌;蔡安妮;;一種新穎的H.264幀內(nèi)預(yù)測(cè)快速算法[J];北京郵電大學(xué)學(xué)報(bào);2008年02期
3 韋耿;王亮;朱斌;;無(wú)線(xiàn)移動(dòng)環(huán)境視頻編碼動(dòng)態(tài)功耗模型研究[J];傳感技術(shù)學(xué)報(bào);2009年03期
4 張淑芳;李華;;基于H.264的多參考幀快速選擇算法[J];電子學(xué)報(bào);2009年01期
5 吳曉軍;白世軍;盧文濤;;基于H.264視頻編碼的運(yùn)動(dòng)估計(jì)算法優(yōu)化[J];電子學(xué)報(bào);2009年11期
6 周宇;陳熙霖;趙德斌;姚鴻勛;高文;;基于數(shù)據(jù)生成的手語(yǔ)識(shí)別自適應(yīng)方法[J];高技術(shù)通訊;2009年12期
7 何書(shū)前;倪江群;石春;;一種分層判決結(jié)構(gòu)的H.264/AVC快速幀間模式選擇方法[J];電子學(xué)報(bào);2013年11期
8 曹昕燕;趙繼印;李敏;;基于膚色和運(yùn)動(dòng)檢測(cè)技術(shù)的單目視覺(jué)手勢(shì)分割[J];湖南大學(xué)學(xué)報(bào)(自然科學(xué)版);2011年01期
9 楊春玲;王華興;;基于結(jié)構(gòu)相似度的H.264快速運(yùn)動(dòng)估計(jì)算法[J];華南理工大學(xué)學(xué)報(bào)(自然科學(xué)版);2008年08期
10 張良國(guó);高文;陳熙霖;陳益強(qiáng);王春立;;面向中等詞匯量的中國(guó)手語(yǔ)視覺(jué)識(shí)別系統(tǒng)[J];計(jì)算機(jī)研究與發(fā)展;2006年03期
相關(guān)博士學(xué)位論文 前2條
1 韋耿;視頻編碼功率率失真模型及低復(fù)雜度算法研究[D];華中科技大學(xué);2007年
2 李斌;面向高性能視頻編碼標(biāo)準(zhǔn)的率失真優(yōu)化技術(shù)研究[D];中國(guó)科學(xué)技術(shù)大學(xué);2013年
,本文編號(hào):2273345
本文鏈接:http://sikaile.net/kejilunwen/wltx/2273345.html