基于率失真優(yōu)化的高效視頻編碼技術(shù)研究

發(fā)布時間：2018-04-30 22:34

本文選題：高效視頻編碼 + 率失真優(yōu)化�。� 參考：《哈爾濱工業(yè)大學》2014年博士論文

【摘要】：隨著互聯(lián)網(wǎng)技術(shù)對人們生活的不斷滲透,數(shù)字視頻的產(chǎn)生速度和數(shù)量增長迅速,人類社會已進入大數(shù)據(jù)時代。海量的視頻對于視頻的存儲和傳輸提出了更大的挑戰(zhàn),這也使得對數(shù)字視頻編碼標準的研究一直是學術(shù)界和工業(yè)界的熱點。2013年,新一代視頻編碼標準——高效視頻編碼(High Efficiency Video Coding,HEVC)正式發(fā)布,和上一代視頻編碼標準H.264/AVC相比,編碼性能獲得了大幅度的提升。HEVC在帶來高性能的同時也帶來了復(fù)雜度的大幅度增加,因此在實際應(yīng)用中對視頻編碼標準進行合理的優(yōu)化,降低編碼復(fù)雜度,從而提升視頻編碼效率具有重要的意義。本文立足于率失真優(yōu)化的基本理論,從碼率控制、幀內(nèi)編碼、幀間編碼以及主觀視覺四個層面探討對HEVC的率失真優(yōu)化技術(shù),主要研究內(nèi)容包括如下四個部分。第一,視頻需要有一個良好的碼率控制方法以確保編碼視頻的有效傳輸,目前HEVC中的碼率控制方法并沒有充分考慮HEVC新的編碼結(jié)構(gòu)和特性。本文基于HEVC中新的編碼結(jié)構(gòu)和特性提出了一種基于Rate-GOP的碼率控制方法。首先本文研究了Rate-GOP中幀間的率失真依賴性關(guān)系,并基于這種依賴性關(guān)系提出了基于率失真依賴性的率失真模型和基于Rate-GOP的率失真模型。其次,基于變換系數(shù)的混合拉普拉斯分布,本文提出一種變換域的二次ρ-R模型,并建立了R和QP之間的關(guān)系;最后基于上述模型,提出了一種基于率失真優(yōu)化的碼率分配方法。實驗結(jié)果表明,和相關(guān)算法相比,本文方法具有較高的碼率控制性能。第二,HEVC的幀內(nèi)編碼采用了更多的預(yù)測模式,最多達到35種,同時對于編碼單元采用基于四叉樹的劃分結(jié)構(gòu)以確定最優(yōu)的劃分模式,這大大增加了幀內(nèi)編碼的復(fù)雜度。為了有效降低HEVC幀內(nèi)編碼的復(fù)雜度,本文基于梯度方差、紋理以及預(yù)測模式的分布之間的關(guān)系,首先提出了一種自適應(yīng)的預(yù)測模式數(shù)量的收縮方法;其次,基于哈達瑪變換和量化,本文提出了一種預(yù)測模式?jīng)Q策模型以提升預(yù)測模式?jīng)Q策的準確性。實驗結(jié)果表明,本文方法有效減少了幀內(nèi)預(yù)測模式的數(shù)量,在客觀質(zhì)量下降幾乎可以忽略的情況下,有效降低了幀內(nèi)編碼的復(fù)雜度,提升了幀內(nèi)編碼的效率。同時,本文算法在AVS2平臺上也可以有效降低幀內(nèi)編碼的復(fù)雜度。第三,HEVC中的幀間編碼,依然采用了多參考幀的運動補償,同時對編碼單元采用了基于四叉樹的劃分結(jié)構(gòu),這大大增加了運動估計的復(fù)雜度,為了有效降低幀間編碼的復(fù)雜度,本文從參考幀選擇和編碼單元的劃分兩個方面提出了對幀間編碼的率失真優(yōu)化技術(shù)。首先,針對HEVC特有的參考幀集合的結(jié)構(gòu),基于參考幀分布的時空特性,提出了一種基于運動復(fù)雜度的參考幀快速決策方法,以減少多參考幀帶來的運動估計的復(fù)雜度增加。其次,基于對同一深度下編碼單元劃分與未劃分情況下的率失真代價分布的統(tǒng)計,提出了一種基于率失真代價的快速劃分決策方法,以減少不必要的劃分帶來的復(fù)雜度提升。實驗結(jié)果表明,本文的幀間率失真優(yōu)化方法有效降低了幀間編碼的復(fù)雜度,同時客觀質(zhì)量的下降幾乎可以忽略。第四,HEVC中,新的編碼技術(shù)的采用導致了視頻質(zhì)量的主觀影響因素發(fā)生了改變,這對如何從視覺的角度對HEVC進行優(yōu)化提出了新的課題,本文首先基于分歧歸一化理論,提出了一種適合HEVC的視覺因子的計算方法,然后對該視覺因子應(yīng)用非線性縮放方法進行縮放,以適合人眼的視覺特性,并用于編碼過程中對量化參數(shù)的調(diào)整;其次根據(jù)HEVC的編碼特性提出了一種基于視覺特性的率失真代價計算方法進行模式?jīng)Q策。實驗結(jié)果表明,該方法能夠?qū)崿F(xiàn)對量化參數(shù)的有效調(diào)節(jié),能夠較大幅度的提升視頻編碼的主觀性能。同時,本文算法在AVS2平臺上,也可以有效果提升視頻編碼的主觀性能。
[Abstract]:With the continuous infiltration of Internet technology to people's life, the speed and number of digital video are growing rapidly. Human society has entered the era of big data. Massive video has put forward more challenges to the storage and transmission of video. This also makes the research on digital video coding standard has been a hot spot in academic and industrial circles.201 In the 3 year, the new generation video coding standard, High Efficiency Video Coding (HEVC), was formally published, compared with the previous generation of video coding standard H.264/AVC, the coding performance has been greatly enhanced by the enhancement of.HEVC in high performance and a large increase in complexity. Therefore, video coding is used in practical applications. Based on the basic theory of rate distortion optimization, this paper, based on the basic theory of rate distortion optimization, discusses the rate de truth optimization technology for HEVC from four levels, rate control, intra coding, inter frame coding and subjective vision. The main research contents include the following four parts First, the video needs a good rate control method to ensure the effective transmission of coded video. At present, the rate control method in HEVC does not fully consider the new coding structure and characteristics of HEVC. Based on the new coding structure and characteristics in HEVC, this paper presents a rate control method based on Rate-GOP. The rate distortion dependence relationship between frames in Rate-GOP is given, and the rate distortion model based on rate distortion dependence and the rate distortion model based on Rate-GOP are proposed based on the dependence relationship. Secondly, based on the mixed Laplasse distribution of transform coefficients, a two order -R model of the transform domain is proposed, and the relationship between R and QP is established. Finally, based on the above model, a rate allocation method based on rate distortion optimization is proposed. The experimental results show that the proposed method has a higher rate control performance compared with the related algorithms. Second, the intra coding of HEVC uses more prediction modes, up to 35, and the four forked tree is used for the coding unit. Structure to determine the optimal partition pattern, which greatly increases the complexity of intra coding. In order to effectively reduce the complexity of HEVC intra coding, based on the relationship between the gradient variance, the texture and the distribution of the prediction mode, this paper first proposes an adaptive prediction model number contraction method; secondly, based on Hadamard transform and In this paper, a prediction model decision model is proposed to improve the accuracy of prediction model decision. The experimental results show that this method effectively reduces the number of intra prediction modes and reduces the complexity of intra coding effectively and improves the efficiency of intra coding under the situation that the objective quality is almost negligible. The algorithm can also effectively reduce the complexity of intra coding on the AVS2 platform. Third, inter frame coding in HEVC still uses the motion compensation of multiple reference frames, and the coding unit is based on the four fork tree division structure, which greatly increases the complexity of the motion estimation. In order to effectively reduce the complexity of the inter frame coding, this paper can effectively reduce the complexity of the inter frame coding. The rate distortion optimization technique for inter frame coding is proposed from two aspects of reference frame selection and coding unit division. Firstly, based on the structure of the reference frame set in HEVC, a fast decision method based on motion complexity is proposed based on the temporal and spatial characteristics of the reference frame distribution, in order to reduce the motion estimation caused by the multi reference frame. The complexity of the calculation is increased. Secondly, based on the statistics of the rate distortion cost distribution in the division and undivided conditions of the same depth, a fast partition decision method based on the rate distortion cost is proposed to reduce the complexity raised by the unnecessary division. The experimental results show that the inter frame rate distortion optimization method in this paper is used in this paper. The complexity of inter frame coding is effectively reduced, and the decrease of objective quality is almost negligible. Fourth, in HEVC, the adoption of new coding techniques has led to the change in the subjective factors of video quality. This is a new topic on how to optimize the HEVC from the visual angle. A method of computing the visual factor suitable for HEVC is proposed. Then the visual factor is zoomed by nonlinear scaling method to fit the visual characteristics of the human eye, and is used to adjust the quantization parameters in the coding process. Secondly, according to the coding characteristics of the HEVC, a method for calculating the rate distortion cost based on the visual characteristics is proposed. The experimental results show that this method can effectively adjust the quantized parameters and can greatly improve the subjective performance of video coding. At the same time, this algorithm can also improve the subjective performance of video coding on the AVS2 platform.

【學位授予單位】：哈爾濱工業(yè)大學
【學位級別】：博士
【學位授予年份】：2014
【分類號】：TN919.81

【共引文獻】

相關(guān)期刊論文前5條

1 楊洪敏;王祖強;徐輝;;AVS編碼器中變換量化和掃描的FPGA設(shè)計[J];電子技術(shù)應(yīng)用;2014年03期

2 申文龍;;基于HEVC的幀內(nèi)快速模式選擇算法[J];計算機與現(xiàn)代化;2014年05期

3 歐陽甸;張偉華;董騫;閆雪;;基于SVAC感興趣區(qū)域的碼率控制算法[J];數(shù)據(jù)采集與處理;2014年01期

4 彭昕;;淺談網(wǎng)絡(luò)數(shù)字視頻技術(shù)與博物館的發(fā)展活力[J];首都博物館論叢;2013年00期

5 宋傳鳴;趙長偉;劉丹;王相海;;3D多尺度幾何分析研究進展[J];軟件學報;2015年05期

相關(guān)博士學位論文前2條

1 黃晗;HEVC幀間/幀內(nèi)預(yù)測及優(yōu)化技術(shù)研究[D];北京交通大學;2014年

2 王建富;H.265/HEVC編碼加速算法研究[D];中國科學技術(shù)大學;2015年

相關(guān)碩士學位論文前10條

1 董昕;高清晰視頻會議系統(tǒng)的研究與設(shè)計[D];北京郵電大學;2012年

2 施現(xiàn)偉;基于ARM11的遠程視頻監(jiān)控系統(tǒng)設(shè)計[D];哈爾濱理工大學;2013年

3 周文帥;二維/三維視頻的多描述編碼方法研究[D];北京交通大學;2014年

4 潘軍威;基于嵌入式Linux的工程機械遠程監(jiān)控車載系統(tǒng)研究[D];浙江大學;2014年

5 張嘉煒;H.264視頻加密算法的研究[D];東北大學;2013年

6 彭爽;智能監(jiān)控系統(tǒng)中跨平臺播放器的設(shè)計與實現(xiàn)[D];浙江大學;2014年

7 孫越;面向網(wǎng)絡(luò)傳輸?shù)娜S視頻錯誤隱藏[D];寧波大學;2013年

8 王衛(wèi);融合RTP和H.264技術(shù)的遠程視頻監(jiān)控系統(tǒng)[D];杭州電子科技大學;2014年

9 宋楚杰;面向高校教學應(yīng)用的課堂錄播系統(tǒng)設(shè)計研究[D];沈陽師范大學;2014年

10 孔維國;基于H.264/AVC的視頻信息隱藏及檢測技術(shù)研究[D];西南交通大學;2014年

，

本文編號：1826558

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://sikaile.net/kejilunwen/wltx/1826558.html

上一篇：高頻雷達海洋表面電磁散射及海態(tài)遙感研究
下一篇：穿戴位置無關(guān)的手機用戶行為識別模型

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于率失真優(yōu)化的高效視頻編碼技術(shù)研究