Research on High Efficiency Video Coding Techniques Based on Rate-Distortion Optimization
Topics: High Efficiency Video Coding + rate-distortion optimization. Source: Harbin Institute of Technology, 2014 doctoral dissertation
[Abstract]: As Internet technology reaches further into everyday life, digital video is being produced at a rapidly growing rate and volume, and human society has entered the era of big data. This mass of video poses greater challenges for storage and transmission, which has kept research on digital video coding standards a focus of both academia and industry. In 2013 the new-generation video coding standard, High Efficiency Video Coding (HEVC), was officially released; compared with the previous-generation standard H.264/AVC, its coding performance is substantially higher. That performance comes with a large increase in complexity, so in practical applications it matters to optimize the encoder sensibly, reducing encoding complexity and thereby improving coding efficiency. Grounded in the basic theory of rate-distortion optimization, this dissertation examines rate-distortion optimization techniques for HEVC at four levels: rate control, intra coding, inter coding, and subjective visual quality. The main work consists of the following four parts.
First, video needs a good rate control method to ensure that the coded video can be transmitted effectively, yet the current rate control method in HEVC does not fully account for HEVC's new coding structures and characteristics. Building on those structures and characteristics, the dissertation proposes a rate control method based on the Rate-GOP. It first studies the rate-distortion dependency between frames within a Rate-GOP and, from that dependency, derives a dependency-based rate-distortion model and a Rate-GOP-based rate-distortion model. Next, assuming a mixture Laplacian distribution of the transform coefficients, it proposes a quadratic ρ-R model in the transform domain and establishes the relationship between R and QP. Finally, on the basis of these models, it proposes a bit allocation method based on rate-distortion optimization. Experimental results show that the proposed method achieves higher rate control performance than related algorithms.
Second, HEVC intra coding uses more prediction modes, up to 35, and applies a quadtree-based partitioning structure to coding units to determine the optimal partition, which greatly increases intra coding complexity. To reduce this complexity, the dissertation first proposes an adaptive method for shrinking the number of candidate prediction modes, based on the relationship among gradient variance, texture, and the distribution of prediction modes. It then proposes a prediction mode decision model, based on the Hadamard transform and quantization, to improve the accuracy of mode decisions. Experimental results show that the method effectively reduces the number of intra prediction modes and lowers intra coding complexity with an almost negligible loss in objective quality, improving intra coding efficiency. The algorithm also effectively reduces intra coding complexity on the AVS2 platform.
Third, inter coding in HEVC still uses motion compensation with multiple reference frames and also applies quadtree-based partitioning to coding units, which greatly increases the complexity of motion estimation. To reduce inter coding complexity, the dissertation proposes rate-distortion optimization techniques for inter coding from two angles: reference frame selection and coding unit partitioning. First, targeting the reference picture set structure specific to HEVC and using the spatio-temporal characteristics of the reference frame distribution, it proposes a fast reference frame decision method based on motion complexity, which limits the extra motion estimation cost introduced by multiple reference frames. Second, based on statistics of the rate-distortion cost distributions of split and unsplit coding units at the same depth, it proposes a fast split decision method based on rate-distortion cost, which avoids the complexity of unnecessary splits. Experimental results show that these inter coding methods effectively reduce inter coding complexity while the loss in objective quality is almost negligible.
Fourth, the new coding tools adopted in HEVC have changed the factors that affect subjective video quality, which raises the question of how to optimize HEVC from a perceptual standpoint. Based on divisive normalization theory, the dissertation first proposes a method for computing a perceptual factor suited to HEVC; the factor is then rescaled with a nonlinear scaling method to match the characteristics of human vision and is used to adjust the quantization parameter during encoding. Second, following the coding characteristics of HEVC, it proposes a perceptually based rate-distortion cost calculation for mode decision. Experimental results show that the method adjusts the quantization parameter effectively and considerably improves the subjective performance of video coding. The algorithm also effectively improves subjective coding performance on the AVS2 platform.
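The rate-distortion optimization that the abstract builds on is commonly expressed as minimizing a Lagrangian cost J = D + λ·R over candidate coding choices. The sketch below illustrates only that general principle; the candidate names, distortion values, and bit counts are made-up numbers for illustration and are not taken from the dissertation or from any encoder.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """One hypothetical coding choice for a block (illustrative values only)."""
    name: str
    distortion: float  # e.g. sum of squared errors against the original block
    bits: float        # estimated number of bits needed to code the block


def rd_cost(candidate: Candidate, lam: float) -> float:
    """Lagrangian rate-distortion cost J = D + lambda * R."""
    return candidate.distortion + lam * candidate.bits


def best_mode(candidates: list[Candidate], lam: float) -> Candidate:
    """Return the candidate with the smallest rate-distortion cost."""
    return min(candidates, key=lambda c: rd_cost(c, lam))


# Assumed example data: cheaper modes distort more, costlier modes distort less.
candidates = [
    Candidate("skip", distortion=900.0, bits=2.0),
    Candidate("inter_2Nx2N", distortion=400.0, bits=40.0),
    Candidate("intra_NxN", distortion=250.0, bits=120.0),
]

for lam in (1.0, 5.0, 20.0):
    chosen = best_mode(candidates, lam)
    print(f"lambda={lam}: {chosen.name} (J={rd_cost(chosen, lam)})")
```

With these assumed numbers, a small λ favors the low-distortion candidate and a large λ favors the low-rate candidate, which is the trade-off that rate control and fast mode decision methods work with.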
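The second part of the abstract mentions a mode decision model based on the Hadamard transform. The abstract does not give that model, so the following is only a generic sketch of a related, widely used idea: ranking intra prediction candidates by the sum of absolute Hadamard-transformed differences (SATD) and keeping a short list for full rate-distortion evaluation. The 4x4 block, the three predictions, and the `shortlist` helper are hypothetical.

```python
# Unnormalized 4x4 Hadamard matrix (symmetric, so H equals its transpose).
H4 = [
    [1, 1, 1, 1],
    [1, -1, 1, -1],
    [1, 1, -1, -1],
    [1, -1, -1, 1],
]


def matmul4(a, b):
    """Multiply two 4x4 matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def satd4(orig, pred):
    """Sum of absolute values of H * (orig - pred) * H for a 4x4 block."""
    resid = [[o - p for o, p in zip(row_o, row_p)]
             for row_o, row_p in zip(orig, pred)]
    coeffs = matmul4(matmul4(H4, resid), H4)
    return sum(abs(c) for row in coeffs for c in row)


def shortlist(orig, predictions, keep):
    """Return the names of the `keep` predictions with the lowest SATD."""
    ranked = sorted(predictions, key=lambda name: satd4(orig, predictions[name]))
    return ranked[:keep]


# Assumed example: a block with vertical stripes and three candidate predictions.
orig = [[10, 20, 30, 40] for _ in range(4)]
predictions = {
    "dc": [[25] * 4 for _ in range(4)],                # mean of the block
    "vertical": [[10, 20, 30, 40] for _ in range(4)],  # copies the row above
    "horizontal": [[10] * 4 for _ in range(4)],        # copies the left column
}

print(shortlist(orig, predictions, keep=2))
```

In this toy case the vertical prediction matches the block exactly, so it ranks first; a real encoder would run the full rate-distortion check only on the shortlisted modes.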
[Degree-granting institution]: Harbin Institute of Technology
[Degree level]: Doctoral
[Year degree awarded]: 2014
[Classification number]: TN919.81
Article ID: 1826558
Article link: http://sikaile.net/kejilunwen/wltx/1826558.html