基于空間位置信息的三維音頻編碼技術(shù)研究
發(fā)布時(shí)間:2018-09-08 14:31
【摘要】:現(xiàn)有的空間方位量化格點(diǎn)算法大多以全部聲道為基礎(chǔ)合成虛擬聲源,違背了矢量幅度平移(VBAP)用3個(gè)聲道形成一個(gè)虛擬聲源的基本原理。此外未考慮各個(gè)聲道之間的時(shí)間差,會造成音質(zhì)下降。針對上述問題,設(shè)計(jì)基于VBAP基本原理的三維音頻編解碼框架,以3個(gè)聲道為一組合成虛擬聲源及下混信號,編碼時(shí)增添時(shí)間差參數(shù),在解碼端提出基于線性方程組求解的虛擬聲像重分配方法,獲得與原始聲道配置一致的重建信號。實(shí)驗(yàn)結(jié)果表明,該方法生成的三維音頻信號,利用MUSHRA標(biāo)準(zhǔn)進(jìn)行主觀測試,平均得分比現(xiàn)有方法高出12分。
[Abstract]:Most of the existing spatial azimuth quantization lattice algorithms synthesize virtual sound sources on the basis of all sound channels, which violates the basic principle of vector amplitude translation (VBAP) to form a virtual sound source from three channels. Moreover, the time difference between each channel is not taken into account, and the sound quality will decrease. Aiming at the above problems, a 3D audio coding and decoding framework based on the basic principle of VBAP is designed. Three channels are combined to form a virtual sound source and a mixed signal, and the time difference parameters are added to the coding process. At the decoding end, a virtual image redistribution method based on linear equations is proposed to obtain the reconstructed signal which is consistent with the original channel configuration. The experimental results show that the three-dimension audio signal generated by this method is tested subjectively by using MUSHRA standard, and the average score is 12 points higher than that of the existing methods.
【作者單位】: 武漢大學(xué)國家多媒體軟件工程技術(shù)研究中心;
【基金】:國家自然科學(xué)基金青年基金“三維聲場中聲源水平定位線索感知特性測量與分析”(61201340);國家自然科學(xué)基金重點(diǎn)項(xiàng)目(61231015) 國家“863”計(jì)劃項(xiàng)目(2015AA016306)
【分類號】:TN912.3
,
本文編號:2230803
[Abstract]:Most of the existing spatial azimuth quantization lattice algorithms synthesize virtual sound sources on the basis of all sound channels, which violates the basic principle of vector amplitude translation (VBAP) to form a virtual sound source from three channels. Moreover, the time difference between each channel is not taken into account, and the sound quality will decrease. Aiming at the above problems, a 3D audio coding and decoding framework based on the basic principle of VBAP is designed. Three channels are combined to form a virtual sound source and a mixed signal, and the time difference parameters are added to the coding process. At the decoding end, a virtual image redistribution method based on linear equations is proposed to obtain the reconstructed signal which is consistent with the original channel configuration. The experimental results show that the three-dimension audio signal generated by this method is tested subjectively by using MUSHRA standard, and the average score is 12 points higher than that of the existing methods.
【作者單位】: 武漢大學(xué)國家多媒體軟件工程技術(shù)研究中心;
【基金】:國家自然科學(xué)基金青年基金“三維聲場中聲源水平定位線索感知特性測量與分析”(61201340);國家自然科學(xué)基金重點(diǎn)項(xiàng)目(61231015) 國家“863”計(jì)劃項(xiàng)目(2015AA016306)
【分類號】:TN912.3
,
本文編號:2230803
本文鏈接:http://sikaile.net/kejilunwen/xinxigongchenglunwen/2230803.html
最近更新
教材專著