天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于張量分析的多因素音頻信號建模與應(yīng)用研究

發(fā)布時間:2018-01-02 21:25

  本文關(guān)鍵詞:基于張量分析的多因素音頻信號建模與應(yīng)用研究 出處:《北京理工大學》2016年博士論文 論文類型:學位論文


  更多相關(guān)文章: 多因素音頻信號 張量分解 特征建模 音頻分類 數(shù)據(jù)恢復(fù) 張量補全


【摘要】:隨著互聯(lián)網(wǎng)技術(shù)和多媒體技術(shù)的不斷發(fā)展,音頻信號作為多媒體信號的重要組成部分,對其進行分析和處理引起了越來越多研究人員的關(guān)注,而張量分析是近年來被廣泛使用的多邊或者多線性分析工具,可以處理不止一個影響因素的信號,包括信號的高階擴展形式或者本身是多維度的信號。本文把張量分析方法引入到多因素音頻信號建模及其應(yīng)用研究領(lǐng)域,利用其作為一種多因素分析方法在處理高階信號方面可以保持數(shù)據(jù)結(jié)構(gòu)信息的優(yōu)勢,解決音頻信號的高階特征建模、音頻分類的高階子空間分析和多聲道音頻信號丟失數(shù)據(jù)的恢復(fù)這三個應(yīng)用問題,具體研究內(nèi)容如下:1.針對音頻信號的特征建模,本文對傳統(tǒng)的一維、二維建模方式進行擴展,使用張量對音頻信號的高階特征建模,既體現(xiàn)了音頻信號在不同屬性子空間的物理意義,又保證了各子空間之間的聯(lián)系,而且通過張量分解可以挖掘音頻信號潛在的、本質(zhì)的、具有區(qū)分度的結(jié)構(gòu)信息。在無人車語音命令識別系統(tǒng)中構(gòu)建一個幀結(jié)構(gòu)、分解尺度、特征參數(shù)的三階張量;在音頻分類系統(tǒng)中,采用聲學特征空間、感知特征空間和心理聲學特征空間的不同屬性構(gòu)建三階張量。通過使用張量建模和分解得到的音頻特征集合,有利于提升音頻識別和分類的正確率。2.針對音頻分類這一模式識別問題,本文利用高階子空間分析方法,創(chuàng)造性的使用非負張量分解技術(shù)進行音頻分類。在有監(jiān)督的音頻分類訓練時,把音頻信號用非負張量模型表示,為了確保分解結(jié)果的唯一性,使用非負張量分解對每類音頻信號分別進行學習,得到各類音頻的非負核張量和因子矩陣;音頻分類測試時,通過訓練生成的非負因子矩陣把測試音頻映射到各種類音頻空間,通過Frobenius范數(shù)比較映射結(jié)果與訓練時得到的各類音頻核張量的相似度完成音頻分類。與傳統(tǒng)分類器相比,因為音頻數(shù)據(jù)結(jié)構(gòu)中的非線性關(guān)系并沒有在非負張量分解過程中受到破壞,所以音頻分類效果更好,可以更有效的實現(xiàn)音頻數(shù)據(jù)庫的分類標注。3.針對多聲道音頻信號丟失數(shù)據(jù)的恢復(fù)問題,本文把張量分解和張量補全技術(shù)首次引入到音頻數(shù)據(jù)恢復(fù)中。張量分解方法是對有數(shù)據(jù)丟失的音頻信號用三階張量建模并分解,通過加權(quán)處理和交替迭代算法實現(xiàn)目標函數(shù)最小化;而張量補全方法主要通過定義張量的跡范數(shù),利用凸松弛技術(shù)把秩函數(shù)最小化問題轉(zhuǎn)化成跡范數(shù)最小化問題,即完成非凸優(yōu)化到凸優(yōu)化問題的轉(zhuǎn)變,進而解決了非確定性多項式困難問題,利用基于塊坐標下降算法的簡單補全和基于交替方向乘子算法的精確補全完成多聲道音頻信號丟失數(shù)據(jù)的恢復(fù)。
[Abstract]:With the development of Internet technology and multimedia technology, audio signal, as an important part of multimedia signal, has attracted more and more researchers' attention. Zhang Liang analysis is a multilateral or multi-linear analysis tool widely used in recent years, which can deal with more than one factor of the signal. This paper introduces Zhang Liang analysis method into multi-factor audio signal modeling and application research field. As a multi-factor analysis method, it can maintain the advantage of data structure information in processing high-order signal, and solve the high-order feature modeling of audio signal. High order subspace analysis of audio classification and restoration of lost data of multi-channel audio signal are three application problems. The specific research contents are as follows: 1. Aiming at the feature modeling of audio signal, this paper focuses on the traditional one-dimensional. The two-dimensional modeling method is extended to use Zhang Liang to model the high-order features of audio signal, which not only reflects the physical meaning of audio signal in different attribute subspace, but also ensures the relationship between each subspace. And through Zhang Liang decomposition can mine audio signal potential, essential, has the discriminative structure information, constructs a frame structure in the unmanned vehicle speech command recognition system, decomposes the scale. Third order Zhang Liang of characteristic parameter; In the audio classification system, the third order Zhang Liang is constructed by using different attributes of acoustic feature space, perceptual feature space and psychoacoustic feature space. It is helpful to improve the accuracy of audio recognition and classification. 2. Aiming at the pattern recognition problem of audio classification, this paper uses high-order subspace analysis method. Creative use of non-negative Zhang Liang decomposition technology for audio classification. In the supervised audio classification training, the audio signal is represented by non-negative Zhang Liang model, in order to ensure the uniqueness of decomposition results. The non-negative Zhang Liang decomposition is used to study each kind of audio signal separately, and the non-negative nuclear Zhang Liang and factor matrix of all kinds of audio are obtained. In audio classification testing, the test audio is mapped to various kinds of audio space through the non-negative factor matrix generated by the training. By comparing the mapping result of Frobenius norm with the similarity of Zhang Liang, the audio kernel obtained from the training, the audio classification is completed, which is compared with the traditional classifier. Because the nonlinear relationship in the audio data structure is not destroyed in the process of non-negative Zhang Liang decomposition, the audio classification effect is better. It is more effective to realize the classification tagging of audio database. 3. To recover the lost data of multi-channel audio signal. In this paper, Zhang Liang decomposition and Zhang Liang complement technology are introduced into audio data recovery for the first time. Zhang Liang decomposition method is to model and decompose the audio signals with data loss by the third order Zhang Liang. The objective function is minimized by weighted processing and alternating iteration algorithm. Zhang Liang complements the whole method mainly by defining Zhang Liang's trace norm, using convex relaxation technique to transform the rank function minimization problem into the trace norm minimization problem, that is, to complete the transformation from non-convex optimization to convex optimization. Furthermore, the problem of uncertain polynomial is solved. The simple complement based on block coordinate descent algorithm and the exact complement based on alternating direction multiplier algorithm are used to restore the lost data of multi-channel audio signal.
【學位授予單位】:北京理工大學
【學位級別】:博士
【學位授予年份】:2016
【分類號】:TN912.3

【參考文獻】

相關(guān)期刊論文 前10條

1 楊立東;王晶;謝湘;匡鏡明;;基于Tucker分解的音頻分類研究[J];信號處理;2015年02期

2 王磊;周樂囡;姬紅兵;林琳;;一種面向信號分類的匹配追蹤新方法[J];電子與信息學報;2014年06期

3 XING Ling;MA Qiang;ZHU Min;;Tensor semantic model for an audio classification system[J];Science China(Information Sciences);2013年06期

4 邢玲;賀梅;馬強;朱敏;;基于張量神經(jīng)網(wǎng)絡(luò)的音頻多語義分類方法[J];計算機應(yīng)用;2012年10期

5 盧雁;吳盛教;趙文強;;壓縮感知理論綜述[J];計算機與數(shù)字工程;2012年08期

6 高悅;陳硯圃;閔剛;杜佳;;基于線性預(yù)測分析和差分變換的語音信號壓縮感知[J];電子與信息學報;2012年06期

7 王膂;伍家松;Senhadji Lotfi;舒華忠;;音頻壓縮中3種整數(shù)型MDCT變換的比較[J];東南大學學報(自然科學版);2012年02期

8 朱墨;吳國清;郭新毅;;基于盲解卷積的水聲信號恢復(fù)技術(shù)[J];應(yīng)用聲學;2011年03期

9 龐毅;閆德勤;;基于張量模式的降維方法研究[J];吉林師范大學學報(自然科學版);2011年02期

10 劉銘;俞能海;李衛(wèi)海;周浩;;基于張量分解的數(shù)字圖像取證[J];計算機工程;2011年08期

,

本文編號:1370979

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/shoufeilunwen/xxkjbs/1370979.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶d1007***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com
美日韩一区二区精品系列| 极品少妇嫩草视频在线观看| 国产对白老熟女正在播放| 免费在线观看激情小视频| 高清在线精品一区二区| 亚洲精品小视频在线观看| 91福利视频日本免费看看| 少妇高潮呻吟浪语91| 日本午夜免费观看视频| 国产精品美女午夜视频| 欧美乱视频一区二区三区| 区一区二区三中文字幕| 大香蕉大香蕉手机在线视频| 国产av精品高清一区二区三区| 欧美av人人妻av人人爽蜜桃| 亚洲五月婷婷中文字幕| 国产欧美日韩一级小黄片| 国产成人精品资源在线观看| 护士又紧又深又湿又爽的视频| 99少妇偷拍视频在线| 在线观看欧美视频一区| 日韩高清中文字幕亚洲| 熟女白浆精品一区二区| 久久老熟女一区二区三区福利| 成人免费观看视频免费| 国产又粗又猛又黄又爽视频免费| 精品视频一区二区三区不卡| 国产精品免费视频久久| 东京热男人的天堂一二三区| 亚洲天堂久久精品成人| 日韩黄色一级片免费收看| 狠狠干狠狠操在线播放| 国产精品欧美一区二区三区| 日本不卡在线一区二区三区| 久久福利视频视频一区二区| 中文字幕91在线观看| 自拍偷女厕所拍偷区亚洲综合| 精品综合欧美一区二区三区| 高清免费在线不卡视频| 婷婷开心五月亚洲综合| 大尺度剧情国产在线视频|