基于低秩和稀疏表示模型的視頻目標(biāo)提取和跟蹤研究

發(fā)布時間：2018-10-12 17:43

【摘要】：視頻目標(biāo)的提取與跟蹤是計算機(jī)視覺領(lǐng)域的基本問題,也是智能視頻監(jiān)控系統(tǒng)的關(guān)鍵與核心技術(shù)。目前,盡管這方面的研究取得了令人矚目的進(jìn)展,但是,由于數(shù)據(jù)、場景、環(huán)境的復(fù)雜性,視頻目標(biāo)的提取與跟蹤仍是挑戰(zhàn)性極大的研究課題。本文圍繞上述復(fù)雜因素,從低秩和稀疏表示模型的角度出發(fā),對視頻目標(biāo)的提取和跟蹤問題展開討論,分別研究了基于正則化低秩表示模型的視頻目標(biāo)分割、基于加權(quán)低秩分解的多模態(tài)運動目標(biāo)檢測、基于圖像塊表達(dá)和動態(tài)圖學(xué)習(xí)的目標(biāo)跟蹤以及基于協(xié)同稀疏表示模型的多模態(tài)目標(biāo)跟蹤。在視頻目標(biāo)提取方面,針對視頻數(shù)據(jù)中類內(nèi)差異性和類間相似性較大,以及視頻噪聲的存在,提出一種基于正則化低秩表示模型的視頻目標(biāo)分割框架。以超體素為圖結(jié)點,使用低秩表示模型優(yōu)化它們之間的相似性關(guān)系,可以有效地克服稀疏大噪聲和稠密高斯噪聲的干擾。為了提高超體素之間的判別性,在稀疏表示模型中引入判別性重復(fù)先驗對稀疏表示系數(shù)矩陣進(jìn)行正則化,即正則化稀疏表示模型。由于視頻數(shù)據(jù)一般是非常龐大的,因此,提出一種基于次優(yōu)低秩分解的優(yōu)化算法高效地求解提出的模型,并在理論上保證了其收斂性。同時,提出流處理方法,使得分割方法能夠在有限的計算和存儲資源中處理無限長的視頻。為了驗證有效性,本文分別把優(yōu)化的超體素的相似性關(guān)系應(yīng)用于無監(jiān)督的和交互式的視頻目標(biāo)分割任務(wù),均取得了較優(yōu)的性能。針對場景和環(huán)境的復(fù)雜性,本文提出了一種基于加權(quán)低秩分解的多模態(tài)運動目標(biāo)檢測的通用框架。由于可見光譜信息受復(fù)雜場景、光照和霧霾等因素的干擾較大,因此,引入熱紅外光譜信息對其進(jìn)行補充。具體地,通過為每個模態(tài)引入一個質(zhì)量權(quán)重,把不同模態(tài)的具有低秩結(jié)構(gòu)的背景數(shù)據(jù)、多模態(tài)共享的稀疏前景模板以及前景、背景像素點的連續(xù)性約束進(jìn)行聯(lián)合建模,使得能夠自適應(yīng)地融合多模態(tài)數(shù)據(jù),進(jìn)而魯棒地檢測運動目標(biāo)。為了進(jìn)一步地改善算法檢測效率并保持精度,提出一種有效的基于保邊濾波的加速算法,使得算法效率達(dá)到近實時。此外,構(gòu)建了一個包括25個視頻對的多模態(tài)運動目標(biāo)檢測平臺,彌補了該領(lǐng)域缺乏標(biāo)準(zhǔn)評價體系的不足,促進(jìn)相關(guān)領(lǐng)域的研究發(fā)展。在目標(biāo)跟蹤方面,為了解決基于檢測的跟蹤框架中的模型漂移問題,本文提出了一種基于圖像塊的動態(tài)圖學(xué)習(xí)方法,消弱目標(biāo)表達(dá)中的背景干擾。首先,把跟蹤矩形框劃分成不重疊的小圖像塊,并為每個圖像塊分配一個權(quán)重,用來表示圖像塊對于目標(biāo)的重要性。由于傳統(tǒng)的8-鄰域圖忽略了圖的全局結(jié)構(gòu)以及局部線性關(guān)系,因此,以圖像塊為圖結(jié)點,利用它們之間的全局低秩結(jié)構(gòu)、稀疏局部線性關(guān)系以及邊權(quán)的非負(fù)性動態(tài)地學(xué)習(xí)圖的結(jié)構(gòu),同時,以半監(jiān)督的方式聯(lián)合地優(yōu)化圖像塊的權(quán)重向量。其次,為了提高跟蹤方法的時效性,提出一個實時的優(yōu)化算法求解提出的模型。最后,把優(yōu)化的權(quán)重向量嵌入到目標(biāo)跟蹤和模型更新中,極大地提高跟蹤性能。為了克服場景和環(huán)境復(fù)雜性帶來的挑戰(zhàn),本文在貝葉斯濾波框架下提出了一種基于協(xié)同稀疏表示模型的多模態(tài)目標(biāo)跟蹤方法。傳統(tǒng)的多模態(tài)目標(biāo)跟蹤方法把每個模態(tài)平等地對待,如果某個模態(tài)的信息有非常大的歧義性,則會對最終的跟蹤結(jié)果造成影響。因此,本文自適應(yīng)地融合不同的模態(tài),即在稀疏表示模型中為每個模態(tài)引入一個質(zhì)量權(quán)重,以此實現(xiàn)穩(wěn)健地跟蹤。特別地,每個模態(tài)的權(quán)重由該模態(tài)的重構(gòu)誤差以及目標(biāo)與背景的判別性來確定的,并和稀疏表示系數(shù)一起聯(lián)合優(yōu)化。此外,由于該問題缺少標(biāo)準(zhǔn)的評測平臺,因此,構(gòu)建了一個標(biāo)準(zhǔn)的多模態(tài)目標(biāo)檢測平臺,包含50個配準(zhǔn)的視頻對、22個基準(zhǔn)方法和2種度量方式。該平臺為該問題及相關(guān)領(lǐng)域提供了一個標(biāo)準(zhǔn)的評價體系,有助于這方面的研究。
[Abstract]:The extraction and tracking of video object is the basic problem in the field of computer vision, and it is also the key and core technology of intelligent video monitoring system. At present, although the research in this aspect has made remarkable progress, because of the complexity of data, scene and environment, the extraction and tracking of video object is still a very challenging research topic. From the viewpoint of low rank and sparse representation model, this paper discusses the extraction and tracking of video object from the viewpoint of low rank and sparse representation model, studies the video target segmentation based on regularization low rank representation model, and based on weighted low rank decomposition multi-modal motion target detection, Object tracking based on image block representation and dynamic graph learning and multi-modal target tracking based on collaborative sparse representation model. In terms of video target extraction, a video target segmentation framework based on regularization low rank representation model is proposed for video data. Using supervoxel as graph node, using low rank representation model to optimize the similarity relation among them, we can effectively overcome the interference of sparse large noise and dense Gaussian noise. In order to improve the discriminability between supervoxels, the sparse representation coefficient matrix is regularized in sparse representation model, that is, regularization sparse representation model. Because the video data is usually very large, an optimization algorithm based on sub-optimal low-rank decomposition is proposed to solve the proposed model efficiently, and its convergence is guaranteed theoretically. At the same time, a stream processing method is proposed so that the segmentation method can process unlimited long video in limited computing and storage resources. In order to verify the validity, this paper applies the similarity relation of the optimized supervoxel to the unsupervised and interactive video object segmentation task. In view of the complexity of scene and environment, a universal framework for multi-modal motion target detection based on weighted low-rank decomposition is proposed in this paper. Since the visible spectrum information is affected by complex scenes, illumination and haze, the thermal infrared spectral information is introduced to supplement it. in particular, by introducing a quality weight for each modality, combining background data with a low rank structure, a sparse foreground template of multi-modality sharing and a foreground and a continuity constraint of a background pixel point are jointly modeled, so that multi-modal data can be adaptively fused, and then the moving object is detected by the rod. In order to improve the algorithm detection efficiency and maintain the accuracy, an efficient algorithm based on edge-preserving filtering is proposed, which makes the efficiency of the algorithm close to real-time. In addition, a multi-modal motion target detection platform including 25 video pairs is constructed, which makes up for the lack of standard evaluation system in this field and promotes research and development in related fields. In the aspect of target tracking, in order to solve the problem of model drift in the detection-based tracking framework, a dynamic graph learning method based on image block is proposed in this paper. First, the tracking block is divided into non-overlapping small image blocks, and a weight is allocated for each image block to represent the importance of the image block for the object. because the traditional 8-neighborhood graph ignores the global structure of the graph and the local linear relationship, the structure of the graph is dynamically learned by using the global low-rank structure, the sparse local linear relation and the non-negative dynamic learning graph of the edge right between the image blocks as the graph nodes, and meanwhile, the weight vector of the image block is optimized in a semi-supervised manner. Secondly, in order to improve the timeliness of tracking method, a real-time optimization algorithm is proposed to solve the proposed model. Finally, the optimized weight vector is embedded into the target tracking and model updating, so that the tracking performance is greatly improved. In order to overcome the challenges of scene and environment complexity, a multi-modal target tracking method based on collaborative sparse representation model is proposed in this paper. The traditional multi-modal target tracking method treats each modality equally, and if the information of a certain modality has very large ambiguity, the final tracking result is affected. Therefore, a robust tracking is achieved by adaptively fusing different modalities, i.e., introducing a quality weight for each modality in the sparse representation model. In particular, the weight of each modality is determined by the reconstruction error of the modality and the determination of the target and background, and is jointly optimized with the sparse representation coefficients. In addition, since the problem lacks the standard evaluation platform, a standard multi-modal target detection platform is constructed, including 50 matching video pairs, 22 reference methods and two measurement methods. The platform provides a standard evaluation system for the problem and related fields, which contributes to the research in this field.
【學(xué)位授予單位】：安徽大學(xué)
【學(xué)位級別】：博士
【學(xué)位授予年份】：2016
【分類號】：TP391.41

【相似文獻(xiàn)】

相關(guān)期刊論文前10條

1 李映;張艷寧;許星;;基于信號稀疏表示的形態(tài)成分分析:進(jìn)展和展望[J];電子學(xué)報;2009年01期

2 趙瑞珍;王飛;羅阿理;張彥霞;;基于稀疏表示的譜線自動提取方法[J];光譜學(xué)與光譜分析;2009年07期

3 楊蜀秦;寧紀(jì)鋒;何東健;;基于稀疏表示的大米品種識別[J];農(nóng)業(yè)工程學(xué)報;2011年03期

4 史加榮;楊威;魏宗田;;基于非負(fù)稀疏表示的人臉識別[J];計算機(jī)工程與設(shè)計;2012年05期

5 高志榮;熊承義;笪邦友;;改進(jìn)的基于殘差加權(quán)的稀疏表示人臉識別[J];中南民族大學(xué)學(xué)報(自然科學(xué)版);2012年03期

6 朱杰;楊萬扣;唐振民;;基于字典學(xué)習(xí)的核稀疏表示人臉識別方法[J];模式識別與人工智能;2012年05期

7 耿耀君;張軍英;袁細(xì)國;;一種基于稀疏表示系數(shù)的特征相關(guān)性測度[J];模式識別與人工智能;2013年01期

8 張疆勤;廖海斌;李原;;基于因子分析與稀疏表示的多姿態(tài)人臉識別[J];計算機(jī)工程與應(yīng)用;2013年05期

9 李正周;王會改;劉梅;丁浩;金鋼;;基于形態(tài)成分稀疏表示的紅外小弱目標(biāo)檢測[J];彈箭與制導(dǎo)學(xué)報;2013年04期

10 胡正平;趙淑歡;李靜;;基于塊稀疏遞推殘差分析的稀疏表示遮擋魯棒識別算法研究[J];模式識別與人工智能;2014年01期

相關(guān)會議論文前3條

1 何愛香;劉玉春;魏廣芬;;基于稀疏表示的煤矸界面識別研究[A];虛擬運營與云計算——第十八屆全國青年通信學(xué)術(shù)年會論文集（上冊）[C];2013年

2 樊亞翔;孫浩;周石琳;鄒煥新;;基于元樣本稀疏表示的多視角目標(biāo)識別[A];2013年中國智能自動化學(xué)術(shù)會議論文集（第五分冊）[C];2013年

3 葛鳳翔;任歲玲;郭鑫;郭良浩;孫波;;微弱信號處理及其研究進(jìn)展[A];中國聲學(xué)學(xué)會水聲學(xué)分會2013年全國水聲學(xué)學(xué)術(shù)會議論文集[C];2013年

相關(guān)博士學(xué)位論文前10條

1 李進(jìn)明;基于稀疏表示的圖像超分辨率重建方法研究[D];重慶大學(xué);2015年

2 王亞寧;基于信號稀疏表示的電機(jī)故障診斷研究[D];河北工業(yè)大學(xué);2014年

3 姚明海;視頻異常事件檢測與認(rèn)證方法研究[D];東北師范大學(xué);2015年

4 黃國華;蛋白質(zhì)翻譯后修飾位點與藥物適應(yīng)癥預(yù)測方法研究[D];上海大學(xué);2015年

5 王瑾;基于稀疏表示的數(shù)據(jù)收集、復(fù)原與壓縮研究[D];北京工業(yè)大學(xué);2015年

6 王文卿;基于融合框架與稀疏表示的遙感影像銳化[D];西安電子科技大學(xué);2015年

7 解虎;高維小樣本陣列自適應(yīng)信號處理方法研究[D];西安電子科技大學(xué);2015年

8 秦振濤;基于稀疏表示及字典學(xué)習(xí)遙感圖像處理關(guān)鍵技術(shù)研究[D];成都理工大學(xué);2015年

9 薛明;基于稀疏表示的在線目標(biāo)跟蹤研究[D];上海交通大學(xué);2014年

10 孫樂;空譜聯(lián)合先驗的高光譜圖像解混與分類方法[D];南京理工大學(xué);2014年

相關(guān)碩士學(xué)位論文前10條

1 王道文;基于稀疏表示的目標(biāo)跟蹤算法研究[D];華南理工大學(xué);2015年

2 李哲;基于稀疏表示和LS-SVM的心電信號分類[D];河北大學(xué);2015年

3 孫雪青;Shearlet變換和稀疏表示相結(jié)合的甲狀腺結(jié)節(jié)圖像融合[D];河北大學(xué);2015年

4 吳麗璇;基于稀疏表示的微聚焦X射線圖像去噪方法[D];華南理工大學(xué);2015年

5 趙孝磊;基于圖像分塊稀疏表示的人臉識別算法研究[D];南京信息工程大學(xué);2015年

6 黃志明;基于辨別式稀疏字典學(xué)習(xí)的視覺追蹤算法研究[D];華南理工大學(xué);2015年

7 張鈴華;非約束環(huán)境下的稀疏表示人臉識別算法研究[D];南京信息工程大學(xué);2015年

8 賀妍斐;基于稀疏表示與自適應(yīng)倒易晶胞的遙感圖像復(fù)原方法研究[D];南京信息工程大學(xué);2015年

9 楊爍;電能質(zhì)量擾動信號的稀疏表示/壓縮采樣研究[D];西南交通大學(xué);2015年

10 應(yīng)艷麗;基于低秩稀疏表示的目標(biāo)跟蹤算法研究[D];西南交通大學(xué);2015年

，

本文編號：2267027

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://sikaile.net/shoufeilunwen/xxkjbs/2267027.html

上一篇：分布式模型預(yù)測控制算法相關(guān)研究
下一篇：線結(jié)構(gòu)光視覺測量關(guān)鍵技術(shù)及在軌道交通巡檢中應(yīng)用

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于低秩和稀疏表示模型的視頻目標(biāo)提取和跟蹤研究