基于圖像分割和區(qū)域語義相關(guān)性的圖像標(biāo)注算法研究
[Abstract]:With the rapid development of computer technology, network technology and intelligent communication technology, a large number of image data are widely spread on the network, and show explosive growth. How to manage and utilize these image resources effectively has become a difficult problem. Although many achievements have been made in the field of image retrieval, there are still many problems. Because of its low efficiency and artificial subjectivity, text-based image retrieval has been unable to meet the needs of the current big data era, and content-based image retrieval has hindered its development because of its inability to solve the problem of "semantic gap". Semantic automatic image annotation is the main development direction in the field of image retrieval. Researchers have done a lot of research and exploration in this field, but still face a lot of technical difficulties. In view of the present situation and development trend of image retrieval field and many difficult problems, this paper puts forward a series of effective improvement methods. The main points are as follows: (1) automatic image tagging based on semantics needs to use image segmentation algorithm to preprocess the image and to segment the image accurately and effectively. It is very important for the feature extraction and the construction of the tagging model. In this paper, an improved image segmentation algorithm is proposed. The basic idea of the algorithm is: firstly, the Mean Shift algorithm is used to pre-segment the image, because the Mean Shift algorithm is sensitive to the edge of the image. Therefore, the edge information of the image can be extracted very well, but the algorithm can easily produce a lot of small regions. In view of this shortcoming, this paper uses Ncut algorithm to further process the image region obtained from the previous step. Because the Ncut algorithm always tends to get large image regions, it can solve the problem of over-segmentation of Mean Shift, and because Ncut deals with image regions that have been segmented rather than pixels, it greatly reduces the amount of computation. The performance of the algorithm is improved, but the Ncut algorithm also has some shortcomings. This algorithm is a difficult problem of NP. It is necessary to specify the number of segmentation regions before segmentation. If the parameter is not set properly, it is easy to produce over-segmentation and under-segmentation. In this paper, we use the region merging and splitting algorithm to further correct the segmentation results obtained by Ncut processing, merge the over-segmented regions and split the under-segmented regions. The accuracy of image segmentation is improved as much as possible. (2) an improved image semantic annotation method combining regional semantic correlation with Gao Si mixed model is proposed. The traditional Gao Si mixed model is based on the size of the semantic posteriori probability directly to get the image tagging results: one is the direct selection of semantic posteriori probability of N semantic words as the image tagging results. The other is to directly select semantic words whose semantic posteriori probability is greater than a threshold value as the result of image tagging. However, the results obtained by this method are not accurate, and it is easy to produce some redundant or incorrect tagging words, which affects the accuracy of the labeling results. Considering the "semantic gap" problem in the model, the magnitude of the posterior probability can not completely determine its weight, and there may be large errors in the classification decision only based on the posterior probability. Aiming at the above problems, this paper proposes a method of GMM image tagging based on regional semantic correlation, which integrates the semantic correlation of each region into the GMM model to make comprehensive decision. The labeling results of the model are calibrated and optimized effectively, so as to improve the accuracy of the labeling results.
【學(xué)位授予單位】:華中師范大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2017
【分類號】:TP391.41
【參考文獻】
相關(guān)期刊論文 前10條
1 李果;;一種基于遺傳算法的Normalized Cut準(zhǔn)則圖像分割方法[J];齊齊哈爾大學(xué)學(xué)報(自然科學(xué)版);2016年03期
2 崔兆華;孫穗;陳思國;高立群;;Mean shift模糊C均值聚類圖像分割算法[J];控制與決策;2014年06期
3 李宏益;吳素萍;;Mean Shift圖像分割算法的并行化[J];中國圖象圖形學(xué)報;2013年12期
4 魏津瑜;施鶴南;蘇思沁;;基于改進算法的自動種子區(qū)域生長圖像分割[J];中南大學(xué)學(xué)報(自然科學(xué)版);2013年S2期
5 郭玉堂;韓昌剛;;基于CCA子空間和GMM的自動圖像標(biāo)注[J];計算機工程;2013年06期
6 楊棟;周秀玲;郭平;;基于貝葉斯通用背景模型的圖像標(biāo)注[J];自動化學(xué)報;2013年10期
7 張桂梅;周明明;馬珂;;基于彩色模型的重構(gòu)標(biāo)記分水嶺分割算法[J];中國圖象圖形學(xué)報;2012年05期
8 吳一全;張金礦;;二維直方圖θ-劃分最大平均離差閾值分割算法[J];自動化學(xué)報;2010年05期
9 許新征;丁世飛;史忠植;賈偉寬;;圖像分割的新理論和新方法[J];電子學(xué)報;2010年S1期
10 廖建勇;郭斯羽;黃梓效;;基于Mean Shift聚類的最大熵圖像分割方法[J];計算機仿真;2009年09期
相關(guān)碩士學(xué)位論文 前1條
1 劉苗苗;幾何活動輪廓模型在圖像分割中的應(yīng)用研究[D];南京航空航天大學(xué);2007年
,本文編號:2337100
本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2337100.html