基于稀疏編碼理論的圖像多標(biāo)簽排序算法研究
[Abstract]:In today's high-speed Internet era, the popularity of many digital imaging devices, coupled with the advancement of Internet technology, Internet images are playing an increasingly important role in our lives. Network image search has become a very active and challenging research topic in the field of computer vision. Unlike a decade ago, the Internet now makes it easy to create, upload, share, and distribute digital images on the Internet. Social media, such as Facebook, YouTube, Flickr, allow image uploaders to provide a set of keywords (also known as Social Tags) that describe the image. To index images, the semantic annotation of images is accomplished by the cooperation of users through the network, so this kind of image set is also called Collaborative-Tagged Images. There are a large number of tagged image sets on image sharing websites such as R. This sharing method based on social tagging will greatly improve the performance of mass image organization and retrieval on the Internet. Therefore, how to use these tagged image sets more effectively is the key to improve the performance of automatic image annotation. One of the problems is that users usually upload tags corresponding to images in Random Order, i.e. the set of tags submitted by users is not always sorted according to the size of tag-to-image semantic relevance (Tag Relevance). At present, Flickr does not provide a Relevance-based Ranking-based retrieval sorting mechanism. The random sorting of label sets restricts further application of massive image retrieval performance. Most Interesting: that is, according to the click rate of users, the number of comments and so on, but Flickr can not provide the retrieval mode according to semantic relevance at present. In other words, although community-based annotation sharing greatly improves the performance of mass image organization and retrieval on the Internet, users usually upload tags corresponding to images in random order, i.e. the set of tags submitted by users is not always in accordance with the semantic relevance between tags and image content (Tag R). The random ordering of label sets restricts the further application of massive image retrieval performance. Therefore, label ordering is becoming a new hotspot in multimedia research field. It should be pointed out that the image set ordered by semantic correlation can be used as the key to represent semantics. As mentioned earlier, community labeling has become a popular way to capture, classify, and retrieve content on the Internet, and has been successfully applied in the management and retrieval of real social media systems. Although users provide tags to describe the content of a community image, because these tags are from different cultural backgrounds, network users with knowledge structures have their own subjective understanding of the content of the image. The label quality of community image can not be directly used as a reliable image indexing keyword for keyword-based image retrieval. At present, the label of community image mainly exists the disorder of label arrangement and the imprecision of label content, so the semantic understanding of labeled community image mainly focuses on. Some research institutes (such as MSRA) have studied the Tag Ranking problem. Since an image may be labeled with several semantic conceptual markers at the same time, it is a typical multi-marker learning problem. The image itself has a certain degree of semantic ambiguity. However, the realization of tag set sorting according to semantic relevance is abstracted as a typical multi-label Ranking problem. At present, there are many researches on multi-label learning, while there are relatively few researches on multi-label sorting problem. Most of the existing Tag Ranking algorithms focus on Relevance-based Tag Ranking. Intuitively, given an image and a label set, if the correlation of a label A in the label set is higher than that of label B, it is shown that In other words, the frequency of tag A appearing in the subset of K-nearest neighbor image of a given image is higher than that of tag B. This kind of algorithm mainly has two representative works. (1) Statistical model-based algorithm. Modeling sorting algorithm; (2) Data-driven sorting algorithm. Statistical model-based sorting algorithm uses kernel density estimation to estimate the semantic correlation between each tag in an image and the image itself. Its essence is to estimate the Typicality of the sample, if the image represents the region of a semantic tag. Low-level visual features are more typical, that is, if the feature vectors of the region with the same label are closer in the feature space, the semantic relevance of the label will be high; considering the semantic correlation between the labels, random walk algorithm is used to improve the sorting results and achieve the final label sorting. Because global low-level visual features represent images with multi-label semantics, it is impossible to estimate the density of each label in the feature space. Data-driven sorting algorithm gets a subset of the nearest neighbor images of a given image by simple image global feature matching, and counts them by Neighbor-voting strategy. Different from the sorting algorithm based on statistical model, the data-driven sorting algorithm only uses the visual features of the image when selecting the nearest neighbor sample set of the sorted image without considering the label information of the image. Intuitively, tag sorting algorithm based on nearest neighbor voting mechanism shows good scalability in massive image datasets because of its simplicity. However, it should be pointed out that this kind of algorithm ignores the semantic correlation between tags, so its sorting performance is not very good. Secondly, the algorithm also uses global visual features. In this paper, an improved image multi-label sorting algorithm is proposed. By introducing the sparse representation theory in the field of signal processing, the nearest neighbor image retrieval problem is transformed into a sparse reconstruction problem, which improves the selection of nearest neighbor image sets. In recent years, the combination of Compressed Sensing (CS) and feature selection theory and method to form more effective sparse representations for images has become a hot topic in the field of computer vision and machine learning. Tibshirani and Breiman of the University of California, Berkeley, et al. almost simultaneously proposed the idea of associating feature selection sparsely with (?) 1-norm constraints to make the selected features as sparse as possible and to improve the interpretability and accuracy of the data processing process. For statistical analysis of high-dimensional data, the theory and method of image semantic understanding can be studied on the basis of sparse representation. The idea of image multi-label sorting algorithm based on sparse representation proposed in this paper is as follows: Firstly, the algorithm essentially belongs to image multi-label sorting based on semantic correlation sorting. Given a test image to be sorted and a large set of labeled community images, we consider the test image to be sorted as a test sample to be reconstructed, and the large set of labeled community images as an over-complete dictionary. The sparse reconstruction of a few samples from the over-complete dictionary can be used to characterize the semantic similarity and correlation between each labeled image and the sample image in the bullet based on the sparse coefficient vector obtained from the learning. Therefore, each dimension of the sparse coefficient vector obtained from the learning represents the test sample image and the dictionary. Finally, the nearest neighbor image subset of the test image is obtained based on the acquired semantic correlation, and the frequency of each key word in the tag sequence is counted by the nearest neighbor voting strategy, and the tag sequence is sorted according to the frequency. Considering the semantic correlation between tags (i.e. the symbiosis relationship), the Random Walk algorithm is used to improve the sorting result and achieve the final sorting. We implement the algorithm proposed in this paper by using MATLAB programming language and carry out experimental verification on the NUS-WIDE image data set. Comparison is made to verify the effectiveness of our proposed sparse representation based image label sorting algorithm.
【學(xué)位授予單位】:北京交通大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2017
【分類號】:TP391.41
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 安朝輝;錢劍敏;;一種新的排序算法——端點(diǎn)排序算法[J];現(xiàn)代電子技術(shù);2011年24期
2 盧敏;黃亞樓;謝茂強(qiáng);王揚(yáng);劉杰;廖振;;代價敏感的列表排序算法[J];計(jì)算機(jī)研究與發(fā)展;2012年08期
3 張正鈾;;散列排序算法[J];廣西科學(xué)院學(xué)報;1982年01期
4 全惠云;;基于矩陣分裂法的一類異步N&行排序算法[J];計(jì)算技術(shù)與自動化;1991年01期
5 董德林;兩個高效排序算法的APPLESOFT BASIC程序[J];麗水師專學(xué)報;1992年S1期
6 王曉東;最優(yōu)堆排序算法[J];小型微型計(jì)算機(jī)系統(tǒng);2000年05期
7 吳江,張德同;二次分“檔”鏈接排序算法分析[J];計(jì)算機(jī)研究與發(fā)展;2001年08期
8 李德啟,王雄;一種新型快速的排序算法[J];計(jì)算機(jī)工程;2001年03期
9 趙忠孝;一種新的散列排序算法[J];電腦開發(fā)與應(yīng)用;2001年03期
10 許善祥,朱學(xué)東,邵敬春;選擇排序算法的改進(jìn)[J];佳木斯大學(xué)學(xué)報(自然科學(xué)版);2001年04期
相關(guān)會議論文 前10條
1 周曉方;金志權(quán);;尋找最佳分布式排序算法[A];第九屆全國數(shù)據(jù)庫學(xué)術(shù)會議論文集(上)[C];1990年
2 張艷秋;李建中;;一種基于蛇型磁帶的排序算法[A];第十八屆全國數(shù)據(jù)庫學(xué)術(shù)會議論文集(研究報告篇)[C];2001年
3 劉春陽;葉君峰;母海龍;陸秋霞;陳滄;高鶯;;一種商品標(biāo)題主題詞的重要性排序算法[A];第五屆全國信息檢索學(xué)術(shù)會議論文集[C];2009年
4 王少帥;湯慶新;姚路;;并行獨(dú)立集排序算法的改進(jìn)與實(shí)現(xiàn)[A];第十六屆全國青年通信學(xué)術(shù)會議論文集(上)[C];2011年
5 于芳;王大玲;于戈;陳冬玲;鮑玉斌;;面向用戶的排序算法研究[A];第二十四屆中國數(shù)據(jù)庫學(xué)術(shù)會議論文集(研究報告篇)[C];2007年
6 閆潑;馬軍;陳竹敏;;面向主題的網(wǎng)頁排序算法研究[A];第三屆全國信息檢索與內(nèi)容安全學(xué)術(shù)會議論文集[C];2007年
7 張健沛;李連江;楊靜;;個性化搜索引擎排序算法的研究與改進(jìn)[A];第三屆全國信息檢索與內(nèi)容安全學(xué)術(shù)會議論文集[C];2007年
8 吳志彬;陳義華;;ANP中超矩陣排序算法研究[A];2006中國控制與決策學(xué)術(shù)年會論文集[C];2006年
9 陳叢叢;石冰;陳健;;面向主題的查詢相關(guān)網(wǎng)頁排序算法[A];第三屆中國智能計(jì)算大會論文集[C];2009年
10 齊曼;張珩;;實(shí)時視覺仿真中幀連貫性應(yīng)用[A];'2000系統(tǒng)仿真技術(shù)及其應(yīng)用學(xué)術(shù)交流會論文集[C];2000年
相關(guān)重要報紙文章 前1條
1 廣東 黃陀;基本算法簡介(三)[N];電腦報;2001年
相關(guān)博士學(xué)位論文 前3條
1 趙立軍;基于歸并的高效排序算法的研究[D];中國科學(xué)院研究生院(計(jì)算技術(shù)研究所);1998年
2 崔筠;無向基因組的移位排序算法[D];山東大學(xué);2006年
3 郝凡昌;有向基因組復(fù)合操作重組排序算法研究[D];山東大學(xué);2011年
相關(guān)碩士學(xué)位論文 前10條
1 徐林龍;基于商品特征屬性的排序算法研究[D];西南交通大學(xué);2015年
2 陳浩;基于圖理論的圖像搜索結(jié)果重排序的研究[D];安徽大學(xué);2016年
3 雙全;基于用戶行為分析的搜索排序算法研究[D];華中科技大學(xué);2014年
4 王麒深;面向網(wǎng)絡(luò)輿情的社會情感排序算法研究[D];中國民航大學(xué);2012年
5 郭佳;一種SDN環(huán)境中的網(wǎng)絡(luò)節(jié)點(diǎn)重要性排序算法[D];西安電子科技大學(xué);2015年
6 馮少泳;兩層哈希的重排序算法[D];華南理工大學(xué);2016年
7 陸沛棟;基于可重構(gòu)SoC平臺的排序算法設(shè)計(jì)和自相關(guān)算法優(yōu)化[D];南京大學(xué);2017年
8 祁洋;RankNet學(xué)習(xí)排序算法的一種改進(jìn)[D];吉林大學(xué);2017年
9 RAPHAEL DE-SOUZA;基于稀疏編碼理論的圖像多標(biāo)簽排序算法研究[D];北京交通大學(xué);2017年
10 王靖;數(shù)據(jù)庫管理系統(tǒng)中高能效排序算法[D];浙江工業(yè)大學(xué);2012年
,本文編號:2225130
本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2225130.html