基于MSER的自然場(chǎng)景文本定位算法研究
發(fā)布時(shí)間:2019-06-18 19:03
【摘要】:自然場(chǎng)景圖像中的文本含有大量語(yǔ)義信息,是對(duì)圖像場(chǎng)景的重要補(bǔ)充。隨著智能手機(jī)、平板電腦和數(shù)碼相機(jī)的普及,人們?cè)絹碓饺菀撰@取高質(zhì)量的場(chǎng)景圖像。從自然場(chǎng)景圖像中提取文本信息不僅有助于人們更深層次地理解場(chǎng)景,而且在檢索、查詢以及視覺輔助系統(tǒng)中有重要用途。準(zhǔn)確提取自然場(chǎng)景中的文本信息的前提是精確定位文本區(qū)域,自然場(chǎng)景文本定位面臨著圖像背景復(fù)雜、字體多樣以及遮擋、模糊等難題,是一個(gè)極具挑戰(zhàn)性的研究課題。本文對(duì)自然場(chǎng)景文本定位的相關(guān)技術(shù)進(jìn)行探索,提出了一種新的基于最大穩(wěn)定極值區(qū)域的自然場(chǎng)景文本定位算法框架。本文的主要貢獻(xiàn)如下:(1)針對(duì)MSER檢測(cè)器檢測(cè)文本候選區(qū)域的重復(fù)檢測(cè)問題,提出了一種基于區(qū)域變化率的MSER重復(fù)嵌套區(qū)域刪除規(guī)則。首先對(duì)圖像進(jìn)行預(yù)處理,從各個(gè)顏色通道中提取出MSER,然后根據(jù)區(qū)域的變化率以及包含關(guān)系,刪除重復(fù)檢測(cè)的區(qū)域。(2)針對(duì)低分辨率或者有陰影的圖像,相鄰字符之間存在邊緣粘連的問題,本文用邊緣增強(qiáng)的MSER作為字符候選區(qū)域,并且在此基礎(chǔ)上設(shè)計(jì)了一種由粗到細(xì)的字符候選區(qū)域驗(yàn)證規(guī)則。首先利用區(qū)域的形狀特征設(shè)計(jì)了驗(yàn)證候選字符區(qū)域的啟發(fā)式規(guī)則,然后結(jié)合區(qū)域的筆畫寬度變換和支持向量機(jī)實(shí)現(xiàn)字符區(qū)域的確認(rèn)。(3)設(shè)計(jì)了一種基于字符區(qū)域特征相似性的文本行建立方法,將從多個(gè)通道中提取出的字符區(qū)域合并為能夠表達(dá)完整語(yǔ)義信息的文本行。為了驗(yàn)證提出算法的性能,分別在ICDAR 2003、ICDAR 2013和SVT三個(gè)公開數(shù)據(jù)庫(kù)進(jìn)行了仿真實(shí)驗(yàn),得到了良好的實(shí)驗(yàn)效果。
[Abstract]:The text in natural scene image contains a lot of semantic information, which is an important supplement to image scene. With the popularity of smartphones, tablets and digital cameras, it is more and more easy to obtain high-quality scene images. Extracting text information from natural scene images not only helps people to understand the scene more deeply, but also plays an important role in retrieval, query and visual assistance system. The premise of accurately extracting text information from natural scene is to accurately locate text area. Natural scene text location is faced with complex image background, diverse fonts, occlusion, blur and other problems, which is a very challenging research topic. In this paper, the related technologies of natural scene text location are explored, and a new natural scene text location algorithm framework based on maximum stable extremum region is proposed. The main contributions of this paper are as follows: (1) in order to solve the problem of repeated detection of text candidate regions detected by MSER detector, a MSER repeated nesting region deletion rule based on region change rate is proposed. Firstly, the image is preprocessed, and then the MSER, is extracted from each color channel, and then the repeated detection area is deleted according to the change rate of the region and the inclusion relationship. (2) aiming at the problem of edge adhesion between adjacent characters in low resolution or shadowed images, this paper uses edge enhanced MSER as character candidate region, and on this basis, designs a verification rule of character candidate region from thick to fine. Firstly, the heuristic rules for verifying the candidate character region are designed by using the shape features of the region, and then the recognition of the character region is realized by combining the stroke width transformation of the region and the support vector machine. (3) A text line establishment method based on the feature similarity of the character region is designed, which merges the character region extracted from multiple channels into a text line that can express the complete semantic information. In order to verify the performance of the proposed algorithm, three public databases, ICDAR 2013 and SVT, are simulated in ICDAR 2003, and good experimental results are obtained.
【學(xué)位授予單位】:西安科技大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2017
【分類號(hào)】:TP391.1
本文編號(hào):2501713
[Abstract]:The text in natural scene image contains a lot of semantic information, which is an important supplement to image scene. With the popularity of smartphones, tablets and digital cameras, it is more and more easy to obtain high-quality scene images. Extracting text information from natural scene images not only helps people to understand the scene more deeply, but also plays an important role in retrieval, query and visual assistance system. The premise of accurately extracting text information from natural scene is to accurately locate text area. Natural scene text location is faced with complex image background, diverse fonts, occlusion, blur and other problems, which is a very challenging research topic. In this paper, the related technologies of natural scene text location are explored, and a new natural scene text location algorithm framework based on maximum stable extremum region is proposed. The main contributions of this paper are as follows: (1) in order to solve the problem of repeated detection of text candidate regions detected by MSER detector, a MSER repeated nesting region deletion rule based on region change rate is proposed. Firstly, the image is preprocessed, and then the MSER, is extracted from each color channel, and then the repeated detection area is deleted according to the change rate of the region and the inclusion relationship. (2) aiming at the problem of edge adhesion between adjacent characters in low resolution or shadowed images, this paper uses edge enhanced MSER as character candidate region, and on this basis, designs a verification rule of character candidate region from thick to fine. Firstly, the heuristic rules for verifying the candidate character region are designed by using the shape features of the region, and then the recognition of the character region is realized by combining the stroke width transformation of the region and the support vector machine. (3) A text line establishment method based on the feature similarity of the character region is designed, which merges the character region extracted from multiple channels into a text line that can express the complete semantic information. In order to verify the performance of the proposed algorithm, three public databases, ICDAR 2013 and SVT, are simulated in ICDAR 2003, and good experimental results are obtained.
【學(xué)位授予單位】:西安科技大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2017
【分類號(hào)】:TP391.1
【參考文獻(xiàn)】
相關(guān)期刊論文 前5條
1 趙丹;;SVM核函數(shù)與選擇算法[J];數(shù)字技術(shù)與應(yīng)用;2014年09期
2 管士勇;陸利忠;閆鑌;童莉;;一種基于穩(wěn)定區(qū)域的圖像特征描述子[J];計(jì)算機(jī)工程;2012年18期
3 王國(guó)鋒;宋鵬飛;張?zhí)N靈;;智能交通系統(tǒng)發(fā)展與展望[J];公路;2012年05期
4 晉瑾;平西建;張濤;陳明貴;;圖像中的文本定位技術(shù)研究綜述[J];計(jì)算機(jī)應(yīng)用研究;2007年06期
5 ;Automatic character detection and segmentation in natural scene images[J];Journal of Zhejiang University Science A(Science in Engineer;2007年01期
相關(guān)碩士學(xué)位論文 前3條
1 黃攀;基于深度學(xué)習(xí)的自然場(chǎng)景文字識(shí)別[D];浙江大學(xué);2016年
2 吳慧;面向盲人視覺輔助系統(tǒng)的自然場(chǎng)景文本檢測(cè)[D];中南大學(xué);2014年
3 馬海清;基于邊緣和紋理的文本定位算法的研究[D];哈爾濱工業(yè)大學(xué);2007年
,本文編號(hào):2501713
本文鏈接:http://sikaile.net/kejilunwen/ruanjiangongchenglunwen/2501713.html
最近更新
教材專著