蒙古文文檔圖像版面分析及識(shí)別后處理的研究與實(shí)現(xiàn)
[Abstract]:The research of optical character recognition (OCR) technology has been developed rapidly in recent years. Character recognition rate is the most important performance index in OCR system. For printed Mongolian character recognition system, it is necessary to perfect the whole system and improve the recognition rate of Mongolian characters. It is necessary to study and implement the layout analysis technology of Mongolian document image in the early stage and the post processing technology in the later stage. Therefore, the main content of this paper includes two parts, one is the layout analysis of Mongolian document images, the other is the post-processing of Mongolian text recognition. In the process of printed Mongolian character recognition, layout analysis is a very important basic work, but at present, there are few researches on layout analysis of Mongolian document image, and Mongolian document image has a variety of layout forms, and there are characters and pictures. The mixed arrangement of various layout elements, such as tables, brings many difficulties to the recognition of printed Mongolian characters. In this paper, a bottom-up and top-down layout analysis method is used to remove the non-text part, only the text part, by marking the connected domain, merging the connected domain, removing the connected domain, and so on. After paragraph division, the location information of each paragraph is obtained, which can be used for subsequent page restoration. In Mongolian character recognition system, the result of document image segmentation and recognition is Mongolian font coding. The post-processing of this paper is the process of converting the result of font recognition into international standard coding. The coding conversion method based on contrast dictionary is adopted in this paper. Firstly, we need to convert the existing international standard code dictionary (covering 50553 Mongolian words) into word document and PDF file in turn. Finally, the images are converted into pictures, and the layout analysis and column segmentation, word segmentation and character segmentation are carried out. The Mongolian character element image obtained by the segmentation is used as the input of the trained convolution neural network classifier, and the output is Mongolian font coding. The existing international standard code dictionaries and the obtained glyph codes are arranged into a coding conversion dictionary according to the one-to-one correspondence. After the post-processing, we can find the corresponding international standard code in the dictionary and complete the coding conversion process by looking up the position of the glyph code which is the same as the recognition result in the arranged dictionary. The Mongolian document image layout analysis technology studied in this paper can process the Mongolian document image in many complicated layout formats, including removing the text part, dividing the text area into paragraphs and marking the paragraph position, etc. A certain number of samples were tested, and the accuracy of layout analysis reached 97.87. The post-processing in this paper can quickly, effectively and accurately convert the recognition result of Mongolian font coding into international standard code, which makes the printed Mongolian character recognition system more perfect.
【學(xué)位授予單位】:內(nèi)蒙古大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2017
【分類號(hào)】:TP391.4
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 楊戈;張威強(qiáng);黃靜;;一個(gè)感知機(jī)神經(jīng)網(wǎng)絡(luò)字符識(shí)別器的實(shí)現(xiàn)[J];電子技術(shù)應(yīng)用;2015年03期
2 單煜翔;陳諧;史永哲;劉加;;基于擴(kuò)展N元文法模型的快速語(yǔ)言模型預(yù)測(cè)算法[J];自動(dòng)化學(xué)報(bào);2012年10期
3 王健;哈力木拉提·買買提;;印刷體維吾爾文識(shí)別后處理[J];新疆大學(xué)學(xué)報(bào)(自然科學(xué)版);2011年02期
4 蘇志祁;方康玲;;一種鋼筋圖像自動(dòng)計(jì)數(shù)的方法[J];現(xiàn)代電子技術(shù);2010年06期
5 董廣宇;呂學(xué)強(qiáng);王濤;施水才;;基于N-gram語(yǔ)言模型的漢字識(shí)別后處理研究[J];微計(jì)算機(jī)信息;2009年10期
6 魏宏喜;高光來(lái);;一種基于連通域的蒙古文文檔圖像版面分析方法[J];內(nèi)蒙古大學(xué)學(xué)報(bào)(自然科學(xué)版);2007年05期
7 魏宏喜;高光來(lái);;印刷體蒙古文字識(shí)別中蒙古文字特征的選擇[J];內(nèi)蒙古大學(xué)學(xué)報(bào)(自然科學(xué)版);2006年06期
8 張廣淵;李晶皎;王愛(ài)俠;;基于知識(shí)的滿文識(shí)別后處理[J];計(jì)算機(jī)輔助工程;2006年03期
9 趙驥;李晶皎;王麗君;張繼生;;基于HMM的滿文文本識(shí)別后處理的研究[J];中文信息學(xué)報(bào);2006年04期
10 徐兆軍,業(yè)寧,王厚立;基于神經(jīng)網(wǎng)絡(luò)的版面分析[J];計(jì)算機(jī)應(yīng)用;2004年S2期
相關(guān)博士學(xué)位論文 前2條
1 趙于前;基于數(shù)學(xué)形態(tài)學(xué)的醫(yī)學(xué)圖像處理理論與方法研究[D];中南大學(xué);2006年
2 劉建勝;文檔圖象版面理解的研究[D];重慶大學(xué);2002年
相關(guān)碩士學(xué)位論文 前9條
1 姚志鵬;基于Hadoop平臺(tái)的印刷體蒙古文字識(shí)別系統(tǒng)的研究與實(shí)現(xiàn)[D];內(nèi)蒙古大學(xué);2016年
2 張文杰;基于移動(dòng)終端的報(bào)紙版面分析及識(shí)別[D];北京郵電大學(xué);2014年
3 施晟;文檔圖像的版面分析技術(shù)研究[D];中南大學(xué);2011年
4 郭軍;信息資源數(shù)字化文本型數(shù)字圖像OCR識(shí)別準(zhǔn)確度影響因素及提高策略研究[D];鄭州大學(xué);2011年
5 黨興;復(fù)雜的中文文檔圖像版面分析研究[D];蘇州大學(xué);2010年
6 包艷花;蒙古文識(shí)別文本后處理相關(guān)技術(shù)研究[D];內(nèi)蒙古大學(xué);2007年
7 魏宏喜;印刷體蒙古文字識(shí)別中關(guān)鍵技術(shù)的研究[D];內(nèi)蒙古大學(xué);2006年
8 鄧立國(guó);基于多層次可信度指導(dǎo)下的自底向上版面分析[D];西華大學(xué);2006年
9 楊芳;基于紋理分析的印刷字體識(shí)別研究及應(yīng)用[D];河北大學(xué);2003年
,本文編號(hào):2118662
本文鏈接:http://sikaile.net/shoufeilunwen/xixikjs/2118662.html