面向網(wǎng)頁文本的地理信息變化語義檢測方法研究
本文選題:地理信息 + 變化檢測 ; 參考:《南京師范大學(xué)》2013年碩士論文
【摘要】:本論文依托國家測繪科技項(xiàng)目“網(wǎng)絡(luò)地理信息變化檢測技術(shù)研究”,面向地理信息動態(tài)持續(xù)更新需求,針對網(wǎng)絡(luò)文本中地理信息變化語言的描述特點(diǎn),集成地名識別、時(shí)間和屬性抽取技術(shù),較為系統(tǒng)地探索了面向網(wǎng)頁文本的地理信息變化語義檢測方法,研發(fā)了相應(yīng)的原型系統(tǒng),并進(jìn)行了實(shí)驗(yàn)驗(yàn)證分析。主要研究內(nèi)容和結(jié)論包括以下三個(gè)方面: (1)地理信息變化語義檢測知識庫構(gòu)建:通過對地理信息變化網(wǎng)頁中變化信息內(nèi)容和相關(guān)網(wǎng)頁獲取方法的分析,構(gòu)建了包括要素特征詞匯庫、要素變化詞匯庫、停用詞詞匯庫、空間關(guān)系詞匯庫等在內(nèi)的變化檢測知識庫,利用Protege本體編輯器和OWL API實(shí)現(xiàn)了知識庫的管理和維護(hù)。 (2)地理信息變化網(wǎng)頁獲取與信息解析:設(shè)計(jì)了基于搜索引擎的主題爬蟲和通用主題爬蟲,有效解決了相關(guān)網(wǎng)頁的廣度和深度搜索問題;構(gòu)建了網(wǎng)頁可信度計(jì)算模型,為虛假網(wǎng)頁的甄別提供了依據(jù);在時(shí)間抽取和地名識別技術(shù)基礎(chǔ)上,提出了知識庫驅(qū)動的網(wǎng)頁文本中地理信息變化特征信息抽取方法。 (3)原型系統(tǒng)研發(fā)和實(shí)驗(yàn)驗(yàn)證:在上述研究基礎(chǔ)之上,借助富客戶端技術(shù)設(shè)計(jì)了面向網(wǎng)頁文本的地理信息變化語義檢測原型系統(tǒng),實(shí)現(xiàn)了知識庫管理、變化信息獲取和空間統(tǒng)計(jì)分析等功能,并以南京地區(qū)為例,進(jìn)行了實(shí)驗(yàn)驗(yàn)證分析。 研究表明,地理信息涉及地理要素類型較多,知識庫構(gòu)建是一個(gè)較為復(fù)雜的系統(tǒng)工程,采用人工歸納存在一定的局限性。不同地理要素類型的變化信息在網(wǎng)頁文本中的出現(xiàn)頻率差異較大,交通和水系類地理信息解析效果較好。另外,大多數(shù)從網(wǎng)頁文本中獲取的地理信息變化屬性為定性描述,而且具有較強(qiáng)的模糊性和不確定性,需要結(jié)合其他地理信息數(shù)據(jù)源進(jìn)行進(jìn)一步驗(yàn)證。
[Abstract]:This paper relies on the national surveying and mapping project "the research of the network geographic information change detection technology", facing the geographical information dynamic continuous renewal demand, according to the description characteristic of the geographical information change language in the network text, integrates the place name recognition, Based on the technology of time and attribute extraction, this paper systematically explores the semantic detection method of geographic information change oriented to web page text, develops the corresponding prototype system, and carries out the experimental verification and analysis. The main research contents and conclusions include the following three aspects: 1) Construction of knowledge base for semantic detection of geographic information change: by analyzing the content of change information and the methods of obtaining relevant web pages, this paper constructs a lexical database including the feature of elements, the lexicon of change of elements, and the vocabulary of inactive words. The Protege ontology editor and OWL API are used to manage and maintain the knowledge base. (2) Web page acquisition and information analysis of geographic information change: a search engine based topic crawler and a general theme crawler are designed, which effectively solve the breadth and depth search problems of related web pages, and a web credibility calculation model is constructed. Based on the technology of time extraction and place name recognition, this paper proposes a method of extracting the feature information of geographical information change in web pages driven by knowledge base. Research and development of prototype system and experimental verification: on the basis of the above research, a prototype system for semantic detection of geographic information change oriented to web page text is designed with the help of rich client technology, and the knowledge base management is realized. Taking Nanjing area as an example, the functions of information acquisition and spatial statistical analysis are analyzed. The research shows that geographical information involves many types of geographical elements, and the construction of knowledge base is a more complex system engineering, and the artificial induction has some limitations. The frequency of changing information of different geographic element types in the text of web pages is quite different, and the interpretation effect of traffic and river type geographic information is better. In addition, most of the geographic information attributes obtained from the text of the web page are qualitative description, and have strong fuzziness and uncertainty, which need to be further verified with other geographic information data sources.
【學(xué)位授予單位】:南京師范大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2013
【分類號】:P208;TP391.1
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 田浩,蔣理興,時(shí)三帥,張強(qiáng);多源遙感影像融合技術(shù)及其在更新土地利用數(shù)據(jù)庫中的應(yīng)用[J];測繪工程;2005年01期
2 梅洋;陸苗;;基于遙感影像的變化檢測研究動態(tài)[J];地理信息世界;2009年02期
3 劉直芳,張繼平;變化檢測方法及其在城市中的應(yīng)用[J];測繪通報(bào);2002年09期
4 孫成忠,李成名,洪志剛;基于衛(wèi)星遙感影像的城市地圖快速更新技術(shù)[J];測繪通報(bào);2002年12期
5 陳文慧;陳冬暉;王占新;;數(shù)字測圖技術(shù)在阜新市地理數(shù)據(jù)庫更新中的應(yīng)用[J];測繪與空間地理信息;2008年01期
6 李麗雙;黨延忠;廖文平;黃德根;張穎;;CRF與規(guī)則相結(jié)合的中文地名識別[J];大連理工大學(xué)學(xué)報(bào);2012年02期
7 張雪英;閭國年;李伯秋;陳文君;;基于規(guī)則的中文地址要素解析方法[J];地球信息科學(xué)學(xué)報(bào);2010年01期
8 張春菊;張雪英;朱少楠;徐希濤;;基于網(wǎng)絡(luò)爬蟲的地名數(shù)據(jù)庫維護(hù)方法[J];地球信息科學(xué)學(xué)報(bào);2011年04期
9 鄒進(jìn)貴;潘正風(fēng);虞暉;隗劍秋;;城市基礎(chǔ)地理信息系統(tǒng)數(shù)據(jù)更新方法的研究[J];地理空間信息;2005年06期
10 周立;鄧云青;;城市地理信息系統(tǒng)數(shù)據(jù)更新方式研究[J];地理空間信息;2008年05期
,本文編號:1905503
本文鏈接:http://sikaile.net/kejilunwen/dizhicehuilunwen/1905503.html