Rapid Search and Integration of Forestry Dynamic Information
[Abstract]: Forestry is a basic industry of China's national economy and carries the important mission of ecological construction and sustainable social development. People are the main beneficiaries of forestry development, and each plays a different role within the industry's structure. In recent years, forestry informatization has promoted the sharing of forestry information resources, which brings convenience to the public and advances the forestry industry. However, forestry informatization still needs further development. To make better use of forestry information resources and to serve researchers, teachers, and forestry workers in China's forestry science community, a way to search and collect forestry information quickly is urgently needed.
Finding the information a user needs within a mass of information has become a major problem for the public when searching in a specific field. Forestry information on the Internet is increasingly complex and disordered, and general-purpose search engines can no longer meet the public's need for personalized information. With a general search engine, users spend a lot of time and effort finding what they need, and both recall and precision for topic-specific information are relatively low. The public therefore urgently needs a forestry topic search engine with accurate classification, comprehensive data, and timely updates.
The research in this paper comes from a key project of the Hunan Science and Technology Program (2010 nk2004), led by the author's supervisor. Guided by theories from systems science, forestry, informatics, and statistics, the paper studies the search and integration of forestry dynamic information. It reviews domestic and international research on the topic and covers the demand analysis and classification of forestry dynamic information, the topic crawler, and the text classifier. The work is summarized as follows.
(1) Existing search engine theory and practice at home and abroad are analyzed comprehensively, showing the importance and necessity of building a forestry topic search engine, and the key technologies are studied in depth. The forestry topic search engine is divided into three layers: a data collection layer, a data storage layer, and a data presentation layer. The relevant methods at each of the three layers are discussed and summarized.
(2) Based on information published on web pages and on the needs of government departments and the public, the types of forestry dynamic information that are genuinely useful to them are defined. The required information is classified into seven groups to make the categories concrete, including forestry means of production, market supply and demand information for forest products, flower information, forestry policies and regulations, forestry labor information, and meteorological and environmental information.
(3) According to the established classification system for forestry dynamic information, relevant professional forestry websites are collected and the source sites for information collection are identified. The domain names of the sites that provide the needed data are recorded, the content under those domains is collected, and each collected site is labeled, which realizes the collection and classification of forestry dynamic information sources.
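As an illustration of the source identification step, the sketch below keeps a small registry that maps a site's domain name to an information category and checks whether a URL belongs to a registered source. The registry contents, the example domains, and the function name `source_category` are assumptions made for this sketch; they are not the sites or code used in the thesis.

```python
from urllib.parse import urlparse

# Hypothetical source registry: domain name -> information category.
# The domains are placeholders, not the websites collected in the thesis.
SOURCES = {
    "forestry.example.org": "forestry policies and regulations",
    "market.example.com": "market supply and demand information for forest products",
}


def source_category(url):
    """Return the category of the registered source serving `url`, or None."""
    host = (urlparse(url).hostname or "").lower()
    for domain, category in SOURCES.items():
        # Accept the registered domain itself and any of its subdomains.
        if host == domain or host.endswith("." + domain):
            return category
    return None
```

Matching on the full domain (or a dot-prefixed suffix) keeps look-alike hosts such as `notforestry.example.org` from being treated as a registered source.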
(4) A new search strategy based on content analysis and link structure analysis is used. Through combined analysis and evaluation, the topic relevance of the pages that candidate URLs point to is judged and the candidate URLs are ranked, yielding an optimized forestry topic crawler whose downloaded pages are related to forestry topics and ordered by decreasing importance.
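One way to picture a strategy that combines content analysis with link structure analysis is a best-first crawl: each candidate URL gets a score mixing a content signal with a link signal, and the highest-scoring candidate is fetched next. The sketch below is a minimal illustration on an in-memory page graph, not the algorithm from the thesis. The term list, the weight `alpha`, the relevance `threshold`, and the use of an in-link count as the link signal are all assumptions.

```python
import heapq

# Assumed topic vocabulary; a real crawler would use a much richer model.
TOPIC_TERMS = {"forest", "forestry", "timber", "seedling"}


def content_score(text):
    """Fraction of whitespace-separated words in `text` that are topic terms."""
    words = text.lower().split()
    if not words:
        return 0.0
    return sum(w in TOPIC_TERMS for w in words) / len(words)


def link_score(url, inlinks):
    """Link signal: number of in-links seen so far, squashed into [0, 1)."""
    n = inlinks.get(url, 0)
    return n / (n + 1)


def crawl(seeds, pages, alpha=0.7, threshold=0.1, limit=10):
    """Best-first crawl over `pages`: url -> (text, [(anchor_text, target_url)]).

    Returns the on-topic URLs in the order they were visited.
    """
    inlinks = {}
    frontier = [(-1.0, url) for url in seeds]  # max-heap via negated scores
    heapq.heapify(frontier)
    visited = set()
    kept = []
    while frontier and len(kept) < limit:
        _, url = heapq.heappop(frontier)
        if url in visited:
            continue  # stale entry left behind by a later re-scoring
        visited.add(url)
        text, links = pages.get(url, ("", []))
        if content_score(text) < threshold:
            continue  # off-topic page: neither kept nor expanded
        kept.append(url)
        for anchor, target in links:
            inlinks[target] = inlinks.get(target, 0) + 1
            if target not in visited:
                score = (alpha * content_score(anchor)
                         + (1 - alpha) * link_score(target, inlinks))
                heapq.heappush(frontier, (-score, target))
    return kept


# Toy page graph standing in for downloaded pages.
PAGES = {
    "seed": ("forestry news portal forest",
             [("timber market", "a"), ("football scores", "b"),
              ("forest seedling prices", "c")]),
    "a": ("timber prices forest products", [("forest seedling prices", "c")]),
    "b": ("football league results", []),
    "c": ("seedling forest nursery", []),
}
```

Calling `crawl(["seed"], PAGES)` visits the pages whose anchor text looks most topical first and drops the off-topic page `b`. Re-pushing a URL whenever a new in-link is found lets its score rise as link evidence accumulates; stale heap entries are skipped when popped.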
(5) Using SVM-based automatic text categorization, a classifier is trained on sample data, and the forestry dynamic information collected by the topic crawler is classified and stored, which optimizes the data collection layer of the forestry topic search engine.
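To make the SVM classification step concrete, the sketch below trains a binary linear SVM on bag-of-words features using sub-gradient descent on the regularized hinge loss (a Pegasos-style update, run in a fixed sample order so the result is deterministic). It is an illustration under assumptions, not the classifier built in the thesis: the training texts are invented, only two of the categories are used, and the feature extraction is a plain word count. A multi-class version could train one such classifier per category.

```python
from collections import Counter


def features(text):
    """Bag-of-words term counts for a whitespace-tokenized text."""
    return Counter(text.lower().split())


def train_linear_svm(samples, lam=0.01, epochs=50):
    """Train a binary linear SVM; `samples` is a list of (text, label in {+1, -1})."""
    w = {}
    t = 0
    for _ in range(epochs):
        for text, y in samples:
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            x = features(text)
            margin = y * sum(w.get(f, 0.0) * v for f, v in x.items())
            # Shrink step from the L2 regularizer: w <- (1 - eta * lam) * w.
            scale = 1.0 - 1.0 / t
            for f in w:
                w[f] *= scale
            # Hinge-loss step: only samples inside the margin move the weights.
            if margin < 1:
                for f, v in x.items():
                    w[f] = w.get(f, 0.0) + eta * y * v
    return w


def predict(w, text):
    """Return +1 or -1 according to the sign of the linear score."""
    score = sum(w.get(f, 0.0) * v for f, v in features(text).items())
    return 1 if score >= 0 else -1


# Invented training data: +1 = forestry policies and regulations,
# -1 = market supply and demand information for forest products.
TRAIN = [
    ("forestry law regulation notice", 1),
    ("forest policy regulation issued", 1),
    ("timber price market rise", -1),
    ("seedling market price supply", -1),
]

MODEL = train_linear_svm(TRAIN)
```

With this toy model, `predict(MODEL, "new forestry regulation")` returns `1` and `predict(MODEL, "timber market price")` returns `-1`. In practice an established SVM library and a proper Chinese word segmenter would replace the hand-written trainer and the whitespace tokenizer.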
The search and integration of forestry dynamic information builds on the study and optimization of existing search and integration technology and incorporates the public's demand for forestry dynamic information. The accuracy, comprehensiveness, and success rate of public access to forestry dynamic information are significantly improved. New methods and technologies will be further applied to the rapid search and integration of forestry dynamic information, and forestry information management and services will take a new step forward.
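[Degree-granting institution]: Central South University of Forestry and Technology
[Degree level]: Master's
[Year degree granted]: 2013
[Classification number]: S712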