Research on a Forestry Topic Search Engine
[Abstract]: Search engines are the primary tool for retrieving information from the vast amount of content on the web, and they are a central subject of network research and applications. With the explosive growth and diversification of Internet information, topic search engines have become a research focus and a development trend. This thesis studies techniques for collecting and retrieving topic-specific information from the Chinese Web, and designs and implements a forestry topic search engine, FIS (Forestry Information Search), built around the topic information collector FRobot. It first introduces the development, current state, classification, and working principles of general-purpose search engines, and points out their shortcomings and directions for development. It then summarizes the background and working mode of topic search engines and discusses their key technologies in detail, including information retrieval models, topic information collection strategies, the Fish algorithm, and weighted indexing and retrieval. On this basis, the thesis adopts the mature Vector Space Model (VSM) and an improved Fish algorithm, combined with HTML document analysis, home page association, content prediction, and database full-text indexing, to give a design scheme for a topic search engine and to implement the forestry topic search engine system FIS. The system targets the forestry domain. It is designed to collect forestry information comprehensively and keep it up to date, reduce search noise, improve retrieval efficiency, and answer forestry-specific queries quickly, completely, and accurately. Finally, the thesis summarizes the experience gained in developing the forestry topic search engine system, and points out the system's application prospects and directions for future research.
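As a rough illustration of how a Vector Space Model can be used to judge whether a collected page belongs to a topic, the sketch below scores a page against a topic vector by cosine similarity over raw term counts. It is not the FIS or FRobot implementation: the function names, the use of unweighted term frequencies, and the threshold value are all assumptions made for this example.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors."""
    # Only terms present in both vectors contribute to the dot product.
    shared = set(a) & set(b)
    dot = sum(a[t] * b[t] for t in shared)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as unrelated to everything
    return dot / (norm_a * norm_b)


def is_on_topic(page_tokens, topic_vector, threshold=0.2):
    """Hypothetical relevance test: keep a page if it is close enough to the topic.

    The threshold of 0.2 is an arbitrary illustrative value.
    """
    return cosine_similarity(Counter(page_tokens), topic_vector) >= threshold


# Hypothetical topic description for the forestry domain.
topic = Counter({"forest": 2, "timber": 1})
print(is_on_topic(["forest", "timber", "forest"], topic))
print(is_on_topic(["stock", "market"], topic))
```

A topic crawler could apply such a test to each fetched page and follow links only from pages that pass it. A real system would typically replace the raw counts with weighted terms (for example TF-IDF) and tokenize Chinese text with a word segmenter.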
【Degree-granting institution】: Beijing Forestry University
【Degree level】: Master's
【Year degree awarded】: 2005
【Classification number】: TP393.09; S712