Design and Implementation of a Crawler-Based Small Enterprise Search System
[Abstract]: As the Internet has developed, it has become a primary means of obtaining information, and the volume of content on some web portals is growing rapidly. Faced with this volume, retrieving the latest and most relevant information comprehensively and accurately has become essential. Traditional web search engines have shortcomings such as slow index updates and low accuracy. To improve information retrieval for specific websites, this thesis designs a search engine system based on crawler technology. It first introduces the research background of crawler-based search engine systems and surveys existing search technology in China and abroad. It then carries out a requirements analysis, which identifies real-time updating and high accuracy as the key requirements, and presents the overall design framework, the division into modules, and a description of each module. In this system, Maven is used for project management; Velocity template technology is used to implement the web robot (crawler); the search framework is built on Compass and Chinese word segmentation and is designed using the Service pattern; and J2EE technologies such as WebWork and Spring are adopted together with the MVC pattern. The Command pattern and several RPC technologies are used to implement a variety of search interfaces. The system provides a general-purpose vertical search service for enterprises that is real-time and versatile, and it can be integrated easily with enterprise applications through these search interfaces.
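The abstract names the Command pattern as the way several search interfaces share one search service. The sketch below is an illustration of that general idea only, not the thesis's implementation: it is written in Python rather than the J2EE stack the thesis uses, and every name in it (`SearchService`, `CommandDispatcher`, the `index` and `search` command names) is hypothetical. The toy in-memory index and whitespace tokenization are stand-ins for Compass and Chinese word segmentation.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Set


class SearchService:
    """Toy in-memory inverted index standing in for the real search back end."""

    def __init__(self) -> None:
        self._index: Dict[str, Set[str]] = defaultdict(set)

    def add(self, doc_id: str, text: str) -> None:
        # Whitespace tokenization only; a stand-in for real word segmentation.
        for token in text.lower().split():
            self._index[token].add(doc_id)

    def search(self, query: str) -> List[str]:
        # Return the ids of documents that contain every query token.
        tokens = query.lower().split()
        if not tokens:
            return []
        hits = set.intersection(*(self._index.get(t, set()) for t in tokens))
        return sorted(hits)


class CommandDispatcher:
    """Maps command names to actions so any front end can reuse them."""

    def __init__(self, service: SearchService) -> None:
        self._commands: Dict[str, Callable[[dict], object]] = {
            "index": lambda p: service.add(p["id"], p["text"]),
            "search": lambda p: service.search(p["query"]),
        }

    def execute(self, name: str, params: dict) -> object:
        if name not in self._commands:
            raise ValueError(f"unknown command: {name}")
        return self._commands[name](params)
```

Under these assumptions, an HTTP endpoint and an RPC endpoint would each only need to translate an incoming request into a command name and a parameter dictionary, then call `execute`; the search logic itself stays in one place.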
[Degree-granting institution]: Dalian University of Technology
[Degree level]: Master's
[Year degree awarded]: 2012
[Classification number]: TP391.3