
Design and Implementation of a Crawler-Based Search System for Small Enterprises

Published: 2019-01-10 11:14
[Abstract]: As the Internet continues to develop, the web has gradually become a primary means of obtaining information, and the content of some portal sites is growing at a striking rate. Faced with this volume of information, obtaining the latest and most useful information more comprehensively and accurately has become a primary condition for seizing opportunities and meeting challenges. For such portal sites, traditional web search engines suffer from shortcomings such as slow updates and low accuracy. To improve information retrieval for specific websites, this project designs a site search engine system based on crawler technology.

The thesis first briefly introduces and analyzes the research background of crawler-based search engine systems and the existing search technologies in China and abroad. It then carries out a requirements analysis, which establishes that the system should be real-time and highly accurate, and on that basis presents the overall design framework, the module division, and a description of each module.

The system uses Maven for project management and Velocity template technology to implement the web robot. The search framework is built on Compass and Chinese word segmentation and is designed with the Service pattern. Multiple search interfaces are implemented with J2EE technologies such as Webwork and Spring, the MVC and Command patterns, and several RPC technologies. The system can provide enterprises with a general-purpose vertical search service that is real-time and broadly applicable, and its multiple search interfaces make it easy to integrate with enterprise applications.
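The crawl, segment, index, and search flow summarized in the abstract can be illustrated with a small sketch. This is not the thesis's implementation: the thesis builds on Compass (which sits on top of Lucene) and a Chinese word segmenter it does not name here, whereas the sketch below uses a plain in-memory inverted index and overlapping character bigrams as a stand-in for word segmentation. The class name `SiteSearchSketch`, its methods, and the sample URLs are all hypothetical. The sample text is ASCII only to keep the file encoding-neutral; the same bigram split applies unchanged to Chinese strings.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch: an in-memory inverted index fed by a crawler.
public class SiteSearchSketch {
    // Inverted index: token -> sorted set of page URLs that contain it.
    private final Map<String, Set<String>> index = new HashMap<>();

    // Assumption: overlapping character bigrams replace real word segmentation.
    public static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String chunk : text.toLowerCase().trim().split("\\s+")) {
            if (chunk.isEmpty()) continue;
            if (chunk.length() == 1) tokens.add(chunk);
            for (int i = 0; i + 1 < chunk.length(); i++) {
                tokens.add(chunk.substring(i, i + 2));
            }
        }
        return tokens;
    }

    // Would be called by the crawler once per fetched page.
    public void addPage(String url, String body) {
        for (String token : tokenize(body)) {
            index.computeIfAbsent(token, k -> new TreeSet<>()).add(url);
        }
    }

    // Returns the pages that contain every bigram of the query.
    public Set<String> search(String query) {
        Set<String> result = null;
        for (String token : tokenize(query)) {
            Set<String> hits = index.getOrDefault(token, Collections.emptySet());
            if (result == null) {
                result = new TreeSet<>(hits);
            } else {
                result.retainAll(hits);
            }
        }
        return result == null ? Collections.emptySet() : result;
    }

    public static void main(String[] args) {
        SiteSearchSketch site = new SiteSearchSketch();
        site.addPage("/news/1", "enterprise search portal");
        site.addPage("/news/2", "web crawler");
        System.out.println(site.search("search")); // prints [/news/1]
    }
}
```

Because the query is matched as an unordered set of bigrams, a page can match without containing the query as a contiguous phrase. A dictionary-based segmenter or positional index, as a Lucene-based stack would provide, avoids such false positives.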
[Degree-granting institution]: Dalian University of Technology
[Degree level]: Master's
[Year degree granted]: 2012
[Classification number]: TP391.3




Article ID: 2406239


Article link: http://sikaile.net/kejilunwen/sousuoyinqinglunwen/2406239.html

