基于P2P全文檢索系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)
本文選題:C/S + P2P; 參考:《吉林大學(xué)》2013年碩士論文
【摘要】:P2P技術(shù)作為一種新興的網(wǎng)絡(luò)模型,占據(jù)了互聯(lián)網(wǎng)業(yè)務(wù)總量的百分之六十以上,被人們稱為寬帶互聯(lián)網(wǎng)應(yīng)用的“殺手級(jí)”技術(shù)。P2P技術(shù)與傳統(tǒng)的客戶端/服務(wù)器模型相對(duì)比,在消除服務(wù)器瓶頸以及網(wǎng)絡(luò)資源利用率等方面優(yōu)勢(shì)較為明顯。 JXTA是Sun微系統(tǒng)對(duì)等網(wǎng)絡(luò)(P2P)的標(biāo)準(zhǔn),供P2P程序所需的基礎(chǔ)服務(wù)。該技術(shù)致力于創(chuàng)建一個(gè)通用的平臺(tái),以簡(jiǎn)單而有效的方式構(gòu)建特定的對(duì)等式和分布式服務(wù)與應(yīng)用。使得開發(fā)者不需要過多考慮如何解決對(duì)等計(jì)算的技術(shù)問題,而可以專注于如何實(shí)現(xiàn)與完善可擴(kuò)展、互操作性強(qiáng)且具有高可用性的高層應(yīng)用。 Lucene是一個(gè)開源的項(xiàng)目,提供了相對(duì)完整的文本檢索的功能,Apache開發(fā)此項(xiàng)目的目的就在于為程序開發(fā)者設(shè)計(jì)一個(gè)即容易理解,又功能完備的檢索工具。以此為基礎(chǔ),開發(fā)人員可以快速實(shí)現(xiàn)全文檢索的功能。 本文首先介紹了搜索引擎的發(fā)展趨勢(shì)、關(guān)鍵技術(shù),在此基礎(chǔ)上對(duì)傳統(tǒng)搜索引擎面臨的挑戰(zhàn)以及基于P2P的搜索引擎的優(yōu)勢(shì)進(jìn)行了深入的分析。接著本文對(duì)P2P搜索引擎的關(guān)鍵技術(shù)——JXTA技術(shù),,Lucene檢索工具進(jìn)行了研究,包括JXTA的基本概念及協(xié)議規(guī)范,然后對(duì)Lucene技術(shù)做了研究,包括其特點(diǎn),以及一些實(shí)用類進(jìn)行詳盡的研究介紹。本文接著介紹了請(qǐng)求路由算法的基本思想,研究論述了基于k-高頻詞主題相關(guān)搜索路由算法。本文建立了基于P2P的全文檢索引擎系統(tǒng)原型,描述系統(tǒng)工作流程,詳細(xì)介紹各模塊的具體設(shè)計(jì)。本文在對(duì)相關(guān)只是進(jìn)行詳盡研究的基礎(chǔ)上編程實(shí)現(xiàn)基于P2P的文獻(xiàn)檢索系統(tǒng)各模塊功能,并對(duì)系統(tǒng)功能進(jìn)行了測(cè)試。最后本文對(duì)本課題的研究進(jìn)行了總結(jié),并對(duì)未來的研究做了簡(jiǎn)單的展望。
[Abstract]:As a new network model, P2P technology accounts for more than 60% of the total amount of Internet services. It is called "killer level" technology of broadband Internet application. P2P technology is compared with the traditional client / server model. In the elimination of server bottlenecks and network resource utilization and other aspects of the advantages are obvious. JXTA is the standard of Sun Peer-to-Peer Network (P2P), which provides the basic services for P2P programs. The technology is dedicated to creating a common platform for building specific peer-to-peer and distributed services and applications in a simple and efficient manner. So that developers do not need to think too much about how to solve the technical problems of peer-to-peer computing, but can focus on how to implement and improve the extensible, interoperability and high availability of high-level applications. Lucene is an open source project that provides a relatively complete function of text retrieval. On this basis, developers can quickly achieve full-text retrieval function. This paper first introduces the development trend and key technologies of search engines, and then analyzes the challenges faced by traditional search engines and the advantages of P2P based search engines. Then, this paper studies the key technology of P2P search engine, JXTA technology and Lucene retrieval tool, including the basic concept and protocol specification of JXTA, and then makes a research on Lucene technology, including its characteristics. As well as some practical classes to carry on the detailed research introduction. Then this paper introduces the basic idea of request routing algorithm and discusses the search routing algorithm based on k- high frequency word topic correlation. In this paper, the prototype of P2P based full-text search engine is established, the workflow of the system is described, and the specific design of each module is introduced in detail. Based on the detailed study of the correlation, this paper implements the function of each module of the document retrieval system based on P2P, and tests the function of the system. Finally, this paper summarizes the research of this topic, and makes a simple prospect for the future research.
【學(xué)位授予單位】:吉林大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2013
【分類號(hào)】:TP391.3
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 程立考;李紹靜;;對(duì)等網(wǎng)絡(luò)的研究與應(yīng)用[J];電腦與信息技術(shù);2006年04期
2 羅峰;;基于網(wǎng)絡(luò)編碼的P2P網(wǎng)絡(luò)系統(tǒng)研究[J];電視技術(shù);2007年02期
3 石友康;;P2P技術(shù)業(yè)務(wù)模式與安全問題探討[J];電信網(wǎng)技術(shù);2007年03期
4 曾楚軒;;P2P應(yīng)用技術(shù)發(fā)展淺析[J];電信網(wǎng)技術(shù);2007年03期
5 孫力;陳蘭;袁媛;;基于節(jié)點(diǎn)興趣的非結(jié)構(gòu)化P2P搜索機(jī)制[J];計(jì)算機(jī)工程;2009年23期
6 戴明堅(jiān);張大方;;書面漢語自動(dòng)分詞技術(shù)與實(shí)現(xiàn)[J];計(jì)算技術(shù)與自動(dòng)化;1990年03期
7 盛明超;張代遠(yuǎn);;純P2P在私網(wǎng)中的應(yīng)用[J];計(jì)算機(jī)時(shí)代;2008年05期
8 莊雷;常玉存;董西廣;;一種P2P文件共享系統(tǒng)中的激勵(lì)機(jī)制[J];計(jì)算機(jī)應(yīng)用研究;2009年01期
9 葉劍虹;孫世新;張運(yùn)生;周益民;;基于P2P的自組織網(wǎng)絡(luò)路由算法研究[J];計(jì)算機(jī)應(yīng)用研究;2009年01期
10 黃昌寧,張小鳳;自然語言處理技術(shù)的三個(gè)里程碑[J];外語教學(xué)與研究;2002年03期
本文編號(hào):1944503
本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/1944503.html