天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 科技論文 > 搜索引擎論文 >

面向主題的元搜索引擎技術(shù)研究與系統(tǒng)實(shí)現(xiàn)

發(fā)布時(shí)間:2019-06-03 15:59
【摘要】:隨著Internet上信息的爆炸式增長,使得用戶面對(duì)Internet上浩如煙海的信息世界,往往無法快速準(zhǔn)確地找到自己想要的信息,傳統(tǒng)搜索引擎的出現(xiàn)在一定程度上解決了互聯(lián)網(wǎng)信息檢索的問題,但是當(dāng)前主要的搜索引擎的查準(zhǔn)率不高,并且不同的搜索引擎所采用的算法和搜索范圍不同,導(dǎo)致搜索結(jié)果有很大差異,經(jīng)中國搜索引擎用戶行為研究報(bào)告和市場(chǎng)調(diào)查報(bào)告顯示,谷歌、百度、雅虎等幾大搜索引擎的搜索結(jié)果重復(fù)率不到34%,如果想獲得比較全面而又準(zhǔn)確的結(jié)果,就不得不在各個(gè)搜索引擎之間相互轉(zhuǎn)換,反復(fù)調(diào)用多個(gè)搜索引擎,這給我們加快的生活節(jié)奏帶來了不便。元搜索引擎應(yīng)運(yùn)而生,它的搜索結(jié)果相對(duì)傳統(tǒng)的搜索引擎較全面,使得元搜索引擎得到快速的發(fā)展。 元搜索引擎是一種集成多個(gè)成員搜索引擎的網(wǎng)絡(luò)檢索工具,這使得搜索結(jié)果的覆蓋范圍較廣,查全率較高。但是元搜索引擎同傳統(tǒng)的搜索引擎一樣沒有考慮用戶的個(gè)性化需求。個(gè)性化元搜索引擎結(jié)合元搜索引擎的查全率和個(gè)性化技術(shù)的查準(zhǔn)率的優(yōu)點(diǎn),有效地改善了現(xiàn)有的搜索引擎的不足之處。 本文首先分析了傳統(tǒng)搜索引擎的缺點(diǎn)和不足,對(duì)元搜索引擎和個(gè)性化技術(shù)進(jìn)行了概述,通過對(duì)相關(guān)理論和技術(shù)的綜述,總結(jié)了元搜索引擎技術(shù)和個(gè)性化技術(shù)的研究現(xiàn)狀和發(fā)展趨勢(shì)。然后深入研究了實(shí)現(xiàn)個(gè)性化搜索引擎的相關(guān)理論和技術(shù),并進(jìn)行了對(duì)比和分析。在上述理論和技術(shù)分析的基礎(chǔ)上提出了個(gè)性化元搜索引擎的相關(guān)算法: 針對(duì)在HTML源代碼中存在一定規(guī)律的主題的搜索,本文在元搜索引擎的基礎(chǔ)上設(shè)計(jì)了針對(duì)電話號(hào)碼查詢的個(gè)性化元搜索引擎算法,它利用了元搜索引擎查全率高的特點(diǎn),實(shí)現(xiàn)了提取電話號(hào)碼和擁有電話號(hào)碼的用戶信息的功能,,而其它的無關(guān)信息全部被剔除。 針對(duì)無規(guī)律可循的主題搜索,本文提出了一種新型的主題搜索框架,這種框架基于主題詞典進(jìn)行主題詞精簡(jiǎn),得到有序的主題詞匯表;另外在網(wǎng)頁的相關(guān)度計(jì)算方法中考慮了網(wǎng)頁的結(jié)構(gòu)特征。 針對(duì)不同用戶主機(jī)上的瀏覽歷史記錄,本文采用tasklist. exe對(duì)用戶的搜索歷史進(jìn)行追蹤,本方法同時(shí)對(duì)不同的瀏覽器進(jìn)行追蹤,對(duì)其追蹤的結(jié)果進(jìn)行一定的處理之后,通過人機(jī)交互的界面實(shí)現(xiàn)針對(duì)用戶瀏覽歷史記錄查詢的功能。 最后設(shè)計(jì)并實(shí)現(xiàn)了針對(duì)上述要求的原型系統(tǒng)。
[Abstract]:With the explosive growth of information on the Internet, users are often unable to find the information they want quickly and accurately in the face of the vast information world on the Internet. The emergence of traditional search engines has solved the problem of Internet information retrieval to a certain extent, but the precision of the main search engines is not high, and the algorithms and search ranges adopted by different search engines are different. As a result, search results are very different. according to the Chinese search engine user behavior study and market research report, the repetition rate of search results of several major search engines, such as Google, Baidu, Yahoo and so on, is less than 34%. If we want to obtain more comprehensive and accurate results, we have to convert each other between search engines and repeatedly call multiple search engines, which brings inconvenience to our accelerated pace of life. Meta search engine emerges as the times require, and its search results are more comprehensive than the traditional search engine, which makes the meta search engine develop rapidly. Meta search engine is a kind of network retrieval tool which integrates multiple member search engines, which makes the coverage of search results wide and the recall rate higher. However, meta-search engines, like traditional search engines, do not take into account the personalized needs of users. Personalized meta-search engine combines the advantages of recall rate of meta-search engine and precision rate of personalized technology, and effectively improves the shortcomings of the existing search engine. This paper first analyzes the shortcomings and shortcomings of the traditional search engine, summarizes the meta-search engine and personalized technology, and summarizes the related theories and technologies. The research status and development trend of meta-search engine technology and personalized technology are summarized. Then the related theory and technology of personalized search engine are deeply studied, and the comparison and analysis are carried out. On the basis of the above theoretical and technical analysis, this paper puts forward the related algorithms of personalized meta-search engine: for the search of topics with certain rules in HTML source code, In this paper, a personalized meta-search engine algorithm for telephone number query is designed on the basis of meta-search engine. It makes use of the high recall rate of meta-search engine and realizes the function of extracting telephone number and user information with telephone number. All other unrelated information was eliminated. Aiming at the irregular topic search, this paper proposes a new topic search framework, which is based on the topic dictionary to simplify the subject words and obtain an orderly topic vocabulary. In addition, the structural characteristics of the web page are considered in the correlation calculation method of the web page. According to the browsing history on different user hosts, tasklist. is used in this paper. Exe tracks the search history of users. At the same time, this method tracks different browsers. After processing the tracking results to a certain extent, the function of browsing history query for users is realized through the interface of human-computer interaction. Finally, a prototype system is designed and implemented to meet the above requirements.
【學(xué)位授予單位】:天津理工大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2013
【分類號(hào)】:TP391.3

【參考文獻(xiàn)】

相關(guān)期刊論文 前8條

1 張志強(qiáng),邢春曉,周立柱,孫靜,錢乾;SESQ系統(tǒng)的一種查詢優(yōu)化策略[J];計(jì)算機(jī)研究與發(fā)展;2004年10期

2 單松巍,馮是聰,李曉明;幾種典型特征選取方法在中文網(wǎng)頁分類上的效果比較[J];計(jì)算機(jī)工程與應(yīng)用;2003年22期

3 龐劍鋒,卜東波,白碩;基于向量空間模型的文本自動(dòng)分類系統(tǒng)的研究與實(shí)現(xiàn)[J];計(jì)算機(jī)應(yīng)用研究;2001年09期

4 徐瑩;;搜索引擎技術(shù)及其發(fā)展前瞻[J];科技情報(bào)開發(fā)與經(jīng)濟(jì);2005年24期

5 王自強(qiáng),馮博琴;Web信息查詢優(yōu)化的遺傳算法[J];控制與決策;2005年02期

6 李廣建,黃];元搜索引擎及其主要技術(shù)[J];情報(bào)科學(xué);2002年02期

7 原福永;梁順攀;;元搜索引擎的現(xiàn)狀與發(fā)展[J];計(jì)算機(jī)工程與設(shè)計(jì);2005年12期

8 王美霞;李玉坤;肖迎元;;一種新型垂直搜索引擎構(gòu)建方法[J];天津理工大學(xué)學(xué)報(bào);2012年Z1期

相關(guān)碩士學(xué)位論文 前4條

1 王春艷;元搜索引擎的研究與實(shí)現(xiàn)[D];吉林大學(xué);2011年

2 李盛韜;基于主題的Web信息采集技術(shù)研究[D];中國科學(xué)院研究生院(計(jì)算技術(shù)研究所);2002年

3 張園園;基于用戶興趣的個(gè)性化搜索引擎的分析與研究[D];燕山大學(xué);2006年

4 胡升澤;個(gè)性化元搜索引擎若干關(guān)鍵技術(shù)研究[D];國防科學(xué)技術(shù)大學(xué);2008年



本文編號(hào):2492050

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/2492050.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶2738a***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com