面向主題的元搜索引擎技術(shù)研究與系統(tǒng)實(shí)現(xiàn)
[Abstract]:With the explosive growth of information on the Internet, users are often unable to find the information they want quickly and accurately in the face of the vast information world on the Internet. The emergence of traditional search engines has solved the problem of Internet information retrieval to a certain extent, but the precision of the main search engines is not high, and the algorithms and search ranges adopted by different search engines are different. As a result, search results are very different. according to the Chinese search engine user behavior study and market research report, the repetition rate of search results of several major search engines, such as Google, Baidu, Yahoo and so on, is less than 34%. If we want to obtain more comprehensive and accurate results, we have to convert each other between search engines and repeatedly call multiple search engines, which brings inconvenience to our accelerated pace of life. Meta search engine emerges as the times require, and its search results are more comprehensive than the traditional search engine, which makes the meta search engine develop rapidly. Meta search engine is a kind of network retrieval tool which integrates multiple member search engines, which makes the coverage of search results wide and the recall rate higher. However, meta-search engines, like traditional search engines, do not take into account the personalized needs of users. Personalized meta-search engine combines the advantages of recall rate of meta-search engine and precision rate of personalized technology, and effectively improves the shortcomings of the existing search engine. This paper first analyzes the shortcomings and shortcomings of the traditional search engine, summarizes the meta-search engine and personalized technology, and summarizes the related theories and technologies. The research status and development trend of meta-search engine technology and personalized technology are summarized. Then the related theory and technology of personalized search engine are deeply studied, and the comparison and analysis are carried out. On the basis of the above theoretical and technical analysis, this paper puts forward the related algorithms of personalized meta-search engine: for the search of topics with certain rules in HTML source code, In this paper, a personalized meta-search engine algorithm for telephone number query is designed on the basis of meta-search engine. It makes use of the high recall rate of meta-search engine and realizes the function of extracting telephone number and user information with telephone number. All other unrelated information was eliminated. Aiming at the irregular topic search, this paper proposes a new topic search framework, which is based on the topic dictionary to simplify the subject words and obtain an orderly topic vocabulary. In addition, the structural characteristics of the web page are considered in the correlation calculation method of the web page. According to the browsing history on different user hosts, tasklist. is used in this paper. Exe tracks the search history of users. At the same time, this method tracks different browsers. After processing the tracking results to a certain extent, the function of browsing history query for users is realized through the interface of human-computer interaction. Finally, a prototype system is designed and implemented to meet the above requirements.
【學(xué)位授予單位】:天津理工大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2013
【分類號(hào)】:TP391.3
【參考文獻(xiàn)】
相關(guān)期刊論文 前8條
1 張志強(qiáng),邢春曉,周立柱,孫靜,錢乾;SESQ系統(tǒng)的一種查詢優(yōu)化策略[J];計(jì)算機(jī)研究與發(fā)展;2004年10期
2 單松巍,馮是聰,李曉明;幾種典型特征選取方法在中文網(wǎng)頁分類上的效果比較[J];計(jì)算機(jī)工程與應(yīng)用;2003年22期
3 龐劍鋒,卜東波,白碩;基于向量空間模型的文本自動(dòng)分類系統(tǒng)的研究與實(shí)現(xiàn)[J];計(jì)算機(jī)應(yīng)用研究;2001年09期
4 徐瑩;;搜索引擎技術(shù)及其發(fā)展前瞻[J];科技情報(bào)開發(fā)與經(jīng)濟(jì);2005年24期
5 王自強(qiáng),馮博琴;Web信息查詢優(yōu)化的遺傳算法[J];控制與決策;2005年02期
6 李廣建,黃];元搜索引擎及其主要技術(shù)[J];情報(bào)科學(xué);2002年02期
7 原福永;梁順攀;;元搜索引擎的現(xiàn)狀與發(fā)展[J];計(jì)算機(jī)工程與設(shè)計(jì);2005年12期
8 王美霞;李玉坤;肖迎元;;一種新型垂直搜索引擎構(gòu)建方法[J];天津理工大學(xué)學(xué)報(bào);2012年Z1期
相關(guān)碩士學(xué)位論文 前4條
1 王春艷;元搜索引擎的研究與實(shí)現(xiàn)[D];吉林大學(xué);2011年
2 李盛韜;基于主題的Web信息采集技術(shù)研究[D];中國科學(xué)院研究生院(計(jì)算技術(shù)研究所);2002年
3 張園園;基于用戶興趣的個(gè)性化搜索引擎的分析與研究[D];燕山大學(xué);2006年
4 胡升澤;個(gè)性化元搜索引擎若干關(guān)鍵技術(shù)研究[D];國防科學(xué)技術(shù)大學(xué);2008年
本文編號(hào):2492050
本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/2492050.html