天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 科技論文 > 搜索引擎論文 >

基于元搜索引擎的垂直搜索子系統(tǒng)的設(shè)計(jì)

發(fā)布時(shí)間:2018-04-24 04:07

  本文選題:隱形關(guān)鍵詞 + 垂直搜索。 參考:《天津大學(xué)》2012年碩士論文


【摘要】:垂直搜索引擎是搜索引擎發(fā)展的新階段。對于搜索引擎的未來發(fā)展和具體研究而言,這是一個(gè)必然的趨勢。當(dāng)前垂直搜索引擎的系統(tǒng)結(jié)構(gòu)和傳統(tǒng)上的全文搜索引擎非常相似,能夠較高水平處理專業(yè)相關(guān)度,不過作為垂直搜索引擎,在某些問題方面和傳統(tǒng)上的全文搜索引擎相同,比如較低水平的查全率、消耗的網(wǎng)絡(luò)資源太多等等。對于存在的這種問題,本文的解決方案為:垂直搜索引擎之具體的系統(tǒng)結(jié)構(gòu)要建立在元搜索之上。借助于這種技術(shù),能夠很好的提升查全率,不過對應(yīng)的專業(yè)相關(guān)度也呈現(xiàn)較為明顯的下降趨勢。實(shí)驗(yàn)結(jié)果告訴我們,這種新的系統(tǒng)功能比較強(qiáng)大,對于垂直搜索引擎期望自己可以達(dá)到的相關(guān)功能,新系統(tǒng)都可以實(shí)現(xiàn)。本研究的主要內(nèi)容包括了如下幾個(gè)方面: 1.目前的垂直搜索引擎查全率較低,由于元搜索引擎有較高的查全率,我們設(shè)計(jì)了一種垂直搜索引擎,采用的方法收集的信息,這是根據(jù)meta-search引擎。該系統(tǒng)增加了信息的收集和分析適應(yīng)需求的垂直搜索引擎。 2.對于搜索引擎來講,其最基本的基礎(chǔ)功能就是進(jìn)行信息收集。當(dāng)前的垂直搜索引擎在這個(gè)功能角度上存在的主要問題為:較低水平的網(wǎng)絡(luò)信息覆蓋率,,收集到的多為無效的信息等等。據(jù)此,本研究提出的解決方法為建立在對用戶的具體瀏覽時(shí)間進(jìn)行統(tǒng)計(jì)的基礎(chǔ)之上進(jìn)行信息收集,這種信息收集方法也是建立在元搜索引擎技術(shù)之上的,借助于這種信息收集方法,也能夠收集到用戶給予了較高關(guān)注度的信息。這種技術(shù),不但能夠提升了信息覆蓋率,而且對于被收集的相關(guān)信息,也能夠提升專業(yè)相關(guān)度。 3.對于搜索引擎來講,其核心在于信息檢索。在分析收集到的信息的時(shí)候,通過將數(shù)據(jù)挖掘引入其中,本文獲得了關(guān)鍵詞和較高滿意度查詢結(jié)果之間的具體規(guī)則。借助于此,本文提出了一個(gè)新的概念,即隱形關(guān)鍵詞。經(jīng)過實(shí)驗(yàn),可以得知隱形關(guān)鍵詞的使用,一方面很好的提升了在專業(yè)方面,系統(tǒng)查詢結(jié)果的相關(guān)度。 4.對于搜索結(jié)果,用戶關(guān)心最多的是之前的結(jié)果,故而作為搜索引擎,一個(gè)必須關(guān)注、也是必須重視的問題就是對結(jié)果如何進(jìn)行排序。當(dāng)前,元搜索引擎在進(jìn)行結(jié)果排序的時(shí)候,使用的相關(guān)信息非常少,也不能對結(jié)果相關(guān)度給與很好的保證;诖,本文也進(jìn)行了改進(jìn),本文提出的結(jié)果排序方法與系統(tǒng)相契合,同時(shí)因?yàn)樗阉髦袑㈦[性關(guān)鍵詞引入,所以對于位置排序算法也進(jìn)行了很好的改進(jìn),同時(shí)專業(yè)相關(guān)度的搜索結(jié)果也更為準(zhǔn)確。 總體來講,本文的問題解決方案為:建立在元搜索技術(shù)之上的垂直搜索引擎,在某種程度上優(yōu)化了垂直搜索引擎,筆者在本文運(yùn)用了一種新思路和新方法進(jìn)行探討。
[Abstract]:Vertical search engine is a new stage of search engine development. For the future development of search engines and specific research, this is an inevitable trend. The current system structure of vertical search engine is very similar to that of traditional full-text search engine, and it can deal with professional relevance at a high level, but as a vertical search engine, it is the same as traditional full-text search engine in some aspects. For example, the low level of recall, consuming too much network resources and so on. For this problem, the solution of this paper is: the specific system structure of vertical search engine should be based on meta search. With the help of this technique, the recall rate can be improved very well, but the relative professional correlation also shows an obvious downward trend. The experimental results show that the new system is quite powerful and can be implemented for the related functions that the vertical search engine expects it to achieve. The main contents of this study include the following: 1. At present, the vertical search engine has a low recall rate. Because the meta search engine has a high recall rate, we design a vertical search engine, which uses the method to collect information, which is based on the meta-search engine. The system adds information collection and analysis to meet the needs of the vertical search engine. 2. For search engines, its basic function is to collect information. The main problems of the current vertical search engine in this functional angle are: low level of network information coverage, collected mostly invalid information and so on. Therefore, the solution proposed in this study is to collect information on the basis of statistics on the specific browsing time of users, and this information collection method is also based on meta-search engine technology. With the help of this information collection method, users can also collect information with a high degree of attention. This technology can not only improve the information coverage, but also enhance the relevance of the collected information. 3. For search engine, its core is information retrieval. In the analysis of the collected information, by introducing data mining into it, this paper obtains the specific rules between keywords and higher satisfaction query results. With the help of this, this paper proposes a new concept, namely the stealth keyword. Through experiments, we can know the use of hidden keywords, on the one hand, improve the relevance of the system query results in the professional. 4. For search results, users are most concerned about the results before, so as a search engine, a must pay attention to, and must pay attention to is how to sort the results. At present, the meta-search engine uses very little information when sorting the results, and it can not guarantee the relevance of the results. Based on this, this paper also improved, the result sort method proposed in this paper coincides with the system, at the same time, because the hidden key words are introduced in the search, the location sorting algorithm is also improved very well. At the same time professional relevance of the search results are more accurate. In general, the solution of this paper is: the vertical search engine based on meta-search technology, to some extent, optimize the vertical search engine, the author uses a new way of thinking and new method to discuss in this paper.
【學(xué)位授予單位】:天津大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2012
【分類號】:TP391.3

【參考文獻(xiàn)】

相關(guān)期刊論文 前1條

1 楊成明;情報(bào)檢索中的雙層B+樹算法探討[J];情報(bào)學(xué)報(bào);1997年S1期



本文編號:1795055

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/1795055.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶ac342***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請E-mail郵箱bigeng88@qq.com