天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁(yè) > 科技論文 > 搜索引擎論文 >

深度搜索內(nèi)網(wǎng)資源的研究與實(shí)現(xiàn)

發(fā)布時(shí)間:2018-05-19 08:06

  本文選題:搜索引擎 + 信息檢索。 參考:《電子科技大學(xué)》2013年碩士論文


【摘要】:隨著時(shí)間流逝,技術(shù)也在迅猛發(fā)展。無(wú)數(shù)計(jì)算機(jī)領(lǐng)域的創(chuàng)新,都給參與的開(kāi)發(fā)人員和用戶帶來(lái)了巨大的推動(dòng)作用。信息的獲取成為了人們從事計(jì)算機(jī)事業(yè)一個(gè)主要的研究方向。談到如何獲取信息,傳統(tǒng)的搜索引擎已經(jīng)為人熟知,互聯(lián)網(wǎng)上信息資源也更多的集中在web中。然而在內(nèi)網(wǎng)中,信息資源不單純以web的形式被保存,它們更多的保存在各種類型的文檔和數(shù)據(jù)庫(kù)中,因此用戶的需求變得更加多樣和具體。僅僅將傳統(tǒng)的搜索引擎應(yīng)用到復(fù)雜的內(nèi)網(wǎng)中是不夠的。內(nèi)網(wǎng)環(huán)境對(duì)安全性以及資源的全面性有著更高的要求。安全的以及全方位的搜索各種結(jié)構(gòu)化和非結(jié)構(gòu)化乃至半結(jié)構(gòu)化的資源成為了內(nèi)網(wǎng)資源的搜索的重點(diǎn)。 基于web的傳統(tǒng)資源搜索主要包括資源的爬行,索引的建立,檢索以及結(jié)果的排序。內(nèi)網(wǎng)資源搜索建立的步驟同其類似,但是同傳統(tǒng)的網(wǎng)頁(yè)搜索不同,內(nèi)網(wǎng)資源的搜索要在安全性和深度上同傳統(tǒng)搜索加以區(qū)分。傳統(tǒng)的搜索方式對(duì)于訪問(wèn)策略沒(méi)有加以規(guī)定。但是在特定的內(nèi)網(wǎng)中,并不是所有用戶搜索同一資源都會(huì)得到相同的結(jié)果。采用安全策略的搜索引擎需要根據(jù)用戶的身份對(duì)結(jié)果進(jìn)行掩飾,因此需要在搜索引擎中制定相應(yīng)的安全策略。在對(duì)資源的爬行過(guò)程中,當(dāng)訪問(wèn)到非web形式的資源時(shí)需要根據(jù)特定的接口將文件加以處理提取出文本并加以索引。這就是所謂的搜索上的深度要求。 本文的主要工作包括以下幾個(gè)方面:首先,對(duì)比傳統(tǒng)的搜索引擎以及其模塊的設(shè)計(jì),給我們提高良好的理論基礎(chǔ),并幫助我們更一步的了解適用于內(nèi)網(wǎng)資源搜索的軟件所具備的基本功能以及實(shí)施的難點(diǎn)。闡述內(nèi)網(wǎng)資源搜索引擎的各個(gè)主要模塊的工作原理以及實(shí)現(xiàn)方案,包括文檔的搜集,索引結(jié)構(gòu)的建立以及搜索結(jié)果的呈現(xiàn)。其次,對(duì)安全策略以及深度搜索進(jìn)行重點(diǎn)介紹,這兩大關(guān)鍵突出的兩大部分是系統(tǒng)設(shè)計(jì)的創(chuàng)新點(diǎn)所在。安全策略保證了信息的安全性,很好的適用于對(duì)權(quán)限要求較高的復(fù)雜的內(nèi)網(wǎng)。深度搜索保證了信息獲取的全面性并且給予了系統(tǒng)的良好的擴(kuò)展性。最后,,對(duì)實(shí)驗(yàn)結(jié)果進(jìn)行展示以及測(cè)試?偨Y(jié)內(nèi)網(wǎng)資源搜索的意義并提出系統(tǒng)不足以及未來(lái)改進(jìn)的思路。
[Abstract]:With the passage of time, technology is also developing rapidly. Numerous innovations in the computer field have brought a huge boost to the developers and users involved. The acquisition of information has become a major research direction for people engaged in computer business. When it comes to how to obtain information, the traditional search engine is already well known, and the information resources on the Internet are more concentrated in web. However, in the intranet, the information resources are not simply saved in the form of web, they are more stored in various types of documents and databases, so the needs of users become more diverse and specific. It is not enough to apply traditional search engines to complex intranets. The intranet environment has higher requirements for security and the comprehensiveness of resources. Secure and omnidirectional search for all kinds of structured, unstructured and even semi-structured resources has become the focus of the search of intranet resources. Traditional resource search based on web mainly includes crawling, index building, retrieval and result sorting. The procedure of intranet resource search is similar to that of traditional web search, but different from traditional web search, the search of intranet resource should be distinguished from traditional search in terms of security and depth. Traditional search methods do not specify access policy. But in a particular intranet, not all users search for the same resource and get the same results. The search engine adopting security policy needs to cover up the result according to the identity of the user, so it is necessary to formulate the corresponding security policy in the search engine. In the process of crawling resources, when accessing non-web resources, the files should be processed and indexed according to specific interfaces. This is called the search depth requirement. The main work of this paper includes the following aspects: first, compared with the traditional search engine and its module design, give us a good theoretical foundation, It also helps us to understand the basic functions and implementation difficulties of the software which is suitable for the search of intranet resources. This paper describes the working principle and implementation scheme of the main modules of the intranet resource search engine, including the collection of documents, the establishment of index structure and the presentation of search results. Secondly, the security policy and the depth search are introduced emphatically. The two key parts are the innovation of the system design. The security policy ensures the security of information and is well suited to complex intranets with high privilege requirements. The depth search ensures the comprehensiveness of information acquisition and gives the system good expansibility. Finally, the experimental results are displayed and tested. This paper summarizes the significance of intranet resource search and puts forward the ideas of system deficiency and future improvement.
【學(xué)位授予單位】:電子科技大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2013
【分類號(hào)】:TP391.3

【參考文獻(xiàn)】

相關(guān)期刊論文 前10條

1 曲衛(wèi)華;王群;;搜索引擎原理介紹與分析[J];電腦知識(shí)與技術(shù);2006年35期

2 胡風(fēng)華;劉冰;;基于知識(shí)庫(kù)系統(tǒng)的智能搜索引擎研究[J];電腦知識(shí)與技術(shù);2009年11期

3 徐輝;;基于IFilter的非文本文件中抽取文本的關(guān)鍵技術(shù)[J];電腦知識(shí)與技術(shù);2011年27期

4 曲成義;電子政務(wù)安全保障體系探索[J];信息技術(shù)與標(biāo)準(zhǔn)化;2003年11期

5 劉懷宇,李偉琴;淺談訪問(wèn)控制技術(shù)[J];電子展望與決策;1999年01期

6 李雪利;黃理燦;范晨熙;;基于Lucene的文檔管理系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)[J];工業(yè)控制計(jì)算機(jī);2012年10期

7 王繼成,蕭嶸,孫正興,張福炎;Web信息檢索研究進(jìn)展[J];計(jì)算機(jī)研究與發(fā)展;2001年02期

8 沈海波;洪帆;;面向Web服務(wù)的基于屬性的訪問(wèn)控制研究[J];計(jì)算機(jī)科學(xué);2006年04期

9 嚴(yán)悍,張宏,許滿武;基于角色訪問(wèn)控制對(duì)象建模及實(shí)現(xiàn)[J];計(jì)算機(jī)學(xué)報(bào);2000年10期

10 林闖;雷蕾;;下一代互聯(lián)網(wǎng)體系結(jié)構(gòu)研究[J];計(jì)算機(jī)學(xué)報(bào);2007年05期

相關(guān)碩士學(xué)位論文 前2條

1 陳海波;基于自動(dòng)分詞的企業(yè)文檔搜索引擎設(shè)計(jì)與實(shí)現(xiàn)[D];西北工業(yè)大學(xué);2007年

2 張偉;垂直搜索引擎設(shè)計(jì)與實(shí)現(xiàn)[D];西安電子科技大學(xué);2008年



本文編號(hào):1909348

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/1909348.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶b0869***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com