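Research on Vertical Search Engine Technology for the Petroleum Industry
Topic: vertical search + information retrieval; Source: University of Electronic Science and Technology of China, 2013 master's thesis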
[Abstract]: A vertical search engine is a specialized search engine for a single industry, a subdivision and extension of general search. Compared with the unordered mass of information returned by general-purpose search engines, a vertical search engine is more focused, specific, and in-depth. As oilfield informatization has progressed, oilfields have built their own information networks and information management systems for different disciplines. The websites on these information networks are very large in scale and rich in content. A search engine system is therefore urgently needed that retrieves useful information from petroleum-related web pages on a restricted portion of the Internet and on the enterprise intranet, processes it according to the particular needs of the petroleum industry, and conveniently delivers the required information to oilfield network users and to research and production staff.

This thesis designs the processing flow of a vertical search engine for the petroleum industry and implements image search, web page search, and forum search through information crawling, processing, and indexing. A petroleum encyclopedia knowledge lexicon was built by collecting and organizing industry vocabulary. A petroleum vertical search engine system was developed, comprising all four subsystems: the searcher (crawler), the processor, the indexer, and the retriever. The system extracts images automatically for image search, improves timeliness through targeted web page search, and improves retrieval accuracy through a separate forum search, resulting in a personalized, intelligent network information collection tool based on natural language understanding. According to the thesis, China had no specialized search engine for the petroleum industry at the time of writing. Applying this technology addresses existing problems in website information collection and retrieval and improves both website maintenance efficiency and web page query efficiency. By serving the various petroleum disciplines, petroleum practitioners, and industry-specific needs with "specialized, refined, and deep" information and services of professional value, the search engine is intended to become the most authoritative and most professional information engine in the petroleum system.

The system automates information collection, gives website users a new search experience, lets users find the information they most need in the shortest time, and fulfills the roles of both search engine and navigation. Practice has shown that the system works well in application and has played an important role in raising the level of information collection and retrieval on oilfield websites.
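The four subsystems named in the abstract (searcher, processor, indexer, retriever) can be illustrated with a minimal sketch. This is not the thesis's implementation, which the abstract does not describe in detail. The host whitelist, the lexicon entries, the sample pages, and the term-frequency ranking are all assumptions chosen for illustration, and the "searcher" filters an in-memory set of pages rather than crawling a network.

```python
import re
from collections import Counter, defaultdict
from urllib.parse import urlparse

# Hypothetical crawl scope and domain lexicon; the thesis does not list
# its actual sources or vocabulary.
ALLOWED_HOSTS = {"intranet.example-oilfield.cn", "www.example-petro.com"}
LEXICON = {"drilling fluid", "well logging", "oil reservoir"}

def searcher(pages):
    """Searcher: keep only pages whose host lies inside the restricted scope."""
    return {url: html for url, html in pages.items()
            if urlparse(url).hostname in ALLOWED_HOSTS}

def processor(html):
    """Processor: strip tags, then emit lexicon phrases plus single words."""
    text = re.sub(r"<[^>]+>", " ", html).lower()
    text = re.sub(r"\s+", " ", text)
    terms = []
    for phrase in LEXICON:
        # Domain phrases are indexed as single terms, once per occurrence.
        terms += [phrase] * text.count(phrase)
    terms += re.findall(r"[a-z0-9]+", text)
    return terms

def indexer(docs):
    """Indexer: build an inverted index mapping term -> {url: term count}."""
    index = defaultdict(Counter)
    for url, html in docs.items():
        for term, count in Counter(processor(html)).items():
            index[term][url] = count
    return index

def retriever(index, query):
    """Retriever: rank pages by the summed counts of the query's terms."""
    scores = Counter()
    for term in processor(query):
        for url, count in index.get(term, {}).items():
            scores[url] += count
    return [url for url, _ in scores.most_common()]

# Hypothetical sample pages; the third lies outside the crawl scope.
pages = {
    "http://intranet.example-oilfield.cn/a":
        "<p>Drilling fluid design for deep wells</p>",
    "http://www.example-petro.com/b":
        "<h1>Well logging</h1><p>Logging tools and drilling notes</p>",
    "http://unrelated.example.org/c":
        "<p>Drilling fluid sale</p>",
}

index = indexer(searcher(pages))
results = retriever(index, "drilling fluid")
print(results)
```

In this sketch the page that contains the full lexicon phrase ranks ahead of the page that only mentions "drilling", and the out-of-scope page never enters the index. A production system would replace the simple count-based score with a proper ranking function and would need a Chinese word segmenter driven by the domain lexicon.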
[Degree-granting institution]: University of Electronic Science and Technology of China
[Degree level]: Master's
[Year degree conferred]: 2013
[Classification number]: TP391.3
Article ID: 1857868
Article link: http://sikaile.net/kejilunwen/sousuoyinqinglunwen/1857868.html