天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 科技論文 > 搜索引擎論文 >

云環(huán)境下個(gè)性化推送搜索引擎的設(shè)計(jì)

發(fā)布時(shí)間:2018-02-21 04:47

  本文關(guān)鍵詞: 個(gè)性化 推送搜索 云計(jì)算 個(gè)性化搜索 主題相關(guān)搜索算法 出處:《北京郵電大學(xué)》2012年碩士論文 論文類型:學(xué)位論文


【摘要】:隨著互聯(lián)網(wǎng)技術(shù)的發(fā)展和普及,大量信息以網(wǎng)站作為載體向經(jīng)濟(jì),社會(huì)和生活的各個(gè)領(lǐng)域提供服務(wù),但是從2001年到2011年互聯(lián)網(wǎng)上的數(shù)據(jù)信息從1萬P增值到1億P,從浩如煙海的信息中快速查找用戶需要的信息成為所有互聯(lián)網(wǎng)用戶的迫切需求。史坦福大學(xué)的幾個(gè)學(xué)生為此做出了巨大貢獻(xiàn),搜索引擎Google的出現(xiàn)迅速改變?nèi)藗冊(cè)械纳暇W(wǎng)習(xí)慣,但是伴隨著互聯(lián)網(wǎng)的進(jìn)一步發(fā)展,尤其分布式云計(jì)算技術(shù)的發(fā)展,傳統(tǒng)的包羅萬象的搜索引擎已經(jīng)不能滿足用戶的需求,在現(xiàn)實(shí)需求的驅(qū)動(dòng)下基于云計(jì)算的個(gè)性化推送搜索服務(wù)技術(shù)誕生了。推送搜索是針對(duì)某一個(gè)特定的需求或一類特定的用戶群的專業(yè)搜索引擎,是傳統(tǒng)搜索引擎的細(xì)分和延伸,是對(duì)網(wǎng)頁庫中的類別信息進(jìn)行分類細(xì)化,即搜索領(lǐng)域的行業(yè)化分工和對(duì)用戶的精確定位和細(xì)化。例如推送服務(wù)是指搜索引擎通過記錄并分析用戶的上網(wǎng)行為,建立多維的學(xué)習(xí)模型。依據(jù)建立的用戶模型,當(dāng)用戶接入互聯(lián)網(wǎng)時(shí),推送搜索引擎可以直接從浩如煙海的信息中過濾用戶需要的信息。于是用戶在互聯(lián)網(wǎng)訪問的任何信息都是針對(duì)他個(gè)人的模型定制且由推送搜索引擎提供的信息。 本文來源于和某電信運(yùn)營商的合作項(xiàng)目,主要完成了以下工作 (1)分析了搜索引擎特別是推送搜索引擎和云計(jì)算計(jì)算的發(fā)展現(xiàn)狀,闡述了相關(guān)技術(shù)的優(yōu)點(diǎn)和前景,介紹了本系統(tǒng)的工作原理和工作流程; (2)根據(jù)電信行業(yè)移動(dòng)互聯(lián)網(wǎng)的發(fā)展趨勢(shì),改進(jìn)了信息搜索的設(shè)計(jì)思想,針對(duì)移動(dòng)互聯(lián)網(wǎng)對(duì)信息精確性和有效性的更高要求,引入關(guān)鍵詞基礎(chǔ)詞庫和基礎(chǔ)拓展; (3)結(jié)合云計(jì)算架構(gòu)強(qiáng)大的存儲(chǔ)和運(yùn)算能力設(shè)計(jì)并實(shí)現(xiàn)了一個(gè)基于網(wǎng)頁數(shù)據(jù)的全文搜索引擎系統(tǒng),實(shí)現(xiàn)網(wǎng)頁分詞統(tǒng)計(jì),用戶個(gè)性化模型,網(wǎng)頁去同質(zhì)化等功能;
[Abstract]:With the development and popularization of Internet technology, a large amount of information takes the website as the carrier to provide services to various fields of economy, society and life. But from 2001 to 2011, the data on the Internet increased from 10,000 P to 100 million P, and it became an urgent need for all Internet users to quickly find the information that users needed from the vast amount of information. The students have made great contributions to this. The emergence of search engine Google changes people's Internet habits rapidly, but with the further development of the Internet, especially the development of distributed cloud computing technology, the traditional all-encompassing search engine can no longer meet the needs of users. Driven by the actual demand, the personalized push search service technology based on cloud computing is born. Push search is a professional search engine aimed at a particular demand or a specific group of users, and it is the subdivision and extension of traditional search engines. It is the classification and refinement of the category information in the web page library, that is, the division of labor in the field of search and the precise location and refinement of the user. For example, push service is a search engine that records and analyzes the user's online behavior by recording and analyzing the user's behavior on the Internet. Establish a multi-dimensional learning model. According to the established user model, when the user is connected to the Internet, The push search engine can filter the information the user needs directly from the vast amount of information, so any information accessed by the user on the Internet is customized for his own model and provided by the push search engine. This paper comes from a cooperation project with a telecom operator, which mainly completes the following work. 1) this paper analyzes the development of search engine, especially push search engine and cloud computing, expounds the advantages and prospects of related technologies, and introduces the working principle and workflow of the system. (2) according to the development trend of mobile Internet in telecom industry, the design idea of information search is improved, and the keyword basic lexicon and basic expansion are introduced to meet the higher requirement of accuracy and validity of information in mobile Internet. 3) Design and implement a full-text search engine system based on web page data combining the powerful storage and computing ability of cloud computing architecture, and realize the functions of page segmentation statistics, user personalization model, page de-homogeneity and so on.
【學(xué)位授予單位】:北京郵電大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:TP391.3

【參考文獻(xiàn)】

相關(guān)期刊論文 前3條

1 王堯;;高頻詞匯提取[J];程序員;2006年09期

2 嚴(yán)威,趙政;開發(fā)中文搜索引擎漢語處理的關(guān)鍵技術(shù)[J];計(jì)算機(jī)工程;1999年06期

3 崔維梅;范榮鵬;;搜索引擎技術(shù)的現(xiàn)狀和熱點(diǎn)[J];青年記者;2006年16期

相關(guān)碩士學(xué)位論文 前1條

1 董超;基于主題信息服務(wù)的垂直搜索引擎的設(shè)計(jì)與實(shí)現(xiàn)[D];北京郵電大學(xué);2010年

,

本文編號(hào):1521037

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/1521037.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶7e2ee***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com