Research on Adaptive Network Information Acquisition Service Technology

Published: 2018-09-09 17:29

[Abstract]: The development of network technology has greatly enriched the information resources available online, but the disorder and uneven quality of those resources also make it difficult for users to find what they need. A network information acquisition service addresses the information needs of individual Internet users by using modern information technology to deliver the Internet information products and services they require; its service modes are information pull and information push. Adaptive network information acquisition service technology dynamically adjusts its own behavior according to factors such as user needs, information source characteristics, and system load, so that it delivers high-quality information efficiently and in a user-friendly way.
Accurately and comprehensively perceiving user needs is the foundation of a network information acquisition service. Network users both consume and provide network information resources, so their needs can be inferred by analyzing the content they browse, their behavior, and the information they publish. Once the needs are known, the key to a successful service is to filter the relevant information out of the vast body of network resources and present it to users in a more user-friendly way. In addition, users usually expect information to be delivered promptly, so guaranteeing the performance of the information acquisition system is also an important research topic for these services.
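To address these problems, this thesis first proposes an adaptive information pull technique based on measuring query ambiguity. The ambiguity of each user request is measured, and the way results are presented is chosen adaptively according to that ambiguity. For result filtering and presentation, a multi-feature fusion ranking algorithm and a clustering algorithm are proposed, respectively. The technique is validated on two representative emerging kinds of Internet resources: multimedia information (using images as the example) and frequently updated dynamic resources (using blogs as the example).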
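The abstract does not say how query ambiguity is measured. As a minimal sketch only, one common proxy is the Shannon entropy of the topic labels attached to the top results for a query: a query whose results spread over many topics is treated as ambiguous and shown as clusters, otherwise as a plain ranked list. The function names, the topic labels, and the threshold of 1.0 bit below are all assumptions made for illustration, not the thesis's method.

```python
import math
from collections import Counter

def ambiguity(labels):
    """Shannon entropy (in bits) of the topic-label distribution.

    `labels` holds one hypothetical topic label per top-ranked result.
    """
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    # Sum p * log2(1/p) over the observed topics.
    return sum((c / total) * math.log2(total / c) for c in counts.values())

def choose_presentation(labels, threshold=1.0):
    """Pick a display mode from the measured ambiguity (threshold is assumed)."""
    return "clustered" if ambiguity(labels) >= threshold else "ranked_list"

if __name__ == "__main__":
    # Results for an ambiguous query split evenly over two topics.
    print(choose_presentation(["fruit", "fruit", "company", "company"]))
    # Results for a query dominated by one topic.
    print(choose_presentation(["company", "company", "company", "fruit"]))
```

Entropy is used here only because it is simple to compute from result labels; any other ambiguity score could be plugged into `choose_presentation` in the same way.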
Second, the thesis proposes adaptive information push techniques based on personalized modeling, one for information publishers and one for information browsers. For publishers, the blog, a currently popular personalized publishing platform, serves as the research environment: a method is proposed that uses blog posts to model users' long-term and short-term interests, and the blogosphere is partitioned into communities so that friends with similar interests can be recommended. For browsers, the content of the web page currently being viewed is taken as a representation of the user's personal information, and a contextual advertising recommendation technique based on sentiment and topic analysis is proposed, so that the pushed advertisements are not only topically relevant but also consistent with the user sentiment latent in the page content, making them better targeted.
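The idea of long-term and short-term interest modeling from blog posts can be illustrated with a small sketch. It assumes, purely for illustration, that interests are term-frequency vectors, that the short-term profile comes from the most recent posts, that the two profiles are blended with a weight `alpha`, and that friends are recommended by cosine similarity. None of these choices is stated in the abstract, and the community partitioning step is not covered here.

```python
import math
from collections import Counter

def profile(posts):
    """Normalised term-frequency vector over whitespace-separated tokens."""
    counts = Counter(w for p in posts for w in p.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}

def interest_model(posts, recent=2, alpha=0.5):
    """Blend a long-term profile (all posts) with a short-term one.

    `recent` (a positive count of latest posts) and `alpha` are assumed knobs.
    """
    long_term = profile(posts)
    short_term = profile(posts[-recent:])
    words = set(long_term) | set(short_term)
    return {w: alpha * long_term.get(w, 0.0)
               + (1 - alpha) * short_term.get(w, 0.0) for w in words}

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(x * v.get(w, 0.0) for w, x in u.items())
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def recommend_friends(target, others, k=1):
    """Return the names of the k bloggers most similar to `target`."""
    ranked = sorted(others, key=lambda n: cosine(target, others[n]), reverse=True)
    return ranked[:k]

if __name__ == "__main__":
    alice = interest_model(["python search index", "python ranking"])
    others = {
        "bob": interest_model(["python index ranking"]),
        "carol": interest_model(["cooking pasta recipes"]),
    }
    print(recommend_friends(alice, others))
```

In this toy data the blogger who shares vocabulary with the target ranks first, and a blogger with no shared terms gets a similarity of zero.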
Next, to meet the performance and scalability requirements of network information acquisition services, the thesis takes the search engine, a typical application of information pull services, as its entry point and proposes a hybrid distributed index organization strategy (Loc-Glob) with good scalability. Performance is then optimized on top of Loc-Glob: indexes are redistributed and replicated according to index-term load and the dynamically changing query stream, and query paths are adaptively optimized according to the real-time system load of the index servers.
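The abstract gives no details of Loc-Glob's internals, so the following is only a sketch of the general idea of load-aware query routing over replicated index entries: each query term is sent to the least-loaded server that holds a replica of its posting list. The `replicas` and `load` structures, the server names, and the unit cost per assigned term are hypothetical.

```python
def route_query(terms, replicas, load):
    """Map each query term to the least-loaded server holding its postings.

    terms    -- list of query terms
    replicas -- dict: term -> list of servers storing that term's postings
    load     -- dict: server -> current load (any comparable number)
    """
    load = dict(load)  # work on a copy so the caller's view is unchanged
    plan = {}
    for term in terms:
        servers = replicas.get(term, [])
        if not servers:
            continue  # term is not indexed on any server
        best = min(servers, key=lambda s: load[s])
        plan[term] = best
        load[best] += 1  # assumed unit cost for the work just assigned
    return plan

if __name__ == "__main__":
    replicas = {"blog": ["s1", "s2"], "image": ["s2"]}
    load = {"s1": 3, "s2": 1}
    print(route_query(["blog", "image", "video"], replicas, load))
```

Updating the local load estimate after each assignment keeps one query from piling all of its terms onto a server that was idle only at the start.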
Based on this research, the thesis designs and implements a prototype blogosphere information acquisition system that uses adaptive techniques. The system provides several application services, including a blog search engine, blog friend recommendation, and advertisement recommendation, and it verifies the feasibility of the adaptive techniques proposed for the two service modes, information pull and information push.
The thesis concludes with a summary of the research and an outlook on future work.
[Degree-granting institution]: Zhejiang University
[Degree level]: Doctoral
[Year degree conferred]: 2008
[Classification number]: G250.73

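[Citing literature]

Related doctoral dissertations (1)

1 王興中 (Wang Xingzhong); Research on the Data Storage Architecture and Index Structure in Railway Dispatching and Command Systems [D]; China Academy of Railway Sciences; 2011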

Article ID: 2233080


Article link: http://sikaile.net/wenyilunwen/guanggaoshejilunwen/2233080.html
