基于搜索引擎的惡意對象發(fā)掘系統(tǒng)的設(shè)計與開發(fā)

發(fā)布時間：2018-06-26 02:24

本文選題：搜索引擎 + 惡意軟件　；參考：《山東大學(xué)》2013年碩士論文

【摘要】：惡意對象發(fā)掘系統(tǒng)是卡巴斯基公司針對現(xiàn)有的樣本收集和分析處理系統(tǒng)的一次研究性嘗試,其方向符合未來殺毒行業(yè)發(fā)展的基本趨勢。其中涉及到多個學(xué)科和業(yè)行的技術(shù),是一個典型的利用多學(xué)科知識交叉實現(xiàn)的系統(tǒng)。如搜索引擎技術(shù)、分布式系統(tǒng)并行處理架構(gòu)、機器學(xué)習(xí)和虛擬機系統(tǒng)等。系統(tǒng)摒棄傳統(tǒng)殺毒行業(yè)一直沿用的被動收集和感染后查殺的模式,采用主動檢索,在大數(shù)據(jù)和海量文件的基礎(chǔ)上進(jìn)行挖掘和抽取。這種積極發(fā)現(xiàn)惡意程序并在惡意程序感染和擴(kuò)散之前更新病毒庫的方式,在第一時間阻斷了可能感染用戶的信息渠道。本文采用統(tǒng)一建模的工程方法,以面向?qū)ο蟮乃枷雽ο到y(tǒng)進(jìn)行需求分析和設(shè)計。在系統(tǒng)需求分析章節(jié)我們對總體業(yè)務(wù)系統(tǒng)的流程進(jìn)行了詳細(xì)說明,分析了四大模塊的功能,對模塊與模塊之間的協(xié)作關(guān)系進(jìn)行描述,樣本收集為例,著重研究了對原始網(wǎng)頁的采集,分類及分析過程。從活動圖設(shè)計、類結(jié)構(gòu)設(shè)計、數(shù)據(jù)庫設(shè)計等方面詳細(xì)描述系統(tǒng)的設(shè)計,得到了系統(tǒng)中的設(shè)計類和數(shù)據(jù)庫模型；重點分析了樣本基礎(chǔ)信息庫和網(wǎng)址分類庫的表設(shè)計。全文通過對借助搜索引擎進(jìn)行惡意軟件傳播的傳播方式及特點的分析,有針對性的設(shè)計出一套精準(zhǔn)有效的監(jiān)測和自動查殺系統(tǒng)。在整體上系統(tǒng)使用了典型的C/S架構(gòu)。因為涉及到大量現(xiàn)有的功能平臺,系統(tǒng)使用跨平臺的軟件技術(shù)以兼容和驅(qū)動繁雜的異構(gòu)系統(tǒng),所以我們選用各種平臺無關(guān)的腳本語言開發(fā)主要業(yè)務(wù)邏輯,例如Perl,PHP等。在病毒樣本收集階段,基于虛擬機實現(xiàn)了一個分布式系統(tǒng)檢測環(huán)境。創(chuàng)建并引入惡意對象流的概念,在此基礎(chǔ)上設(shè)計了系統(tǒng)對潛在惡意對象的智能判斷以及自動化處理流程。其中重點介紹了如何基于搜索引擎發(fā)現(xiàn)惡意對象和惡意對象的分類處理,基于惡意對象特征庫,采用支持向量機設(shè)計出一個惡意程序檢測算法,并用實驗實證的方法進(jìn)行數(shù)據(jù)比對,分析該算法的理論可行性和實用性。最后進(jìn)行軟件測試對各項功能進(jìn)行評測。該系統(tǒng)目前在實驗室內(nèi)穩(wěn)定運行,根據(jù)現(xiàn)有的統(tǒng)計數(shù)據(jù)看,系統(tǒng)基本達(dá)到預(yù)期。系統(tǒng)已經(jīng)開始為公司業(yè)務(wù)系統(tǒng)貢獻(xiàn)了很多有價值的惡意程序樣本。
[Abstract]:The malicious object discovery system is a research attempt of Kaspersky Company aimed at the existing sample collection and analysis and processing system, and its direction accords with the basic trend of the future development of antivirus industry. The technology involves many disciplines and industries, and it is a typical system using multidisciplinary knowledge. Such as search engine technology, distributed system parallel processing architecture, machine learning and virtual machine systems. The system abandoned the traditional anti-virus industry has been used passive collection and post-infection kill mode, using active retrieval, on the basis of big data and massive files for mining and extraction. This way of actively detecting malicious programs and updating the virus library before the malicious program infects and spreads blocks the information channel of the possible infected users at the first time. In this paper, the unified modeling engineering method is used to analyze and design the requirements of the system with the idea of object-oriented. In the chapter of system requirement analysis, we explain the flow of the whole business system in detail, analyze the functions of the four modules, describe the cooperative relationship between the modules and the modules, and collect samples as an example. The process of collecting, classifying and analyzing the original web pages is studied emphatically. The design of the system is described in detail from the aspects of activity diagram design, class structure design, database design and so on. The design class and database model of the system are obtained, and the table design of the sample base information base and the URL classification database is analyzed. Based on the analysis of the transmission mode and characteristics of malware spread by search engine, a set of accurate and effective monitoring and automatic killing system is designed in this paper. On the whole, the system uses a typical C / S architecture. Because it involves a large number of existing functional platforms, the system uses cross-platform software technology to compatible and drive complex heterogeneous systems, so we choose various platform-independent scripting languages to develop the main business logic, such as Perl PHP and so on. In the phase of virus sample collection, a distributed system detection environment based on virtual machine is implemented. The concept of malicious object flow is created and introduced. Based on this, the intelligent judgment and automatic processing flow of potentially malicious objects are designed. It focuses on how to classify and process malicious objects and malicious objects based on search engine. Based on the signature library of malicious objects, a malicious program detection algorithm is designed by using support vector machine (SVM). The theoretical feasibility and practicability of the algorithm are analyzed. Finally, software tests are carried out to evaluate the functions. The system is running stably in the laboratory at present. According to the existing statistics, the system basically meets the expectation. The system has begun to contribute a number of valuable samples of malicious programs to the company's business systems.
【學(xué)位授予單位】：山東大學(xué)
【學(xué)位級別】：碩士
【學(xué)位授予年份】：2013
【分類號】：TP311.52;TP391.3

【相似文獻(xiàn)】

相關(guān)期刊論文前10條

1 張繼剛;搜索引擎使用技巧[J];網(wǎng)絡(luò)與信息;1999年09期

2 ;關(guān)鍵詞搜索[J];每周電腦報;2000年38期

3 陳冰;;餓狼一樣的網(wǎng)站提交工具——“提交餓狼”[J];科學(xué)之友;2000年07期

4 許斗;從Google看新一代搜索引擎的發(fā)展趨向[J];蕪湖職業(yè)技術(shù)學(xué)院學(xué)報;2001年01期

5 周毅華;從搜索引擎的分類看其應(yīng)用技巧[J];圖書館理論與實踐;2002年06期

6 鄒小筑;搜索引擎的選擇與使用技巧[J];圖書館學(xué)研究;2002年05期

7 林燕;Google搜索引擎的搜索功能與使用技巧[J];河北科技圖苑;2003年05期

8 林中;GOOGLE搜索引擎的關(guān)鍵詞檢索[J];中國信息導(dǎo)報;2003年03期

9 封劍待封喉;吸星大法“搜”天下笑傲網(wǎng)絡(luò)任我行——搜索引擎絕對專題[J];網(wǎng)絡(luò)與信息;2003年07期

10 閆凡蕾;建設(shè)站內(nèi)搜索的好幫手——Search Engine Maker[J];少年電世界;2003年08期

相關(guān)會議論文前10條

1 彭軻;廖聞劍;;淺析搜索引擎[A];中國通信學(xué)會第五屆學(xué)術(shù)年會論文集[C];2008年

2 李丹;;如何利用搜索引擎查找中醫(yī)藥信息[A];中國中醫(yī)藥信息研究會第二屆理事大會暨學(xué)術(shù)交流會議論文匯編[C];2003年

3 鄧長壽;郭景峰;楊焱林;鄧安遠(yuǎn);;下一代Web搜索引擎初探[A];第十八屆全國數(shù)據(jù)庫學(xué)術(shù)會議論文集（研究報告篇）[C];2001年

4 維尼拉·木沙江;吐爾洪·吾司曼;;維、哈、柯文搜索引擎中網(wǎng)頁爬行器的設(shè)計與實現(xiàn)[A];少數(shù)民族青年自然語言處理技術(shù)研究與進(jìn)展——第三屆全國少數(shù)民族青年自然語言信息處理、第二屆全國多語言知識庫建設(shè)聯(lián)合學(xué)術(shù)研討會論文集[C];2010年

5 湯薇;曾艷;;構(gòu)建校園網(wǎng)搜索引擎必要性分析[A];廣西計算機學(xué)會2008年年會論文集[C];2008年

6 姚樹宇;趙少東;;一種使用分布式技術(shù)的搜索引擎[A];2005年全國開放式分布與并行計算學(xué)術(shù)會議論文集[C];2005年

7 倪俊峰;;基于黃頁搜索引擎的關(guān)鍵字排名廣告系統(tǒng)的設(shè)計與實現(xiàn)[A];2005年中國索引學(xué)會年會暨學(xué)術(shù)研討會論文集[C];2005年

8 張怡;查貴庭;;SEO在信息服務(wù)中的應(yīng)用研究[A];2010年中國索引學(xué)會年會暨學(xué)術(shù)研討會論文集[C];2010年

9 陳援非;何哲;朱珍民;;基于普適計算的個性化搜索技術(shù)[A];第二屆和諧人機環(huán)境聯(lián)合學(xué)術(shù)會議(HHME2006)——第2屆中國普適計算學(xué)術(shù)會議(PCC'06)論文集[C];2006年

10 楊萌;李春麗;朱明;;網(wǎng)絡(luò)搜索技術(shù)下的編輯工作[A];學(xué)報編輯論叢（第十一集）[C];2003年

相關(guān)重要報紙文章前10條

1 李一鑫;搜索排名的紅與黑[N];財經(jīng)時報;2007年

2 周文林;搜狗3.0能否撼動搜索市場[N];經(jīng)濟(jì)參考報;2007年

3 惠正一;比爾·蓋茨:微軟不怕Google[N];第一財經(jīng)日報;2005年

4 賽迪顧問股份有限公司互聯(lián)網(wǎng)與電子商務(wù)咨詢中心常燕杰;搜索，，還是門戶[N];中國計算機報;2005年

5 陳珊;浙江移動推出手機搜索引擎服務(wù)[N];人民郵電;2005年

6 趙法忠;搜索引擎還需悠著點[N];中國經(jīng)營報;2005年

7 金朝力;搜索引擎火拼搜索質(zhì)量[N];北京商報;2006年

8 本報記者　趙曉輝孟昭麗;搜索引擎駛?cè)搿氨茱L(fēng)港”[N];中國證券報;2006年

9 孫t;搜索引擎驚喜侵權(quán)官司止于“避風(fēng)港”？[N];第一財經(jīng)日報;2006年

10 姜蕊;問天下誰識搜索？[N];中國高新技術(shù)產(chǎn)業(yè)導(dǎo)報;2006年

相關(guān)博士學(xué)位論文前10條

1 岑榮偉;基于用戶行為分析的搜索引擎評價研究[D];清華大學(xué);2010年

2 李群;主題搜索引擎聚類算法的研究[D];北京林業(yè)大學(xué);2011年

3 蘇君華;面向搜索引擎的技術(shù)接受模型研究[D];南京大學(xué);2011年

4 劉佐達(dá);分布協(xié)作式搜索引擎模型及算法研究[D];清華大學(xué);2011年

5 陳旭毅;基于索引云的企業(yè)搜索引擎實現(xiàn)研究[D];武漢大學(xué);2011年

6 郭眈;中文互聯(lián)網(wǎng)視頻搜索引擎系統(tǒng)策略研究[D];北京交通大學(xué);2012年

7 王昤璞;基于用戶體驗的互聯(lián)網(wǎng)搜索引擎醫(yī)學(xué)信息檢索可用性評估研究[D];吉林大學(xué);2010年

8 李莎莎;面向搜索引擎的自然語言處理關(guān)鍵技術(shù)研究[D];國防科學(xué)技術(shù)大學(xué);2011年

9 鄭文良;基于簡單本體的農(nóng)業(yè)P2P搜索引擎關(guān)鍵技術(shù)研究[D];沈陽農(nóng)業(yè)大學(xué);2013年

10 白玉琪;空間信息搜索引擎研究[D];中國科學(xué)院研究生院（遙感應(yīng)用研究所）;2003年

相關(guān)碩士學(xué)位論文前10條

1 陳剛;基于行為分析智能推薦購物搜索引擎的設(shè)計與實現(xiàn)[D];北京交通大學(xué);2011年

2 薛云;Internet上元搜索引擎的研究與設(shè)計[D];太原理工大學(xué);2003年

3 王春花;基于Nutch的農(nóng)業(yè)搜索引擎檢索結(jié)果排序策略的研究[D];西北農(nóng)林科技大學(xué);2010年

4 李雷;基于Nutch的農(nóng)業(yè)信息搜索引擎實現(xiàn)和優(yōu)化[D];吉林大學(xué);2011年

5 董晨;基于模糊聚類的個性化搜索引擎的研究[D];福州大學(xué);2005年

6 封俊;基于Hadoop的分布式搜索引擎研究與實現(xiàn)[D];太原理工大學(xué);2010年

7 李浩;分布式教育網(wǎng)信息檢索系統(tǒng)的研究和實現(xiàn)[D];華南理工大學(xué);2010年

8 尉建興;基于Lucene搜索引擎的研究與應(yīng)用[D];太原理工大學(xué);2011年

9 李建平;智能化WEB信息搜索引擎的研究與實現(xiàn)[D];大慶石油學(xué)院;2003年

10 田生偉;基于涉農(nóng)詞典的搜索引擎的研究與實踐[D];新疆大學(xué);2004年

本文編號：2068642

資料下載

論文發(fā)表

支付寶下載

Download by Alipay
微信下載

Download by Wechat
會員下載

Download by Member

本文鏈接：http://sikaile.net/kejilunwen/sousuoyinqinglunwen/2068642.html

上一篇：面向Blog的爬行算法
下一篇：一個基于移動Agent的信息檢索系統(tǒng)

論文發(fā)表

·知網(wǎng)|萬方|維普|龍源|省級|國家級|科技核心|北大核心|南大核心CSSCI|EI|SCI|SSCI|

天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

基于搜索引擎的惡意對象發(fā)掘系統(tǒng)的設(shè)計與開發(fā)