具備web數(shù)據(jù)整合功能的負載均衡系統(tǒng)設(shè)計與實現(xiàn)
[Abstract]:With the rapid development of Internet technology, the scale of network data has expanded dramatically, such as social networks, such as Twitter can generate billions of tweets every two and a half days, deal with nearly 2TB data every day, e-commerce, For example, Alipay pays a maximum of 188 million payments a day, Google, the search engine's representative, handles 20 PBs a day, and Gillard's law indicates that network traffic will continue to swell. Not only the amount of data of network flow expands rapidly, but the form of network flow is becoming more and more complicated. With the development and popularization of CDN technology and multi-source download technology, the form of network flow is no longer the simple connection form in the past. It brings great challenges to network flow filtering. At present, Akamai manufacturers account for 40 percent of global traffic. YouTube uses multi-source download technology, and its traffic accounts for 30 percent of North American traffic. At present, DPI system which filters mass data streams generally uses distributed multilevel parallel processing to analyze and detect network flows. Distributed clusters need an efficient load balancing system to distribute traffic. This paper focuses on the research of web data integration, data flow, traffic scheduling and a series of key technology research and system implementation of load balancing system in front of DPI system. Due to the extensive use of CDN technology and multi-source download technology, the sessions in network traffic may be composed of multiple connections, and often the DPI system needs to split the whole session to a single back-end machine to complete the analysis. The existing load balancing technology does not have the function of keeping the session intact. In order to solve this problem, the web data integration function in this paper clustered the server IP under the same domain name into a IP cluster and split the IP cluster as the shunt unit. Thus, the traffic of the same content provider is aggregated to the same back-end machine for analysis. On the one hand, the problem of data session integrity is solved; on the other hand, the content of the same content provider will be accessed by many users. If the traffic of the same content visited by each user is analyzed once by the back-end machine, it will result in a large number of repetitive calculations, resulting in a waste of valuable computing resources, and the integration of web data. The load balancing system can remove redundant data and reduce a large number of repetitive calculations in DPI system, thus saving computing resources and improving the throughput of DPI system. Data streaming and traffic scheduling is based on the integration of web data, and the IP cluster is used as the traffic scheduling unit to balance the load according to the feedback of the DPI system. With the change of the backend machine load, the IP cluster is split and merged in real time. Thus balancing the load of each backend machine.
【學位授予單位】:北京郵電大學
【學位級別】:碩士
【學位授予年份】:2014
【分類號】:TP393.06
【參考文獻】
相關(guān)期刊論文 前10條
1 蔣江,張民選,廖湘科;基于多種資源的負載平衡算法的研究[J];電子學報;2002年08期
2 楊際祥;譚國真;王榮生;;并行與分布式計算動態(tài)負載均衡策略綜述[J];電子學報;2010年05期
3 陳訓遜,方濱興,李蕾;高速網(wǎng)絡(luò)環(huán)境下入侵檢測系統(tǒng)結(jié)構(gòu)研究[J];計算機研究與發(fā)展;2004年09期
4 班志杰;古志民;金瑜;;Web預取技術(shù)綜述[J];計算機研究與發(fā)展;2009年02期
5 陳志剛,許偉,曾志文;一種基于預測的動態(tài)負載均衡模型及算法研究[J];計算機工程;2004年23期
6 李清玉,韋衛(wèi),侯紫峰;防火墻集群動態(tài)自適應(yīng)負載均衡算法設(shè)計與實現(xiàn)[J];計算機工程;2005年01期
7 孫欽東,張德運,高鵬,劉剛;基于動態(tài)負載均衡的分層式高速網(wǎng)絡(luò)入侵檢測模型[J];計算機工程;2005年12期
8 王成;劉金剛;;一種改進的字符串匹配算法[J];計算機工程;2006年02期
9 袁志堅;陳穎文;繆嘉嘉;賈焰;楊樹強;;典型Bloom過濾器的研究及其數(shù)據(jù)流應(yīng)用[J];計算機工程;2009年07期
10 嵇智輝;倪宏;劉磊;;動態(tài)消息隊列負載均衡策略的研究與應(yīng)用[J];計算機工程;2009年08期
,本文編號:2200753
本文鏈接:http://sikaile.net/kejilunwen/sousuoyinqinglunwen/2200753.html