Research on SDN Load Balancing Based on Two-Level Elephant Flow Identification
Published: 2019-06-20 09:18
[Abstract]: With the rise of cloud computing, big data, and similar services, the scale of data center networks and the number of their users have grown rapidly, and network traffic has increased explosively. Traditional data center networks built around the TCP/IP architecture struggle to manage this volume of traffic efficiently: congestion worsens and network resources are badly wasted. How to balance data center network load so as to improve data transmission efficiency has become an urgent problem. In recent years, Software Defined Networking (SDN), characterized by the separation of control and forwarding, has attracted wide attention in the IT community for its strong traffic management capability. A logically centralized SDN controller can schedule traffic efficiently and at fine granularity based on a global view of the network, which gives it a considerable advantage over traditional TCP/IP-centered networks in traffic management. This thesis studies load imbalance in SDN data center networks from the perspective of elephant flow identification.
First, to address the high identification overhead of existing elephant flow identification methods, a two-level elephant flow identification method is proposed. Because elephant flows carry a large amount of data, the first stage uses the Suspicious Elephant Detection based on Write Queue (SED-WQ) algorithm, which monitors the amount of data buffered in the TCP send queue on the host to identify suspicious elephant flows. This filters out mice flows with small data volumes and reduces the controller's processing overhead in the second stage. Because elephant flows also last a long time, the second stage uses the Real Elephant Detection based on Duration Time (RED-DT) algorithm, which monitors the duration of suspicious elephant flows on the network side to identify real elephant flows. Suspicious flows that do not meet the duration condition are discarded, which improves identification accuracy.
Second, to address unbalanced link load in SDN data centers, an SDN load balancing strategy built on the two-level identification, Elephant Load Balancing (ELB), is proposed. For elephant flows, ELB uses a scheduling algorithm based on uniform distribution: the SDN controller plans the best forwarding path for each elephant flow dynamically and at fine granularity, which preserves the strategy's traffic management efficiency. For mice flows, ELB uses a scheduling algorithm based on random path selection: the SDN controller chooses forwarding paths statically and coarsely, which reduces the controller processing overhead of the strategy.
Finally, the two-level identification method and the ELB strategy are simulated and analyzed with Mininet. The experiments show that, while maintaining high identification accuracy, the two-level method reduces controller identification overhead by about 85% compared with sampling-based elephant flow identification. While keeping flow identification overhead low, ELB reduces average transmission delay by about 10% and raises average link utilization by about 5% compared with the traditional load balancing strategy based on Equal Cost Multipath Routing (ECMP).
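The two-level identification and the per-class path choice described in the abstract can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the thesis's implementation: the threshold values, the `Flow` fields, the function names, and the reading of "uniform distribution" as "place each elephant flow on the path that currently carries the fewest elephant flows" are all hypothetical, since the abstract gives none of these details.

```python
import random
from dataclasses import dataclass

# Hypothetical thresholds; the abstract does not state the values used.
QUEUE_BYTES_THRESHOLD = 100 * 1024  # stage 1: bytes buffered in the TCP send queue
DURATION_THRESHOLD_S = 1.0          # stage 2: seconds the flow has been active


@dataclass
class Flow:
    flow_id: str
    send_queue_bytes: int  # observed on the host (stage 1 input)
    duration_s: float      # observed in the network (stage 2 input)


def is_suspicious(flow: Flow) -> bool:
    """Stage 1 (SED-WQ-like): host-side filter on send-queue backlog."""
    return flow.send_queue_bytes >= QUEUE_BYTES_THRESHOLD


def is_real_elephant(flow: Flow) -> bool:
    """Stage 2 (RED-DT-like): only suspicious flows are checked for duration."""
    return is_suspicious(flow) and flow.duration_s >= DURATION_THRESHOLD_S


def pick_path(flow: Flow, elephants_per_path: dict, rng: random.Random) -> str:
    """Choose a forwarding path for one flow.

    Elephant flows go to the path with the fewest elephants so far
    (an assumed reading of "uniform distribution"); mice flows get a
    random path, which needs no per-flow state in the controller.
    """
    if is_real_elephant(flow):
        path = min(elephants_per_path, key=elephants_per_path.get)
        elephants_per_path[path] += 1
        return path
    return rng.choice(sorted(elephants_per_path))


if __name__ == "__main__":
    rng = random.Random(0)
    load = {"path-1": 0, "path-2": 0}
    flows = [
        Flow("backup", 512 * 1024, 12.0),  # large backlog, long-lived
        Flow("burst", 512 * 1024, 0.2),    # large backlog, short-lived
        Flow("rpc", 2 * 1024, 30.0),       # small backlog
    ]
    for f in flows:
        print(f.flow_id, is_real_elephant(f), pick_path(f, load, rng))
```

Only flows that pass the cheap host-side check reach the duration check, which is the source of the controller-side savings the abstract reports. In this sketch the short-lived `burst` flow passes stage 1 but is rejected in stage 2 and is then routed like a mouse flow.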
[Degree-granting institution]: Hangzhou Dianzi University
[Degree level]: Master's
[Year degree awarded]: 2017
[Classification number]: TP393.02
Article ID: 2503106
Article link: http://sikaile.net/guanlilunwen/ydhl/2503106.html