Research on Fault-Tolerant Techniques for New-Generation High-Speed Interconnection Networks
Published: 2018-10-29 10:44
[Abstract]: In the development of high-performance computer systems, the steadily increasing speed of multi-core processors has made inter-processor data transfer an increasingly prominent bottleneck. High-frequency transmission technology improves transmission performance but reduces inherent reliability at the component level. At the same time, the growing scale of high-performance computer systems and the growing number of components in their high-speed interconnection networks further reduce inherent reliability at the system level. Improving system availability with efficient, simple fault-tolerant techniques has therefore become an important topic in high-speed interconnection network research. After successfully developing the interconnection network of the TH-1A high-performance computer system, the research group studied several key technologies for a new-generation high-speed interconnection network, in response to the requirements of further high-performance system development and to trends in China and abroad.
Against this background, this thesis focuses on fault-tolerant techniques for high-speed interconnection networks. The main work is as follows:
1. Taking current mainstream high-performance computer systems as examples, the thesis analyzes the topology, routing algorithms, and flow-control strategies of their interconnection networks, and summarizes the fault-tolerant techniques these systems use.
2. For the router design of the new-generation high-speed interconnection network, it proposes a microarchitecture that supports adaptive routing strategies and can accommodate multiple topologies and routing algorithms.
3. For the network structure design of the new-generation high-speed interconnection network, it targets a high-dimensional topology that combines 3D-Torus and fully connected networks, and proposes two adaptive fault-tolerant routing algorithms: one based on channel ordering and one based on escape channels.
4. It analyzes the InfiniBand network management protocol and studies the algorithms and network management protocols needed for topology discovery, route computation, and path distribution in the new-generation high-speed interconnection network.
5. On the OMNeT++ platform, it implements the corresponding router structure, interconnection network topology, fault-tolerant routing algorithms, and subnet management protocol, and carries out behavioral-level simulation and performance evaluation of the system.
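The abstract names escape-channel-based adaptive fault-tolerant routing but does not describe the algorithm itself. The following is only a minimal illustrative sketch of the general idea on a plain 3D torus, not the thesis's method. All names (`torus_delta`, `select_port`, `route`, the `faulty` and `busy` sets) are hypothetical. It assumes minimal routing, treats the escape channel as dimension-order routing, and does not model the fully connected part of the hybrid topology, virtual-channel datelines, or misrouting around faults.

```python
def torus_delta(src, dst, size):
    """Signed shortest offset from src to dst on a ring of `size` nodes."""
    d = (dst - src) % size
    return d if d <= size // 2 else d - size


def select_port(cur, dst, dims, faulty=frozenset(), busy=frozenset()):
    """Pick an output port for one hop, or None if the packet is blocked.

    A port is (dimension, direction) with direction +1 or -1.
    `faulty` holds (node, port) pairs whose link is broken (assumption).
    `busy` holds (node, port) pairs whose adaptive channel is occupied.
    Returns (port, "adaptive") or (port, "escape").
    """
    # Productive ports move the packet closer to dst (minimal routing).
    productive = []
    for dim, size in enumerate(dims):
        delta = torus_delta(cur[dim], dst[dim], size)
        if delta != 0:
            productive.append((dim, 1 if delta > 0 else -1))
    # Adaptive choice: any healthy productive port with a free channel.
    for port in productive:
        if (cur, port) not in faulty and (cur, port) not in busy:
            return port, "adaptive"
    # Escape choice: dimension-order port, usable only if its link is healthy.
    if productive and (cur, productive[0]) not in faulty:
        return productive[0], "escape"
    return None


def route(src, dst, dims, faulty=frozenset(), busy=frozenset()):
    """Return the list of nodes visited from src to dst, or None if blocked."""
    path, cur = [src], src
    while cur != dst:
        choice = select_port(cur, dst, dims, faulty, busy)
        if choice is None:
            return None
        (dim, direction), _channel = choice
        nxt = list(cur)
        nxt[dim] = (cur[dim] + direction) % dims[dim]
        cur = tuple(nxt)
        path.append(cur)
    return path
```

In this sketch a packet prefers any healthy, free adaptive channel and falls back to the dimension-order escape channel only when every adaptive option is occupied. A packet whose productive links are all faulty is reported as blocked, which a real fault-tolerant algorithm would handle by routing around the fault.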
[Degree-granting institution]: National University of Defense Technology
[Degree level]: Master's
[Year degree conferred]: 2013
[Classification number]: TP302.8
Article ID: 2297471
Article link: http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2297471.html