Research and Implementation of a Small Linux-Based Cluster

Published: 2018-05-02 12:11

Topic: parallel computing + microcomputer cluster; source: master's thesis, Nanjing University of Posts and Telecommunications, 2013

[Abstract]: Computation on a traditional single processor can no longer meet the demand for computing power, while high-performance computers based on vector processors and symmetric multiprocessors suffer from poor scalability, high cost, and low overall performance. To address these problems, this thesis examines the theory, techniques, and methods of cluster systems. Such systems are easy to build and offer good scalability, good availability, and a high performance-to-price ratio, which has made them popular across commercial and scientific fields. A cluster is built under Linux from an ordinary Ethernet switch and four PCs; its software modules include an NFS shared file system, SSH remote login, the MPI parallel programming library, and the PBS job management system. Using the C+MPI programming model, with serial and parallel algorithms for computing the value of pi as examples, the parallelism of the cluster is verified by comparing program run times across different numbers of nodes and processors. An algorithm's degree of parallelism is readily affected by how the matrix is partitioned. The two common approaches are striped (banded) partitioning and checkerboard partitioning; their run times and speedups are analyzed and compared on a matrix multiplication example, and the experimental results show that checkerboard partitioning exposes a higher degree of parallelism.

HPL is the benchmark of choice for measuring the floating-point performance of a cluster. By adjusting the matrix size, the block size of the LU decomposition, the processor grid dimensions, and the number of participating nodes in HPL.dat, the thesis summarizes principles for selecting HPL test parameters, optimizes the performance of the cluster, and measures a best computing speed of 13.61 Gflops.
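The parallel pi calculation mentioned in the abstract can be illustrated with a small sketch. The thesis does not give its algorithm here, so this assumes the classic approach of integrating 4/(1+x^2) over [0, 1] with the midpoint rule and dealing the subintervals out to ranks cyclically. It is written in plain Python that emulates the ranks in a loop rather than in the thesis's C+MPI; the names `partial_pi` and `estimate_pi` are hypothetical.

```python
import math


def partial_pi(rank: int, size: int, n: int) -> float:
    """Midpoint-rule contribution of one rank to the integral of 4/(1+x^2) on [0, 1].

    Assumption: subintervals are dealt out cyclically, so rank r handles
    intervals r, r + size, r + 2*size, ... out of n in total.
    """
    h = 1.0 / n
    total = 0.0
    for i in range(rank, n, size):
        x = h * (i + 0.5)  # midpoint of subinterval i
        total += 4.0 / (1.0 + x * x)
    return h * total


def estimate_pi(size: int, n: int) -> float:
    """Sum the partial results of all ranks.

    In an MPI program each process would compute only its own partial sum,
    and a sum reduction (for example MPI_Reduce with MPI_SUM) would combine them.
    """
    return sum(partial_pi(rank, size, n) for rank in range(size))


if __name__ == "__main__":
    n = 100_000  # number of subintervals, chosen arbitrarily for illustration
    for size in (1, 2, 4):
        approx = estimate_pi(size, n)
        print(f"ranks={size}  pi~{approx:.10f}  error={abs(approx - math.pi):.2e}")
```

Because each rank touches only about n/size subintervals, the work per process shrinks as ranks are added while the combined result stays the same up to rounding. This is the property that a timing comparison across node and processor counts exercises.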
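[Degree-granting institution]: Nanjing University of Posts and Telecommunications
[Degree level]: Master's
[Year awarded]: 2013
[Classification numbers]: TP338; TP316.81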


Article ID: 1833852


Article link: http://sikaile.net/kejilunwen/jisuanjikexuelunwen/1833852.html
