天堂国产午夜亚洲专区-少妇人妻综合久久蜜臀-国产成人户外露出视频在线-国产91传媒一区二区三区

當(dāng)前位置:主頁 > 科技論文 > 計(jì)算機(jī)論文 >

面向多核系統(tǒng)的科學(xué)計(jì)算核心算法并行化研究

發(fā)布時(shí)間:2018-08-23 19:56
【摘要】:加速未來大規(guī)模科學(xué)計(jì)算的一種趨勢(shì)是使用異構(gòu)多核/眾核系統(tǒng)。然而,相對(duì)于硬件系統(tǒng)的飛速發(fā)展,軟件并行編程模型,特別是針對(duì)異構(gòu)多核平臺(tái)的并行模型發(fā)展相對(duì)滯后。如何在異構(gòu)多核環(huán)境下充分利用硬件提供的并行計(jì)算能力,提高并行計(jì)算執(zhí)行效率,成為當(dāng)前并行編程工作的首要任務(wù)。 為解決這一問題,本文提出了一種適用于異構(gòu)多核系統(tǒng)的并行計(jì)算模型MS-BSP,與傳統(tǒng)的通用BSP并行計(jì)算模型相比,可以更好地反映不同類型的任務(wù)分配到不同類型的處理器核并行處理的特征,指導(dǎo)在此類異構(gòu)多核系統(tǒng)上的并行科學(xué)計(jì)算算法的設(shè)計(jì)和分析。在此種模型下,本文提出科學(xué)計(jì)算并行化編程框架。與IBM的Cell和Nvidia的CUDA架構(gòu)下復(fù)雜的編程方式相比,MS-BSP模型下的編程方式將多線程的核函數(shù)映射工作交由系統(tǒng)自行完成,減少了開發(fā)人員對(duì)存儲(chǔ)單元和同步機(jī)制的繁瑣的顯式操作,方便了編程。最后,本文在RED平臺(tái)上按照MPI規(guī)范實(shí)現(xiàn)了并行編程與操作系統(tǒng)的接口,完成了對(duì)MPI函數(shù)的兼容,提高了所提出并行編程模型的可移植性。 在此套并行化框架指引下,將科學(xué)計(jì)算應(yīng)用領(lǐng)域中的六種核心算法進(jìn)行并行化設(shè)計(jì)和優(yōu)化,并在“浙大數(shù)芯”實(shí)驗(yàn)室設(shè)計(jì)開發(fā)的RED片上多核平臺(tái)和IBM的成熟商業(yè)處理器Cell平臺(tái)上進(jìn)行實(shí)現(xiàn)和對(duì)比評(píng)估,驗(yàn)證了我們提出的并行計(jì)算模型的實(shí)用性以及高效性,最終六個(gè)算法在兩個(gè)平臺(tái)上都達(dá)到了較高性能。由于MS-BSP模型在RED平臺(tái)上針對(duì)其主從式異構(gòu)多核架構(gòu)進(jìn)行優(yōu)化,使得任務(wù)調(diào)度開銷顯著減小,其實(shí)現(xiàn)效率(效率定義為并行加速比與實(shí)際加速核數(shù)目的比值)不低于75.67%,而在已有的Cell平臺(tái)上,其實(shí)現(xiàn)效率不低于63.91%。
[Abstract]:However, compared with the rapid development of hardware systems, the development of software parallel programming models, especially for heterogeneous multi-core platforms, is lagging behind. How to make full use of the parallel computing capabilities provided by hardware in heterogeneous multi-core environments? Improving the efficiency of parallel computing has become the primary task of parallel programming.
To solve this problem, this paper proposes a parallel computing model MS-BSP for heterogeneous multi-core systems. Compared with the traditional BSP parallel computing model, it can better reflect the characteristics of different types of tasks assigned to different types of processor cores for parallel processing, and guide the parallel scientific design on such heterogeneous multi-core systems. In this model, a parallel programming framework for scientific computing is proposed. Comparing with the complex programming methods of Clell and Nvidia's CUDA architecture of IBM, the programming method of MS-BSP model transfers the multi-threaded kernel function mapping to the system itself, which reduces the number of developers working on storage units and synchronizers. Finally, this paper implements the interface between parallel programming and operating system according to MPI specification on RED platform, completes the compatibility of MPI functions and improves the portability of the proposed parallel programming model.
Under the guidance of this parallelization framework, six core algorithms in the field of scientific computing applications are designed and optimized in parallel, and implemented and compared on the RED chip multi-core platform designed and developed by Zhejiang University Digital Core Laboratory and the mature commercial processor Cell platform of IBM. The proposed parallel computing model is validated. As the MS-BSP model is optimized for its master-slave heterogeneous multi-core architecture on the RED platform, the task scheduling overhead is significantly reduced, and the implementation efficiency (defined as the ratio of parallel acceleration ratio to the actual number of accelerated cores) is no less than 75%. .67%, and on the existing Cell platform, the actual efficiency is no less than 63.91%.
【學(xué)位授予單位】:浙江大學(xué)
【學(xué)位級(jí)別】:碩士
【學(xué)位授予年份】:2012
【分類號(hào)】:TP332

【參考文獻(xiàn)】

相關(guān)期刊論文 前4條

1 李仁發(fā);劉彥;徐成;;多處理器片上系統(tǒng)任務(wù)調(diào)度研究進(jìn)展評(píng)述[J];計(jì)算機(jī)研究與發(fā)展;2008年09期

2 李春江;楊學(xué)軍;;主從式單邊異構(gòu)多核處理器編程模型和編譯架構(gòu)[J];計(jì)算機(jī)工程與科學(xué);2009年08期

3 陳芳園;張冬松;王志英;;異構(gòu)多核處理器體系結(jié)構(gòu)設(shè)計(jì)研究[J];計(jì)算機(jī)工程與科學(xué);2011年12期

4 謝向輝;胡蘇太;李宏亮;;多核處理器及其對(duì)系統(tǒng)結(jié)構(gòu)設(shè)計(jì)的影響[J];計(jì)算機(jī)科學(xué)與探索;2008年06期

相關(guān)博士學(xué)位論文 前3條

1 顧雄禮;片上多處理器關(guān)鍵技術(shù)研究[D];浙江大學(xué);2011年

2 高豐;基于SOC的實(shí)時(shí)操作系統(tǒng)的研究[D];浙江大學(xué);2002年

3 岳虹;嵌入式異構(gòu)多核處理器設(shè)計(jì)與實(shí)現(xiàn)關(guān)鍵技術(shù)研究[D];國防科學(xué)技術(shù)大學(xué);2006年

,

本文編號(hào):2199751

資料下載
論文發(fā)表

本文鏈接:http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2199751.html


Copyright(c)文論論文網(wǎng)All Rights Reserved | 網(wǎng)站地圖 |

版權(quán)申明:資料由用戶b81a0***提供,本站僅收錄摘要或目錄,作者需要?jiǎng)h除請(qǐng)E-mail郵箱bigeng88@qq.com