Research on the Makespan Minimization Problem for Tasks on GPUs
Published: 2018-10-25 19:49
【Abstract】: With the rapid development of graphics processing unit (GPU) technology, GPUs now offer a high degree of parallelism and flexible programmability, which has made them the subject of extensive research and application in general-purpose computing and parallel processing. As a new class of computing device, the GPU merits in-depth study.

Users of a GPU are typically most concerned with the total completion time of all tasks on the GPU's resources. For a set of tasks, the makespan on a GPU device is the total time from the start of execution until every task has finished. Two questions have so far received little attention in the literature: how to schedule a task set across the GPU's multiple stream processors so as to minimize the makespan, and how to compute the completion time of the tasks on a single stream processor. This thesis proposes solutions to both problems, which is of great significance for improving GPU resource utilization.

Based on the main characteristics of the GPU architecture, the thesis first builds a makespan-minimization model and proposes a scheduling algorithm that minimizes the total makespan of a task set across the GPU's multiple stream processors; it proves theoretically that in the worst case the algorithm's result is at most twice the optimum. In addition, for the GPU architectures common today, an improved algorithm for the two-stream-processor case is proposed, and experiments confirm that it is more efficient.
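The abstract does not reproduce the scheduling algorithm itself. A natural candidate with exactly the stated worst-case guarantee is Graham-style greedy list scheduling, which assigns each incoming task to the currently least-loaded processor and is provably within a factor of (2 - 1/m) of the optimal makespan on m identical processors. A minimal sketch under that assumption (the function name and task representation are illustrative, not taken from the thesis):

```python
import heapq

def greedy_list_schedule(task_times, num_processors):
    """Assign each task to the currently least-loaded stream processor.

    Graham's classic analysis shows the resulting makespan is at most
    (2 - 1/m) times optimal on m identical processors, matching the
    2x worst-case bound stated in the abstract.
    """
    # Min-heap of (current_load, processor_index) pairs.
    loads = [(0.0, p) for p in range(num_processors)]
    heapq.heapify(loads)
    assignment = [[] for _ in range(num_processors)]
    for task, t in enumerate(task_times):
        load, p = heapq.heappop(loads)        # least-loaded processor
        assignment[p].append(task)
        heapq.heappush(loads, (load + t, p))
    makespan = max(load for load, _ in loads)
    return makespan, assignment

# Example: six tasks on two stream processors.
makespan, plan = greedy_list_schedule([5, 3, 8, 2, 4, 6], 2)
print(makespan, plan)  # 17.0, though the optimal split here is 14 / 14
```

For the two-stream-processor case, one well-known refinement is to sort the tasks in decreasing order before the greedy pass (LPT scheduling), which tightens the worst-case ratio to 7/6 for m = 2; whether the thesis's improved algorithm takes this route is not stated in the abstract.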
Furthermore, exploiting the fact that a task on a stream processor is executed as multiple sub-tasks in parallel, the thesis proposes three methods for computing a task's completion time. The first is a pessimistic method that computes a theoretical upper bound on what an actual instance can reach. The second formulates the problem as a system of constraints solved by two-dimensional linear programming, yielding exact results. The third combines the first two into an optimized method for tractable instances.

In the simulation experiments, comparative tests were designed for the proposed scheduling algorithms and computation methods. The results show that the multi-stream-processor scheduling algorithm achieves good makespan results, and the improved algorithm for the two-stream-processor environment outperforms it. For computing task completion times on a stream processor, the two-dimensional linear programming method is highly accurate, but its running time grows substantially once the problem reaches a certain scale; the optimized method for tractable instances balances accuracy and speed.
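The three computation methods are only named here, so any code can only illustrate the general idea. Under a simple assumed model (a task split into sub-tasks of known durations, running on a stream processor with `lanes` parallel execution lanes, with sub-task work divisible across lanes), a pessimistic upper bound and an exact LP-based value can be sketched as follows; the model, the names, and the one-variable LP are illustrative stand-ins, not the thesis's two-dimensional formulation:

```python
from scipy.optimize import linprog

def pessimistic_completion(sub_task_times, lanes):
    """Pessimistic upper bound: any greedy (list) schedule of the
    sub-tasks finishes within sum(p)/lanes + max(p)."""
    return sum(sub_task_times) / lanes + max(sub_task_times)

def lp_completion(sub_task_times, lanes):
    """Exact completion time under the divisible-work model, as an LP:
    minimize T subject to T >= p_i for each sub-task and
    lanes * T >= sum(p_i)."""
    # One decision variable T; constraints in A_ub @ x <= b_ub form.
    A_ub = [[-1.0] for _ in sub_task_times] + [[-float(lanes)]]
    b_ub = [-p for p in sub_task_times] + [-sum(sub_task_times)]
    res = linprog(c=[1.0], A_ub=A_ub, b_ub=b_ub, bounds=[(0, None)])
    return res.x[0]

sub_tasks = [4.0, 7.0, 3.0, 6.0]
print(pessimistic_completion(sub_tasks, lanes=2))  # upper bound: 17.0
print(lp_completion(sub_tasks, lanes=2))           # exact value: 10.0
```

The gap between the two outputs mirrors the trade-off reported in the experiments: the bound is cheap but loose, while the LP is exact but costs a solver call per task. This suggests how a combined method might use the cheap bound before resorting to the LP, though the thesis's actual combination is not described in this abstract.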
【Degree-granting institution】: Northeastern University
【Degree level】: Master's
【Year awarded】: 2012
【Classification number】: TP332
Article ID: 2294657
Link: http://sikaile.net/kejilunwen/jisuanjikexuelunwen/2294657.html